Module: Documentrix::Documents::Splitters

Defined in:
lib/documentrix/documents.rb,
lib/documentrix/documents/splitters/semantic.rb,
lib/documentrix/documents/splitters/character.rb

Overview

Module for text splitting operations in Documentrix

This module provides functionality for splitting text into smaller chunks using various strategies. It includes both simple character-based splitting and more sophisticated semantic splitting that considers the meaning and structure of the text when determining split points.

The splitters are designed to work with the Documentrix::Documents class to prepare text data for embedding and storage in vector databases.

Defined Under Namespace

Modules: Common Classes: Character, RecursiveCharacter, Semantic