Interface for splitting text into chunks.

| Name | Type | Description |
|---|---|---|
| `chunk_size` | `int` | Maximum size of chunks to return. Default: `4000` |
| `chunk_overlap` | `int` | Overlap in characters between chunks. Default: `200` |
| `length_function` | `Callable[[str], int]` | Function that measures the length of given chunks. Default: `len` |
| `keep_separator` | `bool \| Literal['start', 'end']` | Whether to keep the separator and where to place it in each corresponding chunk (`True` = `'start'`). Default: `False` |
| `add_start_index` | `bool` | If `True`, includes chunk's start index in metadata. Default: `False` |
| `strip_whitespace` | `bool` | If `True`, strips whitespace from the start and end of every document. Default: `True` |
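
To make the interaction between these parameters concrete, here is a minimal pure-Python sketch. It is not the library's implementation: the function name `sliding_chunks`, the sliding-window strategy, and the returned dictionary shape are assumptions made for illustration, and `keep_separator` is left out because this sketch does not split on separators.

```python
from typing import Callable, Dict, List


def sliding_chunks(
    text: str,
    chunk_size: int = 4000,
    chunk_overlap: int = 200,
    length_function: Callable[[str], int] = len,
    add_start_index: bool = False,
    strip_whitespace: bool = True,
) -> List[Dict[str, object]]:
    """Hypothetical window-based splitter that mirrors the parameters above."""
    if chunk_overlap >= chunk_size:
        raise ValueError("chunk_overlap must be smaller than chunk_size")

    chunks: List[Dict[str, object]] = []
    start = 0
    while start < len(text):
        # Grow the window while its measured length still fits in chunk_size.
        end = start + 1
        while end < len(text) and length_function(text[start:end + 1]) <= chunk_size:
            end += 1

        piece = text[start:end]
        index = start
        if strip_whitespace:
            # Shift the reported index past any leading whitespace we drop.
            index += len(piece) - len(piece.lstrip())
            piece = piece.strip()

        if piece:
            metadata = {"start_index": index} if add_start_index else {}
            chunks.append({"text": piece, "metadata": metadata})

        if end >= len(text):
            break
        # Step back by chunk_overlap characters, but always move forward.
        start = max(end - chunk_overlap, start + 1)
    return chunks


if __name__ == "__main__":
    for chunk in sliding_chunks(
        "abcdefghijklmnopqrstuvwxyz",
        chunk_size=10,
        chunk_overlap=3,
        add_start_index=True,
    ):
        print(chunk)
```

In this sketch `chunk_size` is measured with `length_function`, while `chunk_overlap` is counted in characters, matching the descriptions in the table. Passing a token counter as `length_function` would cap chunks by token count instead of character count.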