Interface: IngestionConfig

Defined in: packages/agentos/src/cognition/memory/io/facade/types.ts:136

Controls how documents are split into chunks before being stored and indexed.

Properties

chunkOverlap?

optional chunkOverlap: number

Defined in: packages/agentos/src/cognition/memory/io/facade/types.ts:158

Overlap between consecutive chunks in tokens/characters. Prevents context loss at chunk boundaries.

Default

chunkSize?

optional chunkSize: number

Defined in: packages/agentos/src/cognition/memory/io/facade/types.ts:151

Target token/character count for each chunk.

Default

chunkStrategy?

optional chunkStrategy: "fixed" | "semantic" | "hierarchical" | "layout"

Defined in: packages/agentos/src/cognition/memory/io/facade/types.ts:145

Strategy for splitting a document into indexable chunks.

'fixed' – split at a fixed token/character count.
'semantic' – split at semantic boundaries (paragraphs, sections).
'hierarchical'– build a tree of coarse → fine chunks (good for Q&A).
'layout' – preserve the visual layout of the source (PDF columns etc.).

Default

'semantic'

doclingEnabled?

optional doclingEnabled: boolean

Defined in: packages/agentos/src/cognition/memory/io/facade/types.ts:179

Whether to use the Docling library for high-fidelity PDF/DOCX parsing. When false, a simpler text-extraction path is used.

Default

false

extractImages?

optional extractImages: boolean

Defined in: packages/agentos/src/cognition/memory/io/facade/types.ts:165

Whether to extract embedded images from documents (PDF, DOCX, etc.). Extracted images are stored as ExtractedImage objects.

Default

false

ocrEnabled?

optional ocrEnabled: boolean

Defined in: packages/agentos/src/cognition/memory/io/facade/types.ts:172

Whether to run Optical Character Recognition on extracted images. Requires extractImages: true.

Default

false

visionLlm?

optional visionLlm: string

Defined in: packages/agentos/src/cognition/memory/io/facade/types.ts:186

Vision-capable LLM model identifier used to caption extracted images. Only consulted when extractImages: true.

Example

'gpt-4o'

Properties​

chunkOverlap?​

Default​

chunkSize?​

Default​

chunkStrategy?​

Default​

doclingEnabled?​

Default​

extractImages?​

Default​

ocrEnabled?​

Default​

visionLlm?​

Example​

Properties

chunkOverlap?

Default

chunkSize?

Default

chunkStrategy?

Default

doclingEnabled?

Default

extractImages?

Default

ocrEnabled?

Default

visionLlm?

Example