Interface: RagRetrievalOptions
Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:141
Options controlling retrieval behavior.
Properties
includeEmbeddings?
optionalincludeEmbeddings:boolean
Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:188
Include chunk embeddings in the response.
metadataFilter?
optionalmetadataFilter:MetadataFilter
Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:149
Metadata filter applied at the vector-store layer.
queryEmbeddingModelId?
optionalqueryEmbeddingModelId:string
Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:190
Query embedding model override.
rerankerConfig?
optionalrerankerConfig:object
Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:171
Cross-encoder reranking configuration.
When enabled, retrieved chunks are re-scored using a cross-encoder model for improved relevance ranking. Disabled by default due to added latency.
Recommended use cases:
- Background analysis tasks (accuracy over speed)
- Batch processing (no user waiting)
- Knowledge-intensive tasks (reduces hallucination)
NOT recommended for real-time chat (latency sensitive).
enabled?
optionalenabled:boolean
Enable cross-encoder reranking. Default: false
maxDocuments?
optionalmaxDocuments:number
Max documents to send to reranker (limits cost/latency). Default: 100
modelId?
optionalmodelId:string
Reranker model ID (e.g., 'rerank-english-v3.0', 'cross-encoder/ms-marco-MiniLM-L-6-v2')
params?
optionalparams:Record<string,any>
Provider-specific parameters
providerId?
optionalproviderId:string
Provider ID ('cohere', 'local')
timeoutMs?
optionaltimeoutMs:number
Request timeout in ms. Default: 30000
topN?
optionaltopN:number
Number of top results to return after reranking
strategy?
optionalstrategy:"hybrid"|"similarity"|"mmr"
Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:151
Retrieval strategy (defaults to similarity search).
strategyParams?
optionalstrategyParams:object
Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:153
Strategy-specific parameters (MMR lambda, hybrid alpha, etc.).
custom?
optionalcustom:Record<string,any>
hybridAlpha?
optionalhybridAlpha:number
mmrLambda?
optionalmmrLambda:number
targetDataSourceIds?
optionaltargetDataSourceIds:string[]
Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:145
Set of explicit data sources to query.
targetMemoryCategories?
optionaltargetMemoryCategories:RagMemoryCategory[]
Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:147
Memory categories to consult (maps to data sources via config).
tokenBudgetForContext?
optionaltokenBudgetForContext:number
Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:192
Advisory token/character budget for final context construction.
topK?
optionaltopK:number
Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:143
Maximum number of chunks per query.
userId?
optionaluserId:string
Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:194
Caller identity for logging/billing.