Interface: RagRetrievalOptions
Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:142
Options controlling retrieval behavior.
Properties
hyde?
optional hyde: object
Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:197
HyDE (Hypothetical Document Embeddings) configuration. When enabled, generates a hypothetical answer before embedding for improved retrieval quality. Adds one LLM call per retrieval unless a pre-generated hypothesis is supplied.
enabled?
optional enabled: boolean
Enable HyDE for this retrieval. Default: false.
hypothesis?
optional hypothesis: string
Pre-generated hypothesis. When provided, the LLM call is skipped.
initialThreshold?
optional initialThreshold: number
Initial similarity threshold for adaptive thresholding. Default: 0.7.
minThreshold?
optional minThreshold: number
Minimum threshold to step down to. Default: 0.3.
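The two thresholds suggest a search that starts strict and relaxes until something matches. The sketch below illustrates that idea only: `HydeOptions` is a local mirror of the documented shape, `thresholdSchedule` is a hypothetical helper, and the step size of 0.1 is an assumption, since the step is not documented here.

```typescript
// Local mirror of the documented `hyde` option shape (not imported from the package).
interface HydeOptions {
  enabled?: boolean;
  hypothesis?: string;
  initialThreshold?: number;
  minThreshold?: number;
}

// Hypothetical helper: lists the similarity thresholds an adaptive search
// might try, from initialThreshold down to minThreshold.
// ASSUMPTION: the 0.1 step size is illustrative and not taken from the source.
function thresholdSchedule(hyde: HydeOptions, step = 0.1): number[] {
  const start = hyde.initialThreshold ?? 0.7; // documented default
  const floor = hyde.minThreshold ?? 0.3; // documented default
  const schedule: number[] = [];
  // The small tolerance guards against floating-point drift near the floor.
  for (let t = start; t > floor + 1e-9; t -= step) {
    schedule.push(Number(t.toFixed(2)));
  }
  schedule.push(floor);
  return schedule;
}

// Supplying `hypothesis` skips the extra LLM call, per the property description.
const hydeOptions: HydeOptions = {
  enabled: true,
  hypothesis: "Vector indexes are rebuilt nightly by a scheduled job.",
};

const schedule = thresholdSchedule(hydeOptions);
```

With the documented defaults, the schedule runs from 0.7 down to 0.3 in five entries.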
includeAudit?
optional includeAudit: boolean
Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:212
When true, generates a RAGAuditTrail with per-operation transparency.
includeEmbeddings?
optional includeEmbeddings: boolean
Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:189
Include chunk embeddings in the response.
metadataFilter?
optional metadataFilter: MetadataFilter
Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:150
Metadata filter applied at the vector-store layer.
queryEmbeddingModelId?
optional queryEmbeddingModelId: string
Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:191
Query embedding model override.
rerankerConfig?
optional rerankerConfig: object
Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:172
Cross-encoder reranking configuration.
When enabled, retrieved chunks are re-scored using a cross-encoder model for improved relevance ranking. Disabled by default due to added latency.
Recommended use cases:
- Background analysis tasks (accuracy over speed)
- Batch processing (no user waiting)
- Knowledge-intensive tasks (reduces hallucination)
NOT recommended for real-time chat (latency sensitive).
enabled?
optional enabled: boolean
Enable cross-encoder reranking. Default: false.
maxDocuments?
optional maxDocuments: number
Maximum number of documents to send to the reranker (limits cost and latency). Default: 100.
modelId?
optional modelId: string
Reranker model ID (e.g., 'rerank-v3.5', 'cross-encoder/ms-marco-MiniLM-L-6-v2').
params?
optional params: Record<string, any>
Provider-specific parameters.
providerId?
optional providerId: string
Provider ID ('cohere', 'local').
timeoutMs?
optional timeoutMs: number
Request timeout in milliseconds. Default: 30000.
topN?
optional topN: number
Number of top results to return after reranking.
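As a rough illustration of how these fields might fit together for a batch job, the sketch below defines a local mirror of the documented shape (`RerankerConfig`) and a hypothetical helper, `applyRerank`, showing one plausible way `maxDocuments` and `topN` bound the work. The scoring callback stands in for a real cross-encoder call. None of this is the library's implementation.

```typescript
// Local mirror of the documented `rerankerConfig` shape (not imported from the package).
interface RerankerConfig {
  enabled?: boolean;
  providerId?: string;
  modelId?: string;
  params?: Record<string, unknown>;
  maxDocuments?: number;
  timeoutMs?: number;
  topN?: number;
}

// Example configuration for a batch job where accuracy matters more than latency.
const batchRerank: RerankerConfig = {
  enabled: true,
  providerId: "cohere",
  modelId: "rerank-v3.5",
  maxDocuments: 50,
  topN: 5,
  timeoutMs: 10000,
};

// Hypothetical helper: caps the candidates at maxDocuments, re-scores them,
// and keeps the topN best. ASSUMPTION: `score` stands in for a cross-encoder.
function applyRerank<T>(
  chunks: T[],
  cfg: RerankerConfig,
  score: (chunk: T) => number,
): T[] {
  if (!cfg.enabled) return chunks; // reranking is disabled by default
  const candidates = chunks.slice(0, cfg.maxDocuments ?? 100);
  const ranked = candidates
    .map((chunk) => ({ chunk, s: score(chunk) }))
    .sort((a, b) => b.s - a.s)
    .map((entry) => entry.chunk);
  return cfg.topN === undefined ? ranked : ranked.slice(0, cfg.topN);
}

// Toy usage: numbers act as their own relevance scores.
const reranked = applyRerank(
  [1, 5, 3, 9],
  { enabled: true, maxDocuments: 3, topN: 2 },
  (x) => x,
);
```

In the toy call, the fourth item is never scored because `maxDocuments` is 3, which is how the cap limits cost.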
strategy?
optional strategy: "hybrid" | "similarity" | "mmr"
Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:152
Retrieval strategy (defaults to similarity search).
strategyParams?
optional strategyParams: object
Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:154
Strategy-specific parameters (MMR lambda, hybrid alpha, etc.).
custom?
optional custom: Record<string, any>
hybridAlpha?
optional hybridAlpha: number
mmrLambda?
optional mmrLambda: number
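The sub-properties above carry no descriptions, so the following is only a sketch of what an MMR lambda conventionally controls: the trade-off between relevance to the query and redundancy with chunks already selected. `StrategyParams` and `Candidate` are local types, `mmrSelect` is a hypothetical helper, and the fallback lambda of 0.5 is an assumption rather than a documented default.

```typescript
// Local mirror of the documented `strategyParams` shape (not imported from the package).
interface StrategyParams {
  mmrLambda?: number;
  hybridAlpha?: number;
  custom?: Record<string, unknown>;
}

// Hypothetical candidate shape used only for this illustration.
interface Candidate {
  id: string;
  relevance: number; // similarity to the query
  vector: number[]; // chunk embedding
}

function cosine(a: number[], b: number[]): number {
  let dot = 0;
  let na = 0;
  let nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return na === 0 || nb === 0 ? 0 : dot / Math.sqrt(na * nb);
}

// Standard greedy MMR: score = lambda * relevance - (1 - lambda) * redundancy.
// ASSUMPTION: the 0.5 fallback is illustrative, not a documented default.
function mmrSelect(cands: Candidate[], k: number, params: StrategyParams): Candidate[] {
  const lambda = params.mmrLambda ?? 0.5;
  const selected: Candidate[] = [];
  const remaining = [...cands];
  while (selected.length < k && remaining.length > 0) {
    let bestIdx = 0;
    let bestScore = -Infinity;
    remaining.forEach((c, i) => {
      const redundancy =
        selected.length === 0
          ? 0
          : Math.max(...selected.map((s) => cosine(c.vector, s.vector)));
      const score = lambda * c.relevance - (1 - lambda) * redundancy;
      if (score > bestScore) {
        bestScore = score;
        bestIdx = i;
      }
    });
    selected.push(remaining.splice(bestIdx, 1)[0]);
  }
  return selected;
}

const candidates: Candidate[] = [
  { id: "a", relevance: 0.9, vector: [1, 0] },
  { id: "b", relevance: 0.85, vector: [1, 0] }, // near-duplicate of "a"
  { id: "c", relevance: 0.6, vector: [0, 1] },
];

const diverse = mmrSelect(candidates, 2, { mmrLambda: 0.5 }).map((c) => c.id);
const relevanceOnly = mmrSelect(candidates, 2, { mmrLambda: 1 }).map((c) => c.id);
```

With lambda at 0.5 the near-duplicate "b" is skipped in favor of "c"; with lambda at 1 the selection reduces to plain relevance ordering.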
targetDataSourceIds?
optional targetDataSourceIds: string[]
Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:146
Set of explicit data sources to query.
targetMemoryCategories?
optional targetMemoryCategories: RagMemoryCategory[]
Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:148
Memory categories to consult (maps to data sources via config).
tokenBudgetForContext?
optional tokenBudgetForContext: number
Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:208
Advisory token/character budget for final context construction.
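One way a consumer might honor an advisory budget is to pack chunks greedily, in rank order, until the next one would overflow. The sketch below measures the budget in characters purely for simplicity. `packContext` is a hypothetical helper, and how the library itself applies the budget is not stated here.

```typescript
// Hypothetical helper: keeps chunks in rank order until adding the next one
// would exceed the budget. ASSUMPTION: the budget is counted in characters;
// a token-based variant would swap `chunk.length` for a tokenizer count.
function packContext(chunks: string[], budget: number): string[] {
  const kept: string[] = [];
  let used = 0;
  for (const chunk of chunks) {
    if (used + chunk.length > budget) break;
    kept.push(chunk);
    used += chunk.length;
  }
  return kept;
}

const packed = packContext(["aaaa", "bbb", "cc"], 8);
```

Here the first two chunks use 7 of the 8 available characters, so the third is dropped.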
topK?
optional topK: number
Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:144
Maximum number of chunks per query.
userId?
optional userId: string
Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:210
Caller identity for logging/billing.
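Putting several of the top-level properties together, a typical options object might look like the sketch below. `RagRetrievalOptionsSketch` is a local mirror of a subset of the interface. `MetadataFilterStandIn` is a placeholder because the shape of `MetadataFilter` is not shown on this page, and the data source ID, filter key, and user ID are invented for illustration.

```typescript
// ASSUMPTION: placeholder for MetadataFilter, whose real shape is not documented here.
type MetadataFilterStandIn = Record<string, unknown>;

type RetrievalStrategy = "hybrid" | "similarity" | "mmr";

// Local mirror of a subset of RagRetrievalOptions (not imported from the package).
interface RagRetrievalOptionsSketch {
  topK?: number;
  targetDataSourceIds?: string[];
  metadataFilter?: MetadataFilterStandIn;
  strategy?: RetrievalStrategy;
  includeEmbeddings?: boolean;
  includeAudit?: boolean;
  tokenBudgetForContext?: number;
  userId?: string;
}

const options: RagRetrievalOptionsSketch = {
  topK: 8,
  targetDataSourceIds: ["docs-main"], // hypothetical data source ID
  metadataFilter: { language: "en" }, // hypothetical filter key
  includeEmbeddings: false,
  includeAudit: true,
  tokenBudgetForContext: 4000,
  userId: "user-123", // hypothetical caller identity
};

// Every property is optional; per the docs, an omitted strategy means similarity search.
function resolveStrategy(o: RagRetrievalOptionsSketch): RetrievalStrategy {
  return o.strategy ?? "similarity";
}

const effectiveStrategy = resolveStrategy(options);
```

Because `strategy` is omitted above, `effectiveStrategy` falls back to similarity search.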