Interface: RagRetrievalOptions

Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:164

Options controlling retrieval behavior.

Properties

hyde?

optional hyde: object

Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:219

HyDE (Hypothetical Document Embedding) configuration. When enabled, generates a hypothetical answer before embedding for improved retrieval quality. Adds one LLM call per retrieval.

enabled?

optional enabled: boolean

Enable HyDE for this retrieval. Default: false.

hypothesis?

optional hypothesis: string

Pre-generated hypothesis (skip LLM call if provided).

initialThreshold?

optional initialThreshold: number

Initial similarity threshold for adaptive thresholding. Default: 0.7.

minThreshold?

optional minThreshold: number

Minimum threshold to step down to. Default: 0.3.

includeAudit?

optional includeAudit: boolean

Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:234

When true, generates a RAGAuditTrail with per-operation transparency.

includeEmbeddings?

optional includeEmbeddings: boolean

Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:211

Include chunk embeddings in the response.

metadataFilter?

optional metadataFilter: MetadataFilter

Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:172

Metadata filter applied at the vector-store layer.

policy?

optional policy: MemoryRetrievalPolicy

Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:236

Optional shared retrieval policy overlay.

queryEmbeddingModelId?

optional queryEmbeddingModelId: string

Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:213

Query embedding model override.

rerankerConfig?

optional rerankerConfig: object

Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:194

Cross-encoder reranking configuration.

When enabled, retrieved chunks are re-scored using a cross-encoder model for improved relevance ranking. Disabled by default due to added latency.

Recommended use cases:

Background analysis tasks (accuracy over speed)
Batch processing (no user waiting)
Knowledge-intensive tasks (reduces hallucination)

NOT recommended for real-time chat (latency sensitive).

enabled?

optional enabled: boolean

Enable cross-encoder reranking. Default: false

maxDocuments?

optional maxDocuments: number

Max documents to send to reranker (limits cost/latency). Default: 100

modelId?

optional modelId: string

Reranker model ID (e.g., 'rerank-v3.5', 'cross-encoder/ms-marco-MiniLM-L-6-v2')

params?

optional params: Record<string, any>

Provider-specific parameters

providerId?

optional providerId: string

Provider ID ('cohere', 'local')

timeoutMs?

optional timeoutMs: number

Request timeout in ms. Default: 30000

topN?

optional topN: number

Number of top results to return after reranking

strategy?

optional strategy: "hybrid" | "similarity" | "mmr"

Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:174

Retrieval strategy (defaults to similarity search).

strategyParams?

optional strategyParams: object

Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:176

Strategy-specific parameters (MMR lambda, hybrid alpha, etc.).

custom?

optional custom: Record<string, any>

hybridAlpha?

optional hybridAlpha: number

mmrLambda?

optional mmrLambda: number

targetDataSourceIds?

optional targetDataSourceIds: string[]

Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:168

Set of explicit data sources to query.

targetMemoryCategories?

optional targetMemoryCategories: RagMemoryCategory[]

Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:170

Memory categories to consult (maps to data sources via config).

tokenBudgetForContext?

optional tokenBudgetForContext: number

Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:230

Advisory token/character budget for final context construction.

topK?

optional topK: number

Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:166

Maximum number of chunks per query.

userId?

optional userId: string

Defined in: packages/agentos/src/rag/IRetrievalAugmentor.ts:232

Caller identity for logging/billing.

Properties​

hyde?​

enabled?​

hypothesis?​

initialThreshold?​

minThreshold?​

includeAudit?​

includeEmbeddings?​

metadataFilter?​

policy?​

queryEmbeddingModelId?​

rerankerConfig?​

enabled?​

maxDocuments?​

modelId?​

params?​

providerId?​

timeoutMs?​

topN?​

strategy?​

strategyParams?​

custom?​

hybridAlpha?​

mmrLambda?​

targetDataSourceIds?​

targetMemoryCategories?​

tokenBudgetForContext?​

topK?​

userId?​

Properties

hyde?

enabled?

hypothesis?

initialThreshold?

minThreshold?

includeAudit?

includeEmbeddings?

metadataFilter?

policy?

queryEmbeddingModelId?

rerankerConfig?

enabled?

maxDocuments?

modelId?

params?

providerId?

timeoutMs?

topN?

strategy?

strategyParams?

custom?

hybridAlpha?

mmrLambda?

targetDataSourceIds?

targetMemoryCategories?

tokenBudgetForContext?

topK?

userId?