Topic Hub
Retrieval
71 linked pages across the LLM-Docs library.
doc
Embeddings & Semantic Search
Building production semantic search systems — embedding model selection, indexing strategies, query processing, relevance tuning, and hybrid search
doc
Data Platform Retrieval Agent Implementation Guide
Architecture, workflow design, metrics, and rollout guidance for a data platform retrieval agent in production.
doc
Developer Productivity Retrieval Agent Implementation Guide
Architecture, workflow design, metrics, and rollout guidance for a developer productivity retrieval agent in production.
doc
Finance Operations Retrieval Agent Implementation Guide
Architecture, workflow design, metrics, and rollout guidance for a finance operations retrieval agent in production.
doc
Growth Marketing Retrieval Agent Implementation Guide
Architecture, workflow design, metrics, and rollout guidance for a growth marketing retrieval agent in production.
doc
Healthcare Operations Retrieval Agent Implementation Guide
Architecture, workflow design, metrics, and rollout guidance for a healthcare operations retrieval agent in production.
doc
Legal Compliance Retrieval Agent Implementation Guide
Architecture, workflow design, metrics, and rollout guidance for a legal compliance retrieval agent in production.
doc
Research Intelligence Retrieval Agent Implementation Guide
Architecture, workflow design, metrics, and rollout guidance for a research intelligence retrieval agent in production.
doc
Sales Enablement Retrieval Agent Implementation Guide
Architecture, workflow design, metrics, and rollout guidance for a sales enablement retrieval agent in production.
doc
Security Operations Retrieval Agent Implementation Guide
Architecture, workflow design, metrics, and rollout guidance for a security operations retrieval agent in production.
doc
Support Operations Retrieval Agent Implementation Guide
Architecture, workflow design, metrics, and rollout guidance for a support operations retrieval agent in production.
doc
Embeddings Architecture Patterns
Reference patterns, tradeoffs, and building blocks for designing embeddings systems.
doc
Embeddings Architecture Patterns
Reference patterns, tradeoffs, and building blocks for designing embeddings systems.
doc
Embeddings Cost and Performance
How to trade off latency, throughput, quality, and spend when operating embeddings.
doc
Embeddings Cost and Performance
How to trade off latency, throughput, quality, and spend when operating embeddings.
doc
Embeddings Evaluation Metrics
Metrics, scorecards, and review methods for measuring embeddings quality in practice.
doc
Embeddings Evaluation Metrics
Metrics, scorecards, and review methods for measuring embeddings quality in practice.
doc
Embeddings Failure Modes
Common failure patterns, debugging workflows, and prevention strategies for embeddings.
doc
Embeddings Failure Modes
Common failure patterns, debugging workflows, and prevention strategies for embeddings.
doc
Embeddings Foundations
Core concepts, terminology, workflows, and mental models for representing text, code, or multimodal inputs for semantic search and ranking in modern AI systems.
doc
Embeddings Foundations
Core concepts, terminology, workflows, and mental models for representing text, code, or multimodal inputs for semantic search and ranking in modern AI systems.
doc
Embeddings Implementation Guide
A practical step-by-step guide for implementing embeddings with production constraints in mind.
doc
Embeddings Implementation Guide
A practical step-by-step guide for implementing embeddings with production constraints in mind.
doc
Embeddings Production Checklist
Deployment checklist, operational controls, and rollout guidance for embeddings workloads.
doc
Embeddings Production Checklist
Deployment checklist, operational controls, and rollout guidance for embeddings workloads.
doc
Embeddings Vendor Landscape
How vendors, open-source options, and ecosystem tools compare for embeddings use cases.
doc
Embeddings Vendor Landscape
How vendors, open-source options, and ecosystem tools compare for embeddings use cases.
doc
Long-Context Systems Architecture Patterns
Reference patterns, tradeoffs, and building blocks for designing long-context systems systems.
doc
Long-Context Systems Cost and Performance
How to trade off latency, throughput, quality, and spend when operating long-context systems.
doc
Long-Context Systems Evaluation Metrics
Metrics, scorecards, and review methods for measuring long-context systems quality in practice.
doc
Long-Context Systems Failure Modes
Common failure patterns, debugging workflows, and prevention strategies for long-context systems.
doc
Long-Context Systems Foundations
Core concepts, terminology, workflows, and mental models for working with very large prompts and documents without losing relevance or speed in modern AI systems.
doc
Long-Context Systems Implementation Guide
A practical step-by-step guide for implementing long-context systems with production constraints in mind.
doc
Long-Context Systems Production Checklist
Deployment checklist, operational controls, and rollout guidance for long-context systems workloads.
doc
Long-Context Systems Vendor Landscape
How vendors, open-source options, and ecosystem tools compare for long-context systems use cases.
doc
Retrieval-Augmented Generation Architecture Patterns
Reference patterns, tradeoffs, and building blocks for designing retrieval-augmented generation systems.
doc
Retrieval-Augmented Generation Cost and Performance
How to trade off latency, throughput, quality, and spend when operating retrieval-augmented generation.
doc
Retrieval-Augmented Generation Evaluation Metrics
Metrics, scorecards, and review methods for measuring retrieval-augmented generation quality in practice.
doc
Retrieval-Augmented Generation Failure Modes
Common failure patterns, debugging workflows, and prevention strategies for retrieval-augmented generation.
doc
Retrieval-Augmented Generation Foundations
Core concepts, terminology, workflows, and mental models for grounding model output in trusted external knowledge at runtime in modern AI systems.
doc
Retrieval-Augmented Generation Implementation Guide
A practical step-by-step guide for implementing retrieval-augmented generation with production constraints in mind.
doc
Retrieval-Augmented Generation Production Checklist
Deployment checklist, operational controls, and rollout guidance for retrieval-augmented generation workloads.
doc
Retrieval-Augmented Generation Vendor Landscape
How vendors, open-source options, and ecosystem tools compare for retrieval-augmented generation use cases.
doc
Vector Databases Architecture Patterns
Reference patterns, tradeoffs, and building blocks for designing vector databases systems.
doc
Vector Databases Architecture Patterns
Reference patterns, tradeoffs, and building blocks for designing vector databases systems.
doc
Vector Databases Cost and Performance
How to trade off latency, throughput, quality, and spend when operating vector databases.
doc
Vector Databases Cost and Performance
How to trade off latency, throughput, quality, and spend when operating vector databases.
doc
Vector Databases Evaluation Metrics
Metrics, scorecards, and review methods for measuring vector databases quality in practice.
doc
Vector Databases Evaluation Metrics
Metrics, scorecards, and review methods for measuring vector databases quality in practice.
doc
Vector Databases Failure Modes
Common failure patterns, debugging workflows, and prevention strategies for vector databases.
doc
Vector Databases Failure Modes
Common failure patterns, debugging workflows, and prevention strategies for vector databases.
doc
Vector Databases Foundations
Core concepts, terminology, workflows, and mental models for storing and searching embeddings efficiently for similarity and hybrid retrieval in modern AI systems.
doc
Vector Databases Foundations
Core concepts, terminology, workflows, and mental models for storing and searching embeddings efficiently for similarity and hybrid retrieval in modern AI systems.
doc
Vector Databases Implementation Guide
A practical step-by-step guide for implementing vector databases with production constraints in mind.
doc
Vector Databases Implementation Guide
A practical step-by-step guide for implementing vector databases with production constraints in mind.
doc
Vector Databases Production Checklist
Deployment checklist, operational controls, and rollout guidance for vector databases workloads.
doc
Vector Databases Production Checklist
Deployment checklist, operational controls, and rollout guidance for vector databases workloads.
doc
Vector Databases Vendor Landscape
How vendors, open-source options, and ecosystem tools compare for vector databases use cases.
doc
Vector Databases Vendor Landscape
How vendors, open-source options, and ecosystem tools compare for vector databases use cases.
doc
RAG — Retrieval-Augmented Generation
Ground LLM outputs in your own data — vector databases, embedding models, chunking strategies, and production RAG architectures
doc
Context Window and Long-Context Understanding
How context windows work, techniques for extending them, and strategies for managing long documents with LLMs
agent
Data Platform Retrieval Agent
Data Platform agent blueprint focused on find the right internal knowledge quickly and package it into grounded context for downstream responses or actions for analysts and engineers need better query generation, pipeline debugging, and dataset explanation across changing schemas.
agent
Developer Productivity Retrieval Agent
Developer Productivity agent blueprint focused on find the right internal knowledge quickly and package it into grounded context for downstream responses or actions for engineering teams want reliable help with issue triage, runbook guidance, and change review without obscuring system ownership.
agent
Finance Operations Retrieval Agent
Finance Operations agent blueprint focused on find the right internal knowledge quickly and package it into grounded context for downstream responses or actions for finance teams need faster reconciliation, exception review, and policy-aware reporting for recurring operational workflows.
agent
Growth Marketing Retrieval Agent
Growth Marketing agent blueprint focused on find the right internal knowledge quickly and package it into grounded context for downstream responses or actions for campaign teams need faster experimentation, channel-specific copy, and clearer measurement loops without losing brand control.
agent
Healthcare Operations Retrieval Agent
Healthcare Operations agent blueprint focused on find the right internal knowledge quickly and package it into grounded context for downstream responses or actions for care and operations teams need workflow assistance around intake, documentation, and coordination while preserving safety review.
agent
Legal Compliance Retrieval Agent
Legal Compliance agent blueprint focused on find the right internal knowledge quickly and package it into grounded context for downstream responses or actions for legal teams need structured review support for contracts, obligations, and policy mapping under strict approval controls.
agent
Research Intelligence Retrieval Agent
Research Intelligence agent blueprint focused on find the right internal knowledge quickly and package it into grounded context for downstream responses or actions for research and strategy teams need synthesis across large source sets with explicit provenance, tradeoffs, and update tracking.
agent
Sales Enablement Retrieval Agent
Sales Enablement agent blueprint focused on find the right internal knowledge quickly and package it into grounded context for downstream responses or actions for fragmented deal context, inconsistent follow-up quality, and too much rep time spent gathering account intelligence.
agent
Security Operations Retrieval Agent
Security Operations agent blueprint focused on find the right internal knowledge quickly and package it into grounded context for downstream responses or actions for security teams must classify alerts, enrich incidents, and reduce analyst fatigue without introducing unsafe automation.
agent
Support Operations Retrieval Agent
Support Operations agent blueprint focused on find the right internal knowledge quickly and package it into grounded context for downstream responses or actions for high ticket volume, inconsistent routing, and slow escalation paths across chat, email, and in-product support.