
| Strategy | Best for |
|---|---|
semantic | Narrative documents where topics matter more than page boundaries. |
header | Reports, policies, filings, and manuals with useful headings. |
page | Legal, regulatory, and audit workflows where page provenance matters. |
recursive | Strict size windows for embedding models or downstream systems. |
extensions.chunking. Keep page chunking on when auditors or users must be able to reconcile an answer back to exact page boundaries.
Chunking For RAG
Choose a retrieval-friendly chunking strategy.
LangChain Vector Store
Embed chunks with source metadata.