RAG Chunking Playground

Drop in any text and compare chunking strategies — fixed-size, recursive, by-sentence, by-paragraph — with overlap highlighted and an estimated token count per chunk. Stop guessing your chunk size; see exactly how your RAG pipeline will split a document.

Your document: 200 tokens. Token counts are estimated (~4 chars/token).

Chunk size: 300 chars

3 chunks · 66 avg tokens/chunk

#1 · 71 tok · 284 ch

Retrieval-augmented generation (RAG) grounds a language model in your own data. Before a model can retrieve a document, the document must be split into chunks and embedded. Chunk size is a balance. Large chunks preserve context but dilute relevance and burn context tokens at query time.

#2 · 56 tok · 224 ch

Small chunks are precise but can lose the surrounding meaning a passage depends on. Overlap repeats a little text across chunk boundaries. Without it, a sentence split across two chunks may become unretrievable for either one.

#3 · 70 tok · 280 ch

A typical starting point is a few hundred tokens per chunk with ten to twenty percent overlap. The right strategy depends on your content. Prose benefits from sentence or paragraph splitting; code or logs often suit fixed-size windows. Try a few here and watch how the chunks change.

Why chunking decides RAG quality

A retrieval-augmented generation pipeline can only retrieve what it stored, and what it stores is chunks: pieces of your documents, each embedded into a vector. Chunk badly and no prompt can recover what retrieval never surfaced. Split mid-sentence and you embed half-thoughts; make chunks huge and the embedding becomes a blurry average that matches everything and nothing.

The strategies

Fixed-size — cut every N characters. Simple and predictable, but blind to structure; it slices through sentences and code.
Recursive — split on the biggest natural boundary that fits (paragraphs → sentences → words) under a size cap. The sensible default.
By sentence / paragraph — align chunks to real semantic units. Great for prose, but sizes vary and long paragraphs still need a cap.
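The recursive strategy is the least obvious of the three, so here is a minimal Python sketch of one way it could work. The function name `recursive_chunks`, the `SEPARATORS` patterns, and the choice to rejoin pieces with a single space are all illustrative assumptions, not this tool's actual implementation.

```python
import re

# Boundaries tried from coarsest to finest: paragraph break, sentence end, word gap.
# These regexes are illustrative assumptions, not the playground's real rules.
SEPARATORS = [r"\n\s*\n", r"(?<=[.!?])\s+", r"\s+"]


def recursive_chunks(text: str, max_chars: int, level: int = 0) -> list[str]:
    """Split on the coarsest boundary that keeps every chunk within max_chars."""
    text = text.strip()
    if not text:
        return []
    if len(text) <= max_chars:
        return [text]
    if level >= len(SEPARATORS):
        # No natural boundary left (one very long word): hard character cut.
        return [text[i:i + max_chars] for i in range(0, len(text), max_chars)]

    parts = re.split(SEPARATORS[level], text)
    pieces = [p.strip() for p in parts if p.strip()]
    chunks: list[str] = []
    current = ""
    for piece in pieces:
        candidate = f"{current} {piece}" if current else piece
        if len(candidate) <= max_chars:
            current = candidate  # keep packing small pieces together
            continue
        if current:
            chunks.append(current)
            current = ""
        if len(piece) <= max_chars:
            current = piece
        else:
            # Still too big: retry this piece with the next finer separator.
            chunks.extend(recursive_chunks(piece, max_chars, level + 1))
    if current:
        chunks.append(current)
    return chunks
```

Note that this sketch flattens paragraph breaks into spaces when it packs pieces together, and it adds no overlap. A production splitter would usually preserve the original separators.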

Size and overlap

Chunk size trades recall against precision: small chunks pinpoint the exact passage but lose surrounding context; large chunks carry context but dilute the match and cost more tokens at generation time. Overlap repeats a little text between neighbours so an answer that straddles a boundary isn't lost, at the cost of some duplication. The best settings depend on your documents and embedding model — which is exactly why you tune them against real text here.
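To make the size and overlap mechanics concrete, here is a minimal sketch of a fixed-size splitter with character overlap. The name `fixed_chunks` and its parameters are hypothetical; the point is only that the window advances by `size - overlap` characters each step, so every extra character of overlap means more chunks and more duplicated text.

```python
def fixed_chunks(text: str, size: int, overlap: int = 0) -> list[str]:
    """Cut text into windows of `size` characters, where each window repeats
    the last `overlap` characters of the previous one."""
    if size <= 0:
        raise ValueError("size must be positive")
    if not 0 <= overlap < size:
        raise ValueError("overlap must be at least 0 and smaller than size")
    step = size - overlap  # how far the window advances each time
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + size])
        if start + size >= len(text):
            break  # this window already reached the end of the text
    return chunks


print(fixed_chunks("abcdefghij", size=4, overlap=1))  # ['abcd', 'defg', 'ghij']
print(fixed_chunks("abcdefghij", size=4))             # ['abcd', 'efgh', 'ij']
```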

How it works

Fixed: cut every N characters, optionally with character overlap.
Recursive: prefer paragraph → sentence → word boundaries under the size cap.
By sentence / paragraph: split on natural language structure.
Token counts are estimated (~4 chars/token): close enough to size chunks, though exact counts depend on your model's tokenizer.
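The token estimate is simple enough to sketch in a few lines of Python. The helper names below are hypothetical, and rounding up is an assumption: the page only states the ~4 chars/token ratio, not how fractions are handled.

```python
import math

CHARS_PER_TOKEN = 4  # rough ratio for English text; an estimate, not a tokenizer


def estimate_tokens(text: str) -> int:
    """Estimate tokens from character count (rounding up is an assumption)."""
    return math.ceil(len(text) / CHARS_PER_TOKEN)


def describe(chunks: list[str]) -> list[str]:
    """Build one label per chunk in the style '#1 · 71 tok · 284 ch'."""
    return [
        f"#{i} · {estimate_tokens(chunk)} tok · {len(chunk)} ch"
        for i, chunk in enumerate(chunks, start=1)
    ]


def average_tokens(chunks: list[str]) -> int:
    """Mean estimated tokens per chunk, rounded to the nearest integer."""
    if not chunks:
        return 0
    return round(sum(estimate_tokens(c) for c in chunks) / len(chunks))
```

Fed three chunks of 284, 224, and 280 characters, this gives 71, 56, and 70 tokens with an average of 66. For exact numbers, count with the tokenizer of the model you actually deploy.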

Frequently asked questions

What is chunking in RAG?

Chunking is splitting a source document into smaller pieces before embedding them for retrieval. Chunk size and overlap directly affect retrieval quality: too large and you dilute relevance and waste context tokens; too small and you lose the surrounding meaning a passage needs.

What chunk size and overlap should I use?

A common starting point is 200–500 tokens per chunk with 10–20% overlap, but the right values depend on your content and embedding model. This tool lets you try sizes and strategies on your own text so you can see the trade-off instead of guessing.
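Since the playground sizes chunks in characters while the guidance above is in tokens, a quick conversion helps. This is a sketch under the same ~4 chars/token estimate; `chunk_settings` is a hypothetical helper, not part of the tool.

```python
def chunk_settings(
    target_tokens: int, overlap_pct: float, chars_per_token: int = 4
) -> tuple[int, int]:
    """Convert a token budget and an overlap percentage into
    (chunk size in characters, overlap in characters)."""
    size_chars = target_tokens * chars_per_token
    overlap_chars = round(size_chars * overlap_pct / 100)
    return size_chars, overlap_chars


size, overlap = chunk_settings(target_tokens=300, overlap_pct=15)
print(size, overlap)  # 1200 180
```

So 300 tokens at 15% overlap is roughly 1200 characters per chunk with 180 characters repeated, and the range quoted above (200 to 500 tokens) spans roughly 800 to 2000 characters.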

What is the difference between fixed and recursive chunking?

Fixed chunking cuts every N characters/tokens regardless of structure, which can split sentences mid-thought. Recursive chunking tries to break on natural boundaries first — paragraphs, then sentences, then words — so chunks stay coherent while staying under the size limit.
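A small comparison shows the difference. The sketch below contrasts a plain fixed cut with a simplified boundary-aware splitter that only packs whole sentences (one level of the recursive idea, not the full paragraph, sentence, word cascade). The function `sentence_chunks` and the sample text are illustrative assumptions.

```python
import re


def sentence_chunks(text: str, max_chars: int) -> list[str]:
    """Pack whole sentences into chunks of at most max_chars characters.
    A single sentence longer than max_chars is kept whole in this sketch;
    a full recursive splitter would break it further on word boundaries."""
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    chunks, current = [], ""
    for sentence in sentences:
        candidate = f"{current} {sentence}" if current else sentence
        if current and len(candidate) > max_chars:
            chunks.append(current)  # adding this sentence would exceed the cap
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


text = "Chunk size is a balance. Small chunks are precise. Large chunks keep context."

# Fixed: cut every 60 characters, wherever that lands.
fixed = [text[i:i + 60] for i in range(0, len(text), 60)]
# Boundary-aware: same 60-character cap, but only break between sentences.
by_sentence = sentence_chunks(text, 60)

print(fixed[0])        # ends mid-word: "... Large chu"
print(by_sentence[0])  # ends on a full stop
```

Both versions respect the same cap, but only the second produces chunks you could read on their own.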

Why does overlap between chunks matter?

Overlap repeats a little text at the boundary of adjacent chunks so a fact that straddles a split is not lost to retrieval. Without overlap, a sentence cut in half can become unretrievable from either chunk.
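Here is a toy illustration of that failure, assuming fixed-size character windows. The sample sentence, the `windows` helper, and the substring check standing in for retrieval are all made up for the example; real retrieval compares embeddings, not substrings.

```python
def windows(text: str, size: int, overlap: int) -> list[str]:
    """Fixed-size character windows; each starts `size - overlap` after the last.
    Assumes 0 <= overlap < size."""
    step = size - overlap
    return [text[i:i + size] for i in range(0, len(text), step)]


def retrievable(fact: str, chunks: list[str]) -> bool:
    """True if some single chunk contains the whole fact."""
    return any(fact in chunk for chunk in chunks)


text = "The launch code is 7421 and the backup code is 9963."

no_overlap = windows(text, size=20, overlap=0)
with_overlap = windows(text, size=20, overlap=5)

print(no_overlap[:2])                     # ['The launch code is 7', '421 and the backup c']
print(retrievable("7421", no_overlap))    # False: the number is cut in two
print(retrievable("7421", with_overlap))  # True: the second window starts 5 chars earlier
```

With windows like these, a span is only guaranteed to land whole inside one chunk when it is no longer than the overlap plus one character, which is one reason overlap is usually set as a meaningful fraction of chunk size rather than a handful of characters.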
