What is GraphRAG and when to use it

GraphRAG in brief

GraphRAG models relationships between document chunks as a graph so the retrieval-generation process can preserve connected context. It complements pure vector search with relationship and hierarchy signals that are easy to miss.

When to use it

Domains where relationships matter (wikis, policy docs, org charts, dependency graphs)
When “A influences B” style relationships are central to the answer
When connectivity across documents affects quality more than any single document

Design considerations

Chunk strategy: Manage sections plus linking nodes (titles/headers) as nodes together.
Graph construction: Create edges from links, heading hierarchy, and shared entities.
Hybrid retrieval: Narrow candidates with vector search, then expand with graph neighbors.

Example flow

Embed the question and retrieve top K documents with vector search.
Expand graph neighbors of those documents to increase candidate context.
Deduplicate and summarize/normalize before sending to the LLM.

Caveats

Expanding the graph too far increases noise.
Unclear relation definitions can confuse the model.
Build cost is high, so adopt selectively for MVPs.

Quick checklist

Is vector search alone insufficient for accuracy?
Is relationship information central to the question?
Is the impact worth the build and ops cost?

Ultimately, GraphRAG excels when relationship-based context is critical.