INQUIRING LINE

How do community summaries and selective traversal differ as graph scaling strategies?

This explores two opposite answers to the same problem, that graphs are too big to fit in an LLM's context: one strategy pre-digests the whole graph into summaries (GraphRAG), and the other learns to walk only the parts that matter (Graph-O1).


An entity graph built from a corpus is far too large to hand an LLM whole, so something has to give. Community summaries and selective traversal resolve that tension in mirror-image ways: one pays the cost up front and reads everything once, the other pays at query time and reads as little as possible.

The community-summary strategy works ahead of time. GraphRAG (see "Can community detection enable RAG systems to answer global corpus questions?") partitions the graph into clusters with Leiden community detection, pre-generates a summary for each cluster, then answers a question by map-reducing over those summaries. The whole graph is effectively 'read,' just in compressed form, which is exactly why it can answer *global* questions ('what are the main themes across this corpus?') that ordinary chunk-retrieval RAG can't: no single chunk contains the answer. The price is precomputation cost and staleness, since the summaries are only as fresh as the last time you rebuilt them.
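To make that shape concrete, here is a minimal Python sketch of precompute-then-map-reduce. It is a toy under loud assumptions: connected components stand in for Leiden community detection, and `summarize` and `answer_global` are hypothetical placeholders for LLM calls, not GraphRAG's actual API.

```python
from collections import defaultdict


def connected_components(edges):
    """Stand-in for Leiden (assumption): group nodes by connected component."""
    adj = defaultdict(set)
    for a, b in edges:
        adj[a].add(b)
        adj[b].add(a)
    seen, communities = set(), []
    for start in sorted(adj):
        if start in seen:
            continue
        seen.add(start)
        stack, members = [start], []
        while stack:
            node = stack.pop()
            members.append(node)
            for nxt in sorted(adj[node]):
                if nxt not in seen:
                    seen.add(nxt)
                    stack.append(nxt)
        communities.append(sorted(members))
    return communities


def summarize(members):
    """Hypothetical placeholder for an LLM summary of one community."""
    return "community about " + ", ".join(members)


def build_index(edges):
    """Offline step: one summary per community, paid once per rebuild."""
    return [summarize(c) for c in connected_components(edges)]


def answer_global(question, summaries):
    """Query step: map over every summary, then reduce the partial answers."""
    partials = [f"[{question}] {s}" for s in summaries]  # map (placeholder)
    return " | ".join(partials)                          # reduce (placeholder)


EDGES = [("alice", "bob"), ("bob", "carol"), ("dave", "erin")]
INDEX = build_index(EDGES)
print(answer_global("main themes?", INDEX))
```

`build_index` runs once offline, while `answer_global` touches every summary at query time. That is why a global question sees every community, and also why a changed graph means rerunning `build_index`.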

Selective traversal flips every one of those choices. Graph-O1 (see "Can learned traversal policies beat exhaustive graph reading?") never ingests the whole graph at all. It navigates step by step using Monte Carlo Tree Search and a reinforcement-learned policy, expanding only the nodes that look promising for *this* query. That fits comfortably inside a context window and adapts to the question, but it trades certainty for efficiency: you get decisions under uncertainty about a graph you never fully saw, rather than a guaranteed-complete (if lossy) global view. So the real axis isn't 'better vs. worse'; it's *completeness with compression* versus *selectivity with adaptivity*.
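The budgeted, query-driven expansion described above can be sketched in a few lines of Python. This is not Graph-O1's method: a greedy best-first search with a hand-written word-overlap score is swapped in for MCTS and the learned policy, and the graph, `score`, and `selective_traverse` are all hypothetical names chosen for illustration.

```python
import heapq

# Toy graph (assumption): node name -> neighbor names.
GRAPH = {
    "query_root": ["paris", "berlin"],
    "paris": ["eiffel tower", "louvre"],
    "berlin": ["brandenburg gate"],
    "eiffel tower": [],
    "louvre": [],
    "brandenburg gate": [],
}


def score(node, query):
    """Hypothetical relevance score: number of words shared with the query."""
    return len(set(node.split()) & set(query.lower().split()))


def selective_traverse(graph, start, query, budget):
    """Expand at most `budget` nodes, most promising first."""
    frontier = [(-score(start, query), start)]  # min-heap, so negate scores
    seen = {start}
    visited = []
    while frontier and len(visited) < budget:
        _, node = heapq.heappop(frontier)
        visited.append(node)
        for nxt in graph[node]:
            if nxt not in seen:
                seen.add(nxt)
                heapq.heappush(frontier, (-score(nxt, query), nxt))
    return visited


VISITED = selective_traverse(
    GRAPH, "query_root", "how tall is the eiffel tower in paris", budget=3
)
print(VISITED)
```

With a budget of three, the walk never expands the Berlin branch. That is the efficiency gain and the uncertainty in one picture: anything relevant hiding behind an unpromising-looking node is simply never seen.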

These two aren't the only points on the map, and the note library hints at a third move: skip the persistent graph entirely. LogicRAG (see "Can query-time graph construction replace pre-built knowledge graphs?") builds a small query-specific graph at inference time, dodging both the precomputation bill of community summaries and the navigation overhead of traversing a giant fixed structure. And MiA-RAG (see "Can building a document map first improve retrieval over long texts?") shows the summary-first instinct generalizes beyond graphs: summarizing a document before retrieving from it recovers the same kind of global structure that community summaries capture, without any graph at all.
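As a rough illustration of building a graph per query instead of per corpus, the Python sketch below turns one question into a small dependency DAG and orders its sub-questions so that each is resolved after the ones it depends on. The decomposition is hard-coded and hypothetical (a real system would presumably ask an LLM for it), and resolving in topological order is one plausible way to use such a DAG, not a claim about LogicRAG's internals.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+


def decompose(query):
    """Hypothetical decomposition: sub-question -> sub-questions it depends on.

    Hard-coded for illustration; `query` is ignored in this toy.
    """
    return {
        "who directed the film": set(),
        "where was the director born": {"who directed the film"},
        "population of that city": {"where was the director born"},
    }


def plan(query):
    """Build the query-specific DAG and return a dependency-respecting order."""
    dag = decompose(query)
    return list(TopologicalSorter(dag).static_order())


PLAN = plan("What is the population of the city where the film's director was born?")
for step, sub_question in enumerate(PLAN, start=1):
    print(step, sub_question)
```

Nothing here persists between queries, so there is no index to rebuild and nothing to go stale. The cost moves into the decomposition step, which runs on every request.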

If you want to push further: HGMem (see "Can hypergraphs capture multi-hop reasoning better than graphs?") changes what the nodes and edges even represent (binding three or more entities into one relation), which reshapes both strategies: bigger, richer structures make pre-summarizing more valuable but selective traversal harder. The deeper pattern across all of these is a single recurring lever in scaling: do you compress everything once, or select sharply each time you're asked?
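A small Python sketch shows what is at stake in binding three or more entities into one relation. The data and helper names are hypothetical, and this is only an illustration of the representational point, not HGMem's implementation: once three-way facts are decomposed into pairwise edges, the graph can appear to support a three-way relation that was never stated.

```python
from itertools import combinations

# Three ternary facts (assumption): each pair of people shared one project.
HYPEREDGES = {
    frozenset({"alice", "bob", "project_1"}),
    frozenset({"bob", "carol", "project_2"}),
    frozenset({"alice", "carol", "project_3"}),
}


def to_pairwise(hyperedges):
    """Decompose every hyperedge into its binary edges."""
    pairs = set()
    for edge in hyperedges:
        pairs.update(frozenset(p) for p in combinations(sorted(edge), 2))
    return pairs


def pairwise_consistent(pairs, group):
    """True if every pair inside `group` is linked in the binary graph."""
    return all(frozenset(p) in pairs for p in combinations(sorted(group), 2))


PAIRS = to_pairwise(HYPEREDGES)
TRIO = frozenset({"alice", "bob", "carol"})

# The binary graph shows a full triangle among the three people,
# but no single stated fact ever bound all three together.
SPURIOUS = pairwise_consistent(PAIRS, TRIO) and TRIO not in HYPEREDGES
print(SPURIOUS)
```

Keeping the hyperedges intact avoids that false triangle, at the price of a structure that is harder to walk one hop at a time.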


Sources (5 notes)

Can community detection enable RAG systems to answer global corpus questions?

GraphRAG uses Leiden community detection to partition entity graphs into modular groups with pre-generated summaries, enabling map-reduce answering of global questions that pure RAG and prior summarization methods cannot handle efficiently.

Can learned traversal policies beat exhaustive graph reading?

Graph-O1 replaces whole-graph ingestion with step-by-step agentic navigation using Monte Carlo Tree Search and reinforcement learning. This approach fits within LLM context windows while learning domain-specific traversal policies, though it trades certainty about the full graph for decision-making under uncertainty.

Can query-time graph construction replace pre-built knowledge graphs?

LogicRAG constructs directed acyclic graphs from queries at inference time rather than pre-building corpus-wide graphs, eliminating construction overhead, avoiding staleness, and enabling query-specific retrieval logic without sacrificing multi-hop reasoning capability.

Can building a document map first improve retrieval over long texts?

MiA-RAG inverts standard RAG by summarizing documents first, then conditioning retrieval on that global view. This approach recovers discourse structure that bag-of-chunks retrieval destroys, making scattered evidence findable by its role in the document rather than by surface similarity alone.

Can hypergraphs capture multi-hop reasoning better than graphs?

HGMem organizes retrieved evidence as hyperedges rather than flat lists or binary graphs, allowing three or more entities to bind into single relations without decomposition. This structure accumulates coherent knowledge across retrieval steps, trading representational complexity for constraint expressiveness.

Research prompt for your LLM

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a graph-RAG systems analyst. The question remains open: as entity graphs scale, do we compress globally (community summaries) or select sharply per query (selective traversal)—or has the regime shifted entirely?

What a curated library found — and when (dated claims, not current truth):
Findings span 2024–2026; treat as perishable.
• Community summaries (Leiden clustering + map-reduce) enable global questions across whole corpora but require precomputation and rebuild overhead (~2024).
• Selective traversal (MCTS + RL policy) navigates only promising nodes, fits context windows, adapts per query, but trades certainty for efficiency (~2025).
• Query-time graph construction (LogicRAG) bypasses persistent graphs entirely, avoiding both precomputation and traversal costs (~2025).
• Hypergraph representations (three-plus-entity relations) reshape both strategies—richer structures favor pre-summarization but complicate selective navigation (~2025).
• Test-time compute scaling and latent reasoning depth suggest reasoning graph topology itself may be optimizable, not fixed (~2025–2026).

Anchor papers (verify; mind their dates):
• arXiv:2404.16130 (2024-04): Graph RAG with community detection & global summarization
• arXiv:2508.06105 (2025-08): Adaptive reasoning without pre-built graphs
• arXiv:2506.05744 (2025-06): Topology of reasoning in large reasoning models
• arXiv:2502.05171 (2025-02): Test-time compute & latent reasoning scaling

Your task:
(1) RE-TEST THE AXIS. For each strategy pair (global compression vs. per-query selection vs. query-time construction), does improved context window size, in-context retrieval speed, or reasoning-token efficiency since mid-2025 dissolve the original trade-off? Has hypergraph reasoning (richer node/edge semantics) become standard, or do most systems still use pairwise edges? Which approach actually scales to 100M+ node corpora now?
(2) Surface the strongest CONTRADICTING finding: if reasoning graph topology is now optimizable (2506.05744, 2502.05171), does that imply pre-built community summaries are inflexible artifacts, or does it vindicate structure-first approaches?
(3) Propose two questions that assume the regime has moved: (a) If test-time compute depth replaces pre-built graph structure, what do 'community summaries' become—runtime abstractions rather than artifacts? (b) Do adaptive reasoning policies now outperform both static strategies, making the original binary choice obsolete?

Cite arXiv IDs; flag anything you cannot ground in a real paper.
