
Enterprise AI Analysis

Towards Open-World Retrieval-Augmented Generation on Knowledge Graph: A Multi-Agent Collaboration Framework

Authors: Jiasheng Xu, Mingda Li, Yongqiang Tang, Peijie Wang, Wensheng Zhang

Publication: arXiv:2509.01238v1 [cs.AI] 1 Sep 2025

Large Language Models (LLMs) struggle with factual errors and knowledge gaps because their training corpora are static. Retrieval-Augmented Generation (RAG) over Knowledge Graphs (KGs) mitigates this, but existing KG-based RAG assumes accessible anchor entities, limiting robustness in open-world settings where linking queries to entities is unreliable. This paper proposes AnchorRAG, a novel multi-agent collaboration (MAC) framework for open-world RAG that requires no predefined anchor entities. It features a predictor agent that dynamically identifies candidate anchor entities, parallel multi-hop exploration by independent retriever agents, and a supervisor agent that formulates the iterative retrieval strategy and synthesizes the resulting knowledge paths. Experiments on four benchmarks show AnchorRAG significantly outperforms existing baselines, achieving state-of-the-art results on real-world question-answering tasks by improving retrieval robustness and mitigating the impact of ambiguous anchor entities.

Executive Impact & Strategic Value

AnchorRAG's innovative multi-agent approach significantly enhances the reliability and performance of AI systems requiring deep knowledge integration, making it a critical advancement for enterprise AI applications.


Current LLM Limitations & The RAG Opportunity

LLMs, despite advanced reasoning, are limited by static training data, leading to factual errors and 'hallucinations'. RAG, particularly with Knowledge Graphs (KGs), offers a solution by integrating real-time external knowledge, significantly boosting accuracy and reliability for knowledge-intensive tasks.

The Open-World Challenge in KG-RAG

Traditional KG-based RAG relies on pre-defined 'anchor entities' for graph traversal. In real-world, 'open-world' scenarios, accurate entity linking from a query is often unreliable, causing existing methods to struggle with imprecise or partial matches from user questions.

Introducing AnchorRAG: Multi-Agent Collaboration

AnchorRAG addresses the open-world challenge through a novel multi-agent collaboration (MAC) framework. It dynamically identifies candidate anchors, performs parallel multi-hop graph explorations, and synthesizes knowledge paths to generate robust answers, even without predefined anchor entities.

Deep Analysis & Enterprise Applications

Select a topic to dive deeper, then explore the specific findings from the research, rebuilt as interactive, enterprise-focused modules.

Methodology Overview
Experimental Validation
Implementation & Metrics

AnchorRAG: A Multi-Agent Framework

AnchorRAG is a pipeline-based Multi-Agent Collaboration (MAC) framework designed for open-world Knowledge Graph Question Answering. It consists of three primary agents working in concert to identify anchors, explore the knowledge graph, and synthesize answers.

Enterprise Process Flow

Predictor Agent (Anchor Entity Identifying)
Multiple Retriever Agents (Parallel Multi-hop Exploration)
Supervisor Agent (Iterative Strategy & Answer Synthesis)

Predictor Agent: Dynamic Anchor Identification

This agent extracts keywords from the user query, corrects typos and misspellings, and uses a semantic matching model to align query terms with KG nodes, yielding candidate anchor entities. It then applies a Relation-Aware Entity Grounding strategy, scoring each candidate by how well its one-hop neighborhood semantically aligns with the question, so that only the most relevant candidates proceed (sketched below).
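A minimal Python sketch of this grounding step, under stated assumptions: the `one_hop_relations` KG accessor and the `embed` text-embedding function are hypothetical stand-ins, as the paper does not publish these interfaces. Each candidate is scored by the average semantic similarity between the question and its neighboring relations, and only the top-scoring candidates are handed to retriever agents.

```python
from typing import Callable, Dict, List

def ground_candidates(
    question: str,
    candidates: List[str],
    one_hop_relations: Callable[[str], List[str]],  # hypothetical: entity -> relation names
    embed: Callable[[str], List[float]],            # hypothetical: text -> dense vector
    top_n: int = 3,
) -> List[str]:
    """Relation-aware entity grounding (sketch): score each candidate anchor
    by how well its one-hop neighborhood aligns with the question."""
    q_vec = embed(question)

    def cosine(a: List[float], b: List[float]) -> float:
        dot = sum(x * y for x, y in zip(a, b))
        na = sum(x * x for x in a) ** 0.5
        nb = sum(y * y for y in b) ** 0.5
        return dot / (na * nb + 1e-9)

    scores: Dict[str, float] = {}
    for entity in candidates:
        relations = one_hop_relations(entity)
        if not relations:
            scores[entity] = 0.0
            continue
        # Average alignment between the question and each neighboring relation.
        scores[entity] = sum(cosine(q_vec, embed(r)) for r in relations) / len(relations)

    return sorted(scores, key=scores.get, reverse=True)[:top_n]
```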

Retriever Agents: Parallel Multi-hop Exploration with Pruning

Each candidate anchor is assigned to an independent retriever agent. These agents conduct parallel, iterative multi-hop graph traversal. They retrieve one-hop neighborhoods (relations and entities) and apply a Rough Pruning and Fine Filtering method to select the most meaningful knowledge paths, reducing noise and computational overhead. An 'early-stopping' mechanism terminates unpromising explorations.
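The sketch below, under the same caveats, illustrates one expansion step for a single retriever agent: rough pruning keeps the top-k triples around each frontier entity, fine filtering keeps the globally most question-relevant paths, and a threshold check implements early stopping. `get_neighbors` and `relevance` are assumed interfaces standing in for the KG accessor and the LLM/embedding scorer described above.

```python
from typing import Callable, List, Tuple

Triple = Tuple[str, str, str]  # (head entity, relation, tail entity)

def explore_one_hop(
    frontier: List[str],
    question: str,
    get_neighbors: Callable[[str], List[Triple]],  # hypothetical KG accessor
    relevance: Callable[[str, Triple], float],     # hypothetical LLM/embedding scorer
    k_per_entity: int = 3,
    width: int = 3,
    stop_threshold: float = 0.2,
) -> Tuple[List[Triple], bool]:
    """One retrieval iteration for a single retriever agent (sketch)."""
    scored: List[Tuple[float, Triple]] = []
    for entity in frontier:
        # Rough pruning: keep only the top-k triples around this entity.
        ranked = sorted(
            ((relevance(question, t), t) for t in get_neighbors(entity)),
            key=lambda pair: pair[0],
            reverse=True,
        )
        scored.extend(ranked[:k_per_entity])

    # Fine filtering: keep the globally most question-relevant paths (width).
    scored.sort(key=lambda pair: pair[0], reverse=True)
    kept = scored[:width]

    # Early stopping: retire this agent if no path looks promising.
    early_stop = not kept or kept[0][0] < stop_threshold
    return [t for _, t in kept], early_stop
```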

Supervisor Agent: Strategy & Synthesis

At each iteration, the supervisor agent aggregates evidence from all active retriever agents. It formulates the iterative retrieval strategy, assesses whether sufficient information is gathered to answer the question, and synthesizes the resulting knowledge paths into a final answer. If information is insufficient, it guides the next retrieval round or directly answers using internal LLM knowledge if no agents remain active or max depth is reached.
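Tying the agents together, a compact orchestration loop might look as follows. This is a sketch of the control flow described above, not the authors' published implementation: `explore` could be the previous sketch's `explore_one_hop` with the KG accessor and scorer bound in, while `sufficient`, `synthesize`, and `llm_answer` are hypothetical LLM-backed callables.

```python
from typing import Callable, Dict, List, Tuple

Triple = Tuple[str, str, str]

def anchorrag_loop(
    question: str,
    anchors: List[str],                                  # from the predictor agent
    explore: Callable[[List[str], str], Tuple[List[Triple], bool]],
    sufficient: Callable[[str, List[Triple]], bool],     # supervisor's adequacy check
    synthesize: Callable[[str, List[Triple]], str],      # supervisor's answer synthesis
    llm_answer: Callable[[str], str],                    # fallback to internal knowledge
    max_depth: int = 3,
) -> str:
    """Supervisor-driven retrieval loop (sketch): one retriever per anchor,
    iterating until the evidence suffices or the agents are exhausted."""
    frontiers: Dict[str, List[str]] = {a: [a] for a in anchors}
    evidence: List[Triple] = []
    for _ in range(max_depth):
        for anchor in list(frontiers):
            triples, stop = explore(frontiers[anchor], question)
            evidence.extend(triples)
            if stop:
                del frontiers[anchor]  # retire an unpromising retriever agent
            else:
                frontiers[anchor] = [t for _, _, t in triples]  # advance to tail entities
        if sufficient(question, evidence):
            return synthesize(question, evidence)
        if not frontiers:
            break
    # No active agents or max depth reached: answer from the LLM's own knowledge.
    return llm_answer(question)
```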

AnchorRAG's Superior Performance

Extensive experiments on four public benchmarks (WebQSP, GrailQA, CWQ, WebQuestions) demonstrate AnchorRAG's significant outperformance over existing baselines, especially in open-world settings and complex QA tasks.

Performance Comparison (Hits@1 with Qwen-Plus)

Method             WebQSP  GrailQA  CWQ   WebQuestions
IO                 63.3    33.2     33.2  56.3
Chain-of-Thought   62.9    32.3     37.6  54.1
Self-Consistency   63.0    33.8     39.2  52.7
PoG                33.1    36.9     39.5  25.3
ToG                66.1    41.9     39.8  56.1
AnchorRAG (Ours)   73.3    62.7     47.0  60.9
AnchorRAG consistently outperforms baselines, with significant gains on complex datasets like GrailQA.

Robustness on Open-World Datasets (Hits@1 with Qwen-Plus)

Method      WebQSP (Normal)  WebQSP (Open-World)  GrailQA (Normal)  GrailQA (Open-World)
ToG         66.1             58.5 (-11.5%)        41.9              32.1 (-23.4%)
AnchorRAG   73.3             71.3 (-2.7%)         62.7              55.4 (-11.6%)
AnchorRAG shows stronger robustness than ToG in open-world scenarios with semantic noise.

Ablation Study (Hits@1 with GPT-4o-mini)

Component                    WebQSP  GrailQA
AnchorRAG (Full)             74.1    63.4
w/o Entity Grounding         67.8    42.8
w/o Parallel Retrieval       72.8    60.2
w/o Triples Fine Filtering   71.6    62.8
Entity Grounding has the greatest impact, emphasizing the importance of precise anchor entities.

Complex Question Answering: F. Scott Fitzgerald & The Mount

Scenario: "Who inspired Scott Fitzgerald that is the architect who designed The Mount?"

Baseline Method: ToG

Outcome: ToG failed to locate both anchor entities, producing meaningless reasoning paths and an incorrect answer; it struggles with ambiguous entity mentions.

Key Visual: ToG's retrieval process fails to identify correct anchors (marked in yellow and green) and includes invalid entities (marked in blue and striped).

Our Method: AnchorRAG

Outcome: Successfully identified 'Edith Wharton' as the correct answer entity using entity grounding and multi-agent exploration. It also demonstrated robustness by pruning irrelevant paths early (e.g., 'The Weapon').

Key Visual: AnchorRAG correctly identifies anchor entities and effectively prunes irrelevant paths, demonstrating robust reasoning.

Key Takeaway: AnchorRAG's multi-agent framework effectively handles complex, ambiguous queries by dynamically identifying anchors and pruning irrelevant paths, showcasing superior reasoning and robustness compared to baselines.

Technical Implementation & Key Metrics

Details on how AnchorRAG is implemented, including the selection of base LLMs, dataset specifics, and hyper-parameter configurations that yield optimal performance.

Core LLMs: GPT-4o-mini and Qwen-Plus serve as the base models for the AnchorRAG framework.
Optimal configuration: a search depth of 3 hops and an expansion width of 3.

Dataset Statistics

Dataset                 #Train  #Validation  #Test
WebQSP                  3,098   -            1,639
GrailQA                 44,337  6,763        13,231
CWQ                     27,734  3,480        3,475
WebQuestions            3,778   -            2,032
WebQSP (Open-World)     3,098   -            1,803
GrailQA (Open-World)    44,337  6,763        1,100
Datasets include multi-hop KGQA and open-domain QA, with open-world versions for robustness testing.

Hyper-parameter Sensitivity

Analysis on GrailQA showed that retaining too many neighboring relations (k) introduces noise and degrades performance, and that choosing an appropriate number of retriever agents (m) is likewise important. The optimal configuration was a search depth of 3 hops and an expansion width of 3, balancing search-space coverage against noise (see the illustrative configuration below).
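Expressed as an illustrative configuration block: only the depth and width values follow the reported optimum; the parameter names and the `k_relations` and `num_retrievers` defaults are assumptions for illustration.

```python
# Hypothetical configuration for an AnchorRAG-style pipeline.
# Only max_depth and expansion_width reflect the paper's reported optimum;
# the remaining values are illustrative assumptions.
ANCHORRAG_CONFIG = {
    "max_depth": 3,        # search depth: up to 3 hops
    "expansion_width": 3,  # paths retained per iteration
    "k_relations": 3,      # neighboring relations kept per entity (assumption)
    "num_retrievers": 3,   # parallel retriever agents, one per candidate anchor (assumption)
}
```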

Calculate Your Potential ROI

Estimate the annual savings and reclaimed hours your organization could achieve by implementing advanced RAG solutions like AnchorRAG.


Your AI Implementation Roadmap

A phased approach to integrating AnchorRAG or similar advanced RAG solutions into your enterprise infrastructure, ensuring a smooth transition and maximum impact.

Phase 1: Discovery & Assessment

Comprehensive analysis of current knowledge workflows, existing LLM usage, and identification of key pain points where RAG can deliver significant value. Define success metrics and select pilot projects.

Phase 2: Proof of Concept (PoC)

Develop a focused PoC using AnchorRAG on a specific, high-impact use case. Integrate with a subset of your knowledge graph and evaluate initial performance against defined benchmarks.

Phase 3: Pilot Deployment & Optimization

Expand AnchorRAG deployment to a pilot team or department. Gather user feedback, refine anchor identification, retrieval strategies, and integrate with existing enterprise applications. Optimize hyper-parameters for your specific data.

Phase 4: Full Scale Integration & Monitoring

Roll out AnchorRAG across relevant departments. Establish robust monitoring and maintenance protocols for continuous performance improvement, knowledge graph updates, and model retraining. Implement security and compliance measures.

Ready to Transform Your Knowledge Operations?

Leverage the power of open-world RAG with our multi-agent collaboration framework. Our experts are ready to guide you.

Ready to Get Started? Book Your Free Consultation.