β¨ KG-TRACES: Unleashing Explainable Reasoning in LLMs with Knowledge Graphs β¨
This repository contains the official implementation of KG-TRACES, a novel framework that enhances the reasoning ability of Large Language Models (LLMs) through explicit supervision over reasoning paths and processes. KG-TRACES aims to provide explainable, accurate, and traceable reasoning by leveraging the power of Knowledge Graphs.
For more details, refer to the accompanying paper: KG-TRACES: Enhancing Large Language Models with Knowledge Graph-constrained Trajectory Reasoning and Attribution Supervision
The full codebase and more information can be found on the official GitHub repository: https://github.com/Edaizi/KG-TRACES
π‘ Our Solution: KG-TRACES
KG-TRACES is a novel framework that explicitly teaches LLMs how to reason by supervising their internal "thought process" with knowledge graphs guidance. We guide them to:
- πΊοΈ Chart the Course: Predict symbolic knowledge graph reasoning paths from question to answer.
- π Show Their Work: Generate attribution-aware reasoning explanations, clearly claim whether each step comes from the KG or the LLM's internal knowledge π§ , and how effective it was!
π Why KG-TRACES Rocks
- π Crystal-Clear Explanations: Understand why the LLM reached its conclusion.
- π‘οΈ Trustworthy & Attributable: Know the evidence source of each reasoning step.
- πͺ Robust Performance: Excels even with limited or no direct KG access during inference.
- π Versatile: Shows strong generalization to specialized fields like medicine.
π Quickstart: Pretrained Models
You can easily load our fine-tuned KG-TRACES models from the Hugging Face Model Hub using the transformers library:
from transformers import AutoModelForCausalLM, AutoTokenizer
model_hub_name = "Edaizi/KG-TRACES"
tokenizer = AutoTokenizer.from_pretrained(model_hub_name)
model = AutoModelForCausalLM.from_pretrained(model_hub_name)
π Datasets
We've meticulously prepared augmented SFT datasets for WebQSP and CWQ, packed with reasoning paths and augmented reasoning processes with source attributions. Find them on Hugging Face:
You can load these datasets as follows:
from datasets import load_dataset
webqsp_sft_data = load_dataset("Edaizi/KG-TRACES-WebQSP")
cwq_sft_data = load_dataset("Edaizi/KG-TRACES-CWQ")
π Citation
If KG-TRACES helps your research or project, we'd love a shout-out! Please cite:
@misc{wu2025kgtracesenhancinglargelanguage,
title={KG-TRACES: Enhancing Large Language Models with Knowledge Graph-constrained Trajectory Reasoning and Attribution Supervision},
author={Rong Wu and Pinlong Cai and Jianbiao Mei and Licheng Wen and Tao Hu and Xuemeng Yang and Daocheng Fu and Botian Shi},
year={2025},
eprint={2506.00783},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2506.00783},
}
- Downloads last month
- 19