A Law Reasoning Benchmark for LLM with Tree-Organized Structures including Factum Probandum, Evidence and Experiences
While progress has been made in legal applications, law reasoning, which is crucial for fair adjudication, remains largely unexplored. We propose a transparent law reasoning schema enriched with hierarchical factum probandum, evidence, and implicit experience, enabling public scrutiny and helping to prevent bias. Based on this schema, we introduce a challenging task: given a textual case description, output a hierarchical structure that justifies the final decision. We also create the first crowd-sourced dataset for this task, enabling comprehensive evaluation. In addition, we propose an agent framework that employs a comprehensive suite of legal analysis tools to address this challenging task. This benchmark paves the way for transparent and accountable AI-assisted law reasoning in the "Intelligent Court".
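To make the schema concrete, the tree-organized structure described above can be sketched as a simple recursive data type: each factum probandum is supported by sub-facts, evidence, and implicit experience linking the evidence to the fact. Note that the field names and the toy example below are illustrative assumptions for exposition, not the dataset's actual schema.

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class ReasoningNode:
    """One node in a tree-organized law reasoning structure (illustrative sketch).

    A factum probandum (fact to be proven) is justified by child sub-facts,
    by pieces of evidence, and by implicit experience (common-sense rules)
    that connect the evidence to the fact.
    """
    factum_probandum: str  # the fact to be proven at this node
    evidence: List[str] = field(default_factory=list)       # supporting evidence items
    experiences: List[str] = field(default_factory=list)    # implicit experience linking evidence to the fact
    children: List["ReasoningNode"] = field(default_factory=list)  # sub-facts

# A toy tree: the root fact is justified by one sub-fact with evidence.
root = ReasoningNode(
    factum_probandum="The defendant is liable for breach of contract.",
    children=[
        ReasoningNode(
            factum_probandum="A valid contract existed between the parties.",
            evidence=["Signed agreement dated 2023-01-05"],
            experiences=["A signed agreement ordinarily indicates mutual assent."],
        )
    ],
)

def count_nodes(node: ReasoningNode) -> int:
    """Count all facts in the tree (root plus descendants)."""
    return 1 + sum(count_nodes(c) for c in node.children)

print(count_nodes(root))  # 2
```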
We provide the source code of our Transparent Law Reasoning Agent (TL Agent for short). For more details, please refer to TLAgent. The system architecture, along with installation and configuration tutorials, is described in the TLAgent Introduction.
Coming soon...
We provide a test dataset and evaluation scripts for assessing model performance on this benchmark. The test dataset can be found at test dataset; for detailed information, please refer to the dataset introduction.
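As a rough illustration of how an evaluation script might score a predicted hierarchical structure against a gold one, here is a minimal sketch that compares flattened node sets with precision, recall, and F1. The flat-list input format and the metric choice are assumptions made for illustration; the actual evaluation scripts may represent trees and score them differently (e.g., also checking parent-child links).

```python
from typing import List, Set, Tuple

def f1_over_nodes(pred: List[str], gold: List[str]) -> Tuple[float, float, float]:
    """Precision, recall, and F1 over sets of predicted vs. gold tree nodes.

    Each node is represented here as a plain string; a full evaluation would
    likely also account for the tree structure connecting the nodes.
    """
    pred_set: Set[str] = set(pred)
    gold_set: Set[str] = set(gold)
    if not pred_set or not gold_set:
        return 0.0, 0.0, 0.0
    overlap = len(pred_set & gold_set)
    precision = overlap / len(pred_set)
    recall = overlap / len(gold_set)
    f1 = 2 * precision * recall / (precision + recall) if overlap else 0.0
    return precision, recall, f1

# Example: two of three gold facts are recovered.
p, r, f = f1_over_nodes(
    pred=["contract existed", "defendant breached"],
    gold=["contract existed", "defendant breached", "damages occurred"],
)
print(round(p, 2), round(r, 2), round(f, 2))  # 1.0 0.67 0.8
```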
@misc{shen2025lawreasoningbenchmarkllm,
title={A Law Reasoning Benchmark for LLM with Tree-Organized Structures including Factum Probandum, Evidence and Experiences},
author={Jiaxin Shen and Jinan Xu and Huiqi Hu and Luyi Lin and Fei Zheng and Guoyang Ma and Fandong Meng and Jie Zhou and Wenjuan Han},
year={2025},
eprint={2503.00841},
archivePrefix={arXiv},
primaryClass={cs.AI},
url={https://arxiv.org/abs/2503.00841},
}