MBZUAI researchers presented FIRE, a new fact-checking framework for LLM outputs, at NAACL 2025. FIRE first assesses the LLM's confidence in a claim before searching the web, reducing computational cost. It also retains knowledge gained from web searches to help verify subsequent claims. Why it matters: This approach makes automatic verification of LLM outputs more efficient and cost-effective, addressing a key limitation in their reliability.
FIRE is an agent-based framework for fact-checking long-form text. It tightly integrates evidence retrieval and claim verification in an iterative loop, at each step deciding whether to give a final verdict or generate the next search query. Experiments show FIRE matches the performance of existing methods while reducing LLM costs by 7.6x and search costs by 16.5x.
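The answer-or-search loop described above can be sketched as follows. This is an illustrative sketch only, not the FIRE implementation: `llm_verify` and `web_search` are hypothetical stubs standing in for real LLM and search-API calls, and the confidence threshold is an assumed parameter.

```python
def llm_verify(claim, evidence):
    """Stub verifier: returns (verdict, confidence), with verdict None when
    the model is not yet sure. A real system would prompt an LLM here."""
    if "capital of France is Paris" in claim:
        return "SUPPORTED", 0.95   # confident from parametric knowledge alone
    if evidence:                    # evidence gathered -> commit to a verdict
        return "REFUTED", 0.8
    return None, 0.3                # not confident yet -> keep searching

def web_search(query):
    """Stub for a web search API; returns evidence snippets."""
    return [f"snippet about: {query}"]

def fire_check(claim, max_steps=3, threshold=0.7):
    """FIRE-style loop: answer early when confident, otherwise search again."""
    evidence = []
    for _ in range(max_steps):
        verdict, conf = llm_verify(claim, evidence)
        if verdict is not None and conf >= threshold:
            return verdict, evidence   # early exit saves LLM and search calls
        # Otherwise issue the next search query (here: simply the claim itself)
        evidence.extend(web_search(claim))
    verdict, _ = llm_verify(claim, evidence)
    return verdict or "NOT ENOUGH INFO", evidence
```

A claim the model is already confident about exits before any search is issued, which is where the cost savings come from.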
This paper introduces ProgramFC, a fact-checking model that decomposes complex claims into simpler sub-tasks drawn from a shared library of specialized functions. The model uses LLMs to generate reasoning programs and executes them by delegating each sub-task to its corresponding function, enhancing explainability and data efficiency. Experiments on fact-checking datasets demonstrate ProgramFC's superior performance compared to baseline methods, with publicly available code and data.
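The program-guided idea can be illustrated with a toy example. The sketch below is not ProgramFC's actual function library: the reasoning program is hard-coded rather than LLM-generated, and `answer_question` / `verify_claim` are hypothetical stubs for the QA and verification sub-modules.

```python
def answer_question(question):
    """Stub QA sub-module (would be an LLM- or retrieval-based QA system)."""
    kb = {"Who directed Inception?": "Christopher Nolan"}
    return kb.get(question, "unknown")

def verify_claim(claim):
    """Stub verification sub-module (would be an LLM-based entailment check)."""
    known_facts = {"Christopher Nolan is British."}
    return claim in known_facts

def check_complex_claim():
    # Reasoning program for the compound claim:
    #   "The director of Inception is British."
    # Sub-task 1: resolve the bridging entity via the QA function.
    director = answer_question("Who directed Inception?")
    # Sub-task 2: verify the simplified claim via the verification function.
    return verify_claim(f"{director} is British.")
```

Because each step is an explicit function call, the program itself serves as an explanation of how the verdict was reached.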
MBZUAI researchers release OpenFactCheck, a unified framework to evaluate the factual accuracy of large language models. The framework includes modules for response evaluation, LLM evaluation, and fact-checker evaluation. OpenFactCheck is available on GitHub as an open-source Python library and as a web service.
Researchers from MBZUAI have introduced UrduFactCheck, a new framework for fact-checking in Urdu, along with two datasets: UrduFactBench and UrduFactQA. The framework uses monolingual and translation-based evidence retrieval to address the lack of Urdu resources. Evaluations using twelve LLMs showed that translation-augmented methods improve performance, highlighting challenges for open-source LLMs in Urdu.
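The translation-augmented retrieval strategy can be sketched as a simple fallback: try monolingual Urdu retrieval first, and when it returns too little evidence, translate the claim and retrieve from English sources. All function names here (`search_urdu`, `translate_to_english`, `search_english`) are hypothetical placeholders, not the UrduFactCheck API.

```python
def search_urdu(query):
    """Stub monolingual retriever over Urdu sources."""
    return []   # simulate the sparse Urdu evidence the paper describes

def translate_to_english(query):
    """Stub claim translator (a real system would call an MT model)."""
    return f"[en] {query}"

def search_english(query):
    """Stub retriever over English sources."""
    return [f"evidence for: {query}"]

def retrieve_evidence(urdu_claim, min_results=1):
    """Monolingual retrieval first; translation-based retrieval as fallback."""
    evidence = search_urdu(urdu_claim)
    if len(evidence) < min_results:
        evidence += search_english(translate_to_english(urdu_claim))
    return evidence
```

The fallback only fires when monolingual retrieval comes up short, which mirrors the paper's finding that translation augmentation improves performance where Urdu resources are scarce.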