Meta’s new method helps AI say ‘I don’t know’

Meta’s new method helps AI say ‘I don’t know’

Tech in Asia·2025-06-19 13:00

🔍 In one sentence

A new fine-tuning approach helps large language models retain their ability to express uncertainty, reducing the risk of incorrect outputs.

🏛️ Paper by:

University of Cambridge, Meta

✏️ Authors:

William F. Shen et al.

🧠 Key discovery

Traditional fine-tuning often weakens a model’s ability to indicate uncertainty, increasing the likelihood of hallucinated content. The proposed Sparse Entity-aware Tuning (SEAT) method helps maintain this ability while incorporating new information.

📊 Surprising results

Key stat: SEAT recorded an average IDK score of 0.620, compared to 0.293 from standard fine-tuning, showing better retention of uncertainty. Breakthrough: By combining sparse training with entity perturbation, SEAT allows the model to learn without losing the ability to signal gaps in knowledge. Comparison: SEAT improved the expression of uncertainty in new contexts by 112% over standard fine-tuning methods.

📌 Why this matters

This study highlights a gap in current fine-tuning practices, which can undermine safety-related features in language models. Preserving uncertainty awareness is important for applications in domains where incorrect information can have serious consequences.

💡 What are the potential applications?

Healthcare: Helps AI systems avoid making claims without sufficient data, improving patient safety. Finance: Lowers the risk of unreliable financial responses by maintaining uncertainty signaling. Legal: Reduces the chance of fabricated content in unfamiliar legal contexts.

⚠️ Limitations

SEAT increases memory requirements and training time, which could limit its use in settings with limited resources.

👉 Bottom line:

SEAT offers a way to fine-tune language models while keeping their ability to express uncertainty, contributing to safer and more trustworthy AI outputs.

📄 Read the full paper: Don’t Make It Up: Preserving Ignorance Awareness in LLM Fine-Tuning

……

Read full article on Tech in Asia

Other