Highlighting LLM safety: How the Libra-Leaderboard is making AI more responsible

MBZUAI · Significant research

Summary

MBZUAI-based startup LibrAI has launched the Libra-Leaderboard, an evaluation framework for LLMs that assesses both capability and safety. The leaderboard evaluates 26 mainstream LLMs using 57 datasets, assigning scores based on bias, misinformation, and oversensitivity. LibrAI also launched the Interactive Safety Arena to engage the public and educate them on AI safety through adversarial prompt testing. Why it matters: The Libra-Leaderboard provides a benchmark for responsible AI development, emphasizing the importance of aligning AI capabilities with safety considerations in the rapidly evolving LLM landscape.

Keywords

LLM · safety · LibrAI · MBZUAI · leaderboard

Read original article →

Get the weekly digest

Top AI stories from the GCC region, every week.

When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards

arXiv · Feb 1

Researchers from the National Center for AI in Saudi Arabia investigated the sensitivity of Large Language Model (LLM) leaderboards to minor benchmark perturbations. They found that small changes, like choice order, can shift rankings by up to 8 positions. The study recommends hybrid scoring and warns against over-reliance on simple benchmark evaluations, providing code for further research.

Highlighting LLM safety: How the Libra-Leaderboard is making AI more responsible

Summary

Keywords

Related

When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards