MBZUAI's Institute of Foundation Models (IFM) has released K2 Think V2, a 70 billion parameter open-source general reasoning model built on K2 V2 Instruct. The model excels in complex reasoning benchmarks like AIME2025 and GPQA-Diamond, and features a low hallucination rate with long context reasoning capabilities. K2 Think V2 is fully sovereign and open, from pre-training through post-training, using IFM-curated data and a Guru dataset. Why it matters: This release contributes to closing the gap between community-owned reproducible AI and proprietary models, particularly in reasoning and long-context understanding for Arabic NLP tasks.
MBZUAI researchers have developed K2 Think, an open-source AI reasoning system for interpretable energy decisions. K2 Think uses long chain-of-thought supervised fine-tuning and reinforcement learning to improve accuracy on multi-step reasoning in complex energy problems. The system breaks down challenges into smaller, auditable steps and uses test-time scaling for real-time adaptation. Why it matters: The open-source nature of K2 Think promotes transparency, trust, and compliance in critical energy environments while allowing secure deployment on sovereign infrastructure.
KAUST Discovery highlighted Prof. Karl Leo's insights on translating science into business from an Entrepreneurship Center speaker series. Prof. Leo, with 440 publications and 8 co-founded companies, emphasized the importance of curiosity-driven basic research. He envisions organic semiconductors dominating electronics in 20-30 years, noting the success of Novaled, his OLED company in Dresden. Why it matters: This underscores KAUST's focus on fostering entrepreneurship and translating research into practical applications within the Kingdom.
KAUST's Dean of Biological and Environmental Science and Engineering, Prof. Pierre Magistretti, advised new students to focus on "big questions" in science. He emphasized curiosity, passion, and balancing self-criticism with confidence as guiding principles. Magistretti encouraged students to question existing paradigms and embrace uncertainty in their research. Why it matters: This guidance from a KAUST leader highlights the institution's focus on fostering innovative and impactful research among its students, which can contribute to advancements in science and technology in the region.
Dr. David Edwards from Harvard University spoke at KAUST about creativity in innovative communities. He believes that we are at the dawn of a grassroots renaissance in the arts, sciences and engineering. Edwards highlighted the importance of learning, experimentation, and production centers in fostering innovation. Why it matters: This talk suggests KAUST is looking to foster a cross-disciplinary culture of innovation, aligning with broader trends in AI and technology development that require diverse skill sets.
A new benchmark, LongShOTBench, is introduced for evaluating multimodal reasoning and tool use in long videos, featuring open-ended questions and diagnostic rubrics. The benchmark addresses the limitations of existing datasets by combining temporal length and multimodal richness, using human-validated samples. LongShOTAgent, an agentic system, is also presented for analyzing long videos, with both the benchmark and agent demonstrating the challenges faced by state-of-the-art MLLMs.