KAUST alumnus Kareem Khalil, who graduated in 2012 with a master's in mechanical engineering, is now a senior engineer at the Yanbu National Petrochemical Company (Yansab), a SABIC affiliate. Khalil credits KAUST's strong reputation for giving him an edge over colleagues. He advises students to fully enjoy KAUST's environment and research opportunities. Why it matters: This highlights KAUST's role in supplying skilled professionals to key industries in Saudi Arabia, particularly in the petrochemical sector, and its importance in Saudi Arabia's economic diversification.
Mo Li, an assistant professor of bioscience, is featured in a faculty focus article by KAUST. The article appears on the university's Biological and Environmental Science and Engineering Division page. Why it matters: This highlights KAUST's ongoing efforts to showcase faculty expertise and research areas within the university.
KAUST Ph.D. student Khalil Moussi won two awards at the IEEE International Conference on Nano/Micro Engineered and Molecular Systems for his research on a miniaturized drug delivery system. The system, developed in collaboration with KAIMRC, uses 3D printing and wireless power to deliver drugs for coronary artery disease treatment. The device features an electrochemical micro-pump, a 3D printed reservoir with microneedles, and a wireless powering unit, allowing customization for various in vivo drug delivery applications. Why it matters: This recognition highlights KAUST's contributions to biomedical engineering and its potential to develop innovative solutions for critical healthcare challenges in the region and beyond.
The authors introduce Nile-Chat, a collection of LLMs (4B, 3x4B-A6B, and 12B) specifically for the Egyptian dialect, capable of understanding and generating text in both Arabic and Latin scripts. A novel language adaptation approach using the Branch-Train-MiX strategy is used to merge script-specialized experts into a single MoE model. Nile-Chat models outperform multilingual and Arabic LLMs like LLaMa, Jais, and ALLaM on newly introduced Egyptian benchmarks, with the 12B model achieving a 14.4% performance gain over Qwen2.5-14B-Instruct on Latin-script benchmarks; all resources are publicly available. Why it matters: This work addresses the overlooked aspect of adapting LLMs to dual-script languages, providing a methodology for creating more inclusive and representative language models in the Arabic-speaking world.
Researchers from MBZUAI have introduced VideoMolmo, a large multimodal model for spatio-temporal pointing conditioned on textual descriptions. The model incorporates a temporal module with an attention mechanism and a temporal mask fusion pipeline using SAM2 for improved coherence across video sequences. They also curated a dataset of 72k video-caption pairs and introduced VPoS-Bench, a benchmark for evaluating generalization across real-world scenarios, with code and models publicly available.
A new benchmark, LongShOTBench, is introduced for evaluating multimodal reasoning and tool use in long videos, featuring open-ended questions and diagnostic rubrics. The benchmark addresses the limitations of existing datasets by combining temporal length and multimodal richness, using human-validated samples. LongShOTAgent, an agentic system, is also presented for analyzing long videos, with both the benchmark and agent demonstrating the challenges faced by state-of-the-art MLLMs.