Enowa and KAUST held the Enowa-KAUST Energy Summit 2024, celebrating the third year of their Energy Cortex Program. The Energy Cortex Program funds applied research for clean energy solutions, focusing on renewable energy technologies led by KAUST faculty. The program is structured around Weatherlytics, GenFlex Cortex, Stor Cortex, and Grid Cortex, and has engaged KAUST professors, produced six journal papers, and provided NEOM with data. Why it matters: This partnership aims to revolutionize renewable energy in Saudi Arabia by integrating AI and advanced data analytics to optimize energy generation and distribution, supporting the Kingdom's sustainable energy goals.
This paper introduces AraLLaMA, a new Arabic large language model (LLM) trained using a progressive vocabulary expansion method inspired by second language acquisition. The model utilizes a modified byte-pair encoding (BPE) algorithm to dynamically extend the Arabic subwords in its vocabulary during training, balancing the out-of-vocabulary (OOV) ratio. Experiments show AraLLaMA achieves performance comparable to existing Arabic LLMs on various benchmarks, and all models, data, and code will be open-sourced. Why it matters: This work addresses the need for more accessible and performant Arabic LLMs, contributing to democratization of AI in the Arab world.
Researchers at MBZUAI have demonstrated a method called "Data Laundering" to artificially boost language model benchmark scores using knowledge distillation. The technique covertly transfers benchmark-specific knowledge, leading to inflated accuracy without genuine improvements in reasoning. The study highlights a vulnerability in current AI evaluation practices and calls for more robust benchmarks.
MBZUAI researchers introduce UniMed-CLIP, a unified Vision-Language Model (VLM) for diverse medical imaging modalities, trained on the new large-scale, open-source UniMed dataset. UniMed comprises over 5.3 million image-text pairs across six modalities: X-ray, CT, MRI, Ultrasound, Pathology, and Fundus, created using LLMs to transform classification datasets into image-text formats. UniMed-CLIP significantly outperforms existing generalist VLMs and matches modality-specific medical VLMs in zero-shot evaluations, improving over BiomedCLIP by +12.61 on average across 21 datasets while using 3x less training data.
KAUST participated in the 16th Conference of the Parties (COP16) to the Convention to Combat Desertification (UNCCD) in Riyadh, showcasing its sustainability innovations. KAUST and the Ministry of Environment, Water and Agriculture (MEWA) announced the launch of a new international water-research center to be headquartered at KAUST. KAUST also entered a SAR100 million agreement with the National Center for Palm and Dates (NCPD) for date-palm sector research. Why it matters: These initiatives highlight KAUST's commitment to advancing Saudi Arabia’s data-driven “green” efforts beyond 2030 and addressing critical environmental issues.
KAUST, the National Livestock and Fisheries Development Program (NLFDP), and the National Research and Development Center for Sustainable Agriculture (Estidama) are collaborating to explore algae-based biostimulants for agriculture. These biostimulants, derived from marine algae, enhance plant growth and nutrient uptake without the negative impacts of chemical fertilizers. KAUST already operates a commercial-scale algae manufacturing plant capable of producing tons of algae per month for biostimulant production. Why it matters: This initiative positions Saudi Arabia as a leader in sustainable food technology by leveraging algae biostimulants to improve soil health and reduce dependence on imported raw materials.
Saudi Arabia has announced the establishment of a new International Water Research Center in partnership between the Ministry of Environment, Water and Agriculture and KAUST. The center will serve as a global platform for applied water research, addressing water economics, security, pollution, and digital monitoring. Headquartered at KAUST, the center seeks to foster national and international cooperation, leveraging KAUST's resources to develop sustainable solutions for water challenges. Why it matters: This initiative signals Saudi Arabia's commitment to addressing critical water challenges and solidifying its leadership in water research and technology within the region.
KAUST and the National Center for Palm and Dates (NCPD) have entered a SR100 million agreement for research projects in the date palm sector. The agreement aims to improve production efficiency, develop innovative agricultural practices, and mitigate economic risks to palm trees. KAUST will leverage its expertise to create a genetic atlas for Saudi date varieties and adopt sustainable management practices. Why it matters: This investment highlights Saudi Arabia's commitment to its cultural heritage and economic diversification through advancements in a key agricultural sector.
Researchers propose a spatio-temporal model for high-resolution wind forecasting in Saudi Arabia using Echo State Networks and stochastic partial differential equations. The model reduces spatial information via energy distance, captures dynamics with a sparse recurrent neural network, and reconstructs data using a non-stationary stochastic partial differential equation approach. The model achieves more accurate forecasts of wind speed and energy, potentially saving up to one million dollars annually compared to existing models.
MBZUAI releases BiMediX2, a bilingual (Arabic-English) Bio-Medical Large Multimodal Model, along with the BiMed-V dataset (1.6M samples) and BiMed-MBench evaluation benchmark. BiMediX2 supports multi-turn conversation in Arabic and English and handles diverse medical imaging modalities. The model achieves state-of-the-art results on medical LLM and LMM benchmarks, outperforming existing methods and GPT-4 in specific evaluations.
The paper introduces Arabic Stable LM, a 1.6B parameter Arabic-centric language model, in both base and chat versions. The Arabic Stable LM 1.6B chat model achieves strong results on several benchmarks, outperforming models with up to 8x more parameters. The study also demonstrates the benefit of incorporating synthetic instruction tuning data through a large synthetic dialogue dataset. Why it matters: This work makes Arabic LLMs more accessible by reducing the parameter size while maintaining strong performance, facilitating deployment in resource-constrained environments.
Terraxy, a KAUST startup, is providing solutions to transform Saudi deserts into fertile landscapes using CarboSoil, an advanced biochar product. CarboSoil improves soil fertility, conserves water, and enhances plant growth, and is tailored to counteract the alkaline nature of Saudi sandy soils. Terraxy is working with Saudi Aramco and NEOM to implement its technologies, servicing over 60,000 native plants. Why it matters: This innovation directly supports the Saudi Green Initiative and offers a sustainable approach to combat desertification and promote food production in the region.