Skip to content
GCC AI Research

Archive Monthly

March 2025

22 articles

Top Stories

RP-SAM2: Refining Point Prompts for Stable Surgical Instrument Segmentation

arXiv · · CV Research

Researchers from MBZUAI introduced RP-SAM2, a method to improve surgical instrument segmentation by refining point prompts for more stable results. RP-SAM2 uses a novel shift block and compound loss function to reduce sensitivity to point prompt placement, improving segmentation accuracy in data-constrained settings. Experiments on the Cataract1k and CaDIS datasets show that RP-SAM2 enhances segmentation accuracy and reduces variance compared to SAM2, with code available on GitHub.

KAUST scientists see the first steps of life in DNA unwinding

KAUST · · Research Healthcare

KAUST researchers have captured the initial unwinding of DNA using cryo-electron microscopy and deep learning. The study details 15 atomic states describing how the Simian Virus 40 Large Tumor Antigen helicase unwinds DNA, revealing the coordinated roles of DNA, helicases, and ATP. The research elucidates the fundamental mechanisms of DNA replication, a cornerstone of growth and reproduction. Why it matters: This detailed understanding of helicase function could lead to advances in nanotechnology and our understanding of genetic processes.

SaudiCulture: A Benchmark for Evaluating Large Language Models Cultural Competence within Saudi Arabia

arXiv · · Research LLM

The paper introduces SaudiCulture, a new benchmark for evaluating the cultural competence of LLMs within Saudi Arabia, covering five major geographical regions and diverse cultural domains. The benchmark includes questions of varying complexity and distinguishes between common and specialized regional knowledge. Evaluations of five LLMs (GPT-4, Llama 3.3, FANAR, Jais, and AceGPT) revealed performance declines on region-specific questions, highlighting the need for region-specific knowledge in LLM training.

SALT: Parameter-Efficient Fine-Tuning via Singular Value Adaptation with Low-Rank Transformation

arXiv · · Research Healthcare

Researchers introduce SALT, a parameter-efficient fine-tuning method for medical image segmentation that combines singular value adaptation with low-rank transformation. SALT selectively adapts influential singular values and complements this with a low-rank update for the remaining subspace. Experiments on five medical datasets show SALT outperforms state-of-the-art PEFT methods by 2-5% in Dice score with only 3.9% trainable parameters.

Towards Unified and Lossless Latent Space for 3D Molecular Latent Diffusion Modeling

arXiv · · Research Healthcare

The paper introduces UAE-3D, a multi-modal VAE for 3D molecule generation that compresses molecules into a unified latent space, maintaining near-zero reconstruction error. This approach simplifies latent diffusion modeling by eliminating the need to handle multi-modality and equivariance separately. Experiments on GEOM-Drugs and QM9 datasets show UAE-3D establishes new benchmarks in de novo and conditional 3D molecule generation, with significant improvements in efficiency and quality.

KAUST researchers develop new nanoplastic for sustainable street lighting

KAUST · · Research KAUST

KAUST and KACST researchers have developed a nanoPE nanoplastic that improves LED streetlight energy efficiency by enhancing thermal radiation emission and reducing LED temperature. The nanoPE coating allows infrared light to pass through while reflecting visible light, optimizing illumination. Simulations suggest that adopting this technology in the US could reduce carbon dioxide emissions by over one million metric tons. Why it matters: This innovation offers a sustainable lighting solution with significant potential for reducing energy consumption and carbon emissions in Saudi Arabia and globally.

Tracking Meets Large Multimodal Models for Driving Scenario Understanding

arXiv · · LLM CV

Researchers at MBZUAI have introduced a novel approach to enhance Large Multimodal Models (LMMs) for autonomous driving by integrating 3D tracking information. This method uses a track encoder to embed spatial and temporal data, enriching visual queries and improving the LMM's understanding of driving scenarios. Experiments on DriveLM-nuScenes and DriveLM-CARLA benchmarks demonstrate significant improvements in perception, planning, and prediction tasks compared to baseline models.

Smart water: KAUST researchers bring tailored desalination solutions to KSA farming challenges

KAUST · · Research Partnership

KAUST researchers are partnering with Saudi farmers and the Ministry of Environment, Water and Agriculture (MEWA) to develop tailored desalination solutions for agriculture. A new KAUST Center of Excellence project aims to integrate controlled environment agriculture (CEA) with desalination of non-conventional water resources for hydroponic farming. The approach focuses on selective ion removal to provide 'clean-enough' water, reducing energy use and costs compared to traditional desalination. Why it matters: This initiative could enable more sustainable and affordable local crop production in Saudi Arabia, potentially shifting the Kingdom from importing to exporting agricultural technologies.

The future of batteries: It is in your closet

KAUST · · Research Partnership

KAUST researchers, in collaboration with KACST, discovered that dissolving nylon in battery electrolytes improves the performance of lithium-metal batteries. The nylon additive resulted in more efficient batteries with longer lifespans and fewer unwanted reactions. The research was published in ACS Energy Letters and Energy Environmental Science. Why it matters: This promises cheaper, safer, and more powerful lithium batteries for applications in electric vehicles and aviation, supporting Saudi Arabia's renewable energy goals.

KISTI Establishes Strategic Partnerships with Kuwaiti Institutions in AI and Data Science - 한국과학기술정보연구원

Kuwait AI · · Partnership Research

KISTI (Korea Institute of Science and Technology Information) has announced the establishment of strategic partnerships with various institutions in Kuwait. These collaborations are specifically focused on advancing research and development in the fields of Artificial Intelligence and Data Science. The initiative aims to foster cooperation and knowledge exchange between South Korea and Kuwait in these critical technological areas. Why it matters: This partnership signifies a concerted effort to enhance AI and data science capabilities within Kuwait through international collaboration, potentially accelerating technological growth and innovation in the region.

KAUST developing robotic system to improve date palm harvesting

KAUST · · Robotics AI

KAUST is developing a robotic system for automated date palm harvesting, combining robotics and AI. The system uses robotic arms with visual sensors to identify and harvest dates, flowers, and tree structures. Field trials are scheduled for the 2025 harvest season, with full operational capability expected within three years. Why it matters: This innovation could transform Saudi Arabia's date farming industry, increasing yields, reducing labor risks, and positioning the country as a leader in agricultural technology.

The AI Pentad, the CHARME$^{2}$D Model, and an Assessment of Current-State AI Regulation

arXiv · · Policy Ethics

This paper introduces the AI Pentad model, comprising humans/organizations, algorithms, data, computing, and energy, as a framework for AI regulation. It also presents the CHARME²D Model to link the AI Pentad with regulatory enablers like registration, monitoring, and enforcement. The paper assesses AI regulatory efforts in the EU, China, UAE, UK, and US using the CHARME²D model, highlighting strengths and weaknesses.

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

arXiv · · LLM NLP

MBZUAI researchers introduce LLMVoX, a 30M-parameter, LLM-agnostic, autoregressive streaming text-to-speech (TTS) system that generates high-quality speech with low latency. The system preserves the capabilities of the base LLM and achieves a lower Word Error Rate compared to speech-enabled LLMs. LLMVoX supports seamless, infinite-length dialogues and generalizes to new languages with dataset adaptation, including Arabic.