Researchers from Georgia Tech explored Arabic medical text classification using 82 categories from the AbjadMed dataset. They compared fine-tuned AraBERTv2 encoders with hybrid pooling against multilingual encoders and large causal decoders like Llama 3.3 70B and Qwen 3B. The study found that bidirectional encoders outperformed causal decoders in capturing semantic boundaries for fine-grained medical text classification. Why it matters: The research provides insights into optimal model selection for specialized Arabic NLP tasks, specifically highlighting the effectiveness of fine-tuned encoders for medical text categorization.
MEDAD, a KAUST spin-off, won the 2020 MEED Sustainability Medal for its "Innovative Hybrid Solar Desalination Cycle." The MEDAD hybrid cycle desalinates seawater using solar energy at 60-80 degrees Celsius, combining adsorption with multi-effect desalination. The cycle achieved performance levels of 20% of thermodynamic limits and a water production cost of $0.48/m3. Why it matters: This award recognizes the potential of KAUST-developed technology to address critical water scarcity challenges in the GCC region through sustainable and cost-effective desalination.
MBZUAI researchers introduce BiMediX, a bilingual (English and Arabic) mixture of experts LLM for medical applications. The model is trained on BiMed1.3M, a new 1.3 million bilingual instruction dataset and outperforms existing models like Med42 and Jais-30B on medical benchmarks. Code and models are available on Github.
MBZUAI releases BiMediX2, a bilingual (Arabic-English) Bio-Medical Large Multimodal Model, along with the BiMed-V dataset (1.6M samples) and BiMed-MBench evaluation benchmark. BiMediX2 supports multi-turn conversation in Arabic and English and handles diverse medical imaging modalities. The model achieves state-of-the-art results on medical LLM and LMM benchmarks, outperforming existing methods and GPT-4 in specific evaluations.
The Russian Immune Diversity Atlas project aims to profile immune cells from people of different ancestries at a multiomics level. The goal is to reconstruct a reference atlas of the healthy immune system and investigate its perturbations in Type II Diabetes (T2D). The project seeks to identify novel mechanisms and genetic/epigenetic markers for early T2D diagnostics, prognosis, and therapy as part of the international Human Cell Atlas. Why it matters: Addressing genetic diversity in biomedical research, particularly in the context of the Human Cell Atlas, is crucial for personalized medicine and ensuring that treatments are effective across diverse populations in the Middle East and globally.
This paper introduces a method using Stable Diffusion XL (SDXL) fine-tuned with LoRA to generate culturally relevant coloring templates based on Emirati Al-Sadu weaving patterns for mental health therapy. The approach aims to leverage coloring therapy's stress-relieving benefits while embedding cultural resonance, potentially aiding in the treatment of Generalized Anxiety Disorder (GAD). Future research will explore the impact of Emirati heritage art on Emirati individuals using biosignals to assess engagement and effectiveness.
This paper benchmarks the performance of large language models (LLMs) on Arabic medical natural language processing tasks using the AraHealthQA dataset. The study evaluated LLMs in multiple-choice question answering, fill-in-the-blank, and open-ended question answering scenarios. The results showed that a majority voting solution using Gemini Flash 2.5, Gemini Pro 2.5, and GPT o3 achieved 77% accuracy on MCQs, while other LLMs achieved a BERTScore of 86.44% on open-ended questions. Why it matters: The research highlights both the potential and limitations of current LLMs in Arabic clinical contexts, providing a baseline for future improvements in Arabic medical AI.