GCC AI Research

Search

Results for "Frontier Technologies"

Domain Adaptable Fine-Tune Distillation Framework For Advancing Farm Surveillance

arXiv ·

The paper introduces a framework for camel farm monitoring using a combination of automated annotation and fine-tune distillation. The Unified Auto-Annotation framework uses GroundingDINO and SAM to automatically annotate surveillance video data. The Fine-Tune Distillation framework then fine-tunes student models like YOLOv8, transferring knowledge from a larger teacher model, using data from Al-Marmoom Camel Farm in Dubai.

UAE to deploy 8 exaflop supercomputer in India to strengthen local sovereign AI infrastructure

MBZUAI ·

G42 and Cerebras, in partnership with MBZUAI and C-DAC, will deploy an 8 exaflop AI supercomputer in India. The system will operate under India's governance frameworks, with all data remaining within national jurisdiction to meet sovereign security and compliance requirements. The supercomputer will be accessible to Indian researchers, startups, and government entities under the India AI Mission.

Datacenters in the Desert: Feasibility and Sustainability of LLM Inference in the Middle East

arXiv ·

This paper analyzes the energy consumption and carbon footprint of LLM inference in the UAE compared to Iceland, Germany, and the USA. The study uses DeepSeek Coder 1.3B and the HumanEval dataset to evaluate code generation. It provides a comparative analysis of geographical trade-offs for climate-aware AI deployment, specifically addressing the challenges and potential of datacenters in desert regions.

Machine Learning Risk Intelligence for Green Hydrogen Investment: Insights for Duqm R3 Auction

arXiv ·

This paper introduces an AI-driven decision support system for green hydrogen investment in Oman, specifically for the Duqm R3 auction. The system uses publicly available meteorological data to predict maintenance pressure on hydrogen infrastructure, creating a Maintenance Pressure Index (MPI). This tool supports regulatory oversight and operational decision-making by enabling temporal benchmarking against performance claims.

VideoMolmo: Spatio-Temporal Grounding Meets Pointing

arXiv ·

Researchers from MBZUAI have introduced VideoMolmo, a large multimodal model for spatio-temporal pointing conditioned on textual descriptions. The model incorporates a temporal module with an attention mechanism and a temporal mask fusion pipeline using SAM2 for improved coherence across video sequences. They also curated a dataset of 72k video-caption pairs and introduced VPoS-Bench, a benchmark for evaluating generalization across real-world scenarios, with code and models publicly available.

MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT

arXiv ·

Researchers from MBZUAI have released MobiLlama, a fully transparent open-source 0.5 billion parameter Small Language Model (SLM). MobiLlama is designed for resource-constrained devices, emphasizing enhanced performance with reduced resource demands. The full training data pipeline, code, model weights, and checkpoints are available on Github.

Spot-the-Camel: Computer Vision for Safer Roads

arXiv ·

Researchers in Saudi Arabia are applying computer vision techniques to reduce Camel-Vehicle Collisions (CVCs). They tested object detection models including CenterNet, EfficientDet, Faster R-CNN, SSD, and YOLOv8 on the task, finding YOLOv8 to be the most accurate and efficient. Future work will focus on developing a system to improve road safety in rural areas.