GCC AI Research

Search

Results for "data center"

Datacenters in the Desert: Feasibility and Sustainability of LLM Inference in the Middle East

arXiv ·

This paper analyzes the energy consumption and carbon footprint of LLM inference in the UAE compared to Iceland, Germany, and the USA. The study uses DeepSeek Coder 1.3B and the HumanEval dataset to evaluate code generation. It provides a comparative analysis of geographical trade-offs for climate-aware AI deployment, specifically addressing the challenges and potential of datacenters in desert regions.

Machine Learning Risk Intelligence for Green Hydrogen Investment: Insights for Duqm R3 Auction

arXiv ·

This paper introduces an AI-driven decision support system for green hydrogen investment in Oman, specifically for the Duqm R3 auction. The system uses publicly available meteorological data to predict maintenance pressure on hydrogen infrastructure, creating a Maintenance Pressure Index (MPI). This tool supports regulatory oversight and operational decision-making by enabling temporal benchmarking against performance claims.

SlimPajama-DC: Understanding Data Combinations for LLM Training

arXiv ·

Researchers at MBZUAI release SlimPajama-DC, an empirical analysis of data combinations for pretraining LLMs using the SlimPajama dataset. The study examines the impact of global vs. local deduplication and the proportions of highly-deduplicated multi-source datasets. Results show that increased data diversity after global deduplication is crucial, with the best configuration outperforming models trained on RedPajama.

Proceedings of Symposium on Data Mining Applications 2014

arXiv ·

The Symposium on Data Mining and Applications (SDMA 2014) was organized by MEGDAM to foster collaboration among data mining and machine learning researchers in Saudi Arabia, GCC countries, and the Middle East. The symposium covered areas such as statistics, computational intelligence, pattern recognition, databases, Big Data Mining and visualization. Acceptance was based on originality, significance and quality of contribution.

Wind Speed Forecasting Based on Data Decomposition and Deep Learning Models: A Case Study of a Wind Farm in Saudi Arabia

arXiv ·

A novel wind speed forecasting (WSF) framework is proposed combining Wavelet Packet Decomposition (WPD), Seasonal Adjustment Method (SAM), and Bidirectional Long Short-term Memory (BiLSTM). The SAM method eliminates the seasonal component of the decomposed subseries generated by WPD to reduce forecasting complexity. The model was tested on five years of hourly wind speed observations acquired from the Dumat Al-Jandal wind farm in Al-Jouf, Saudi Arabia, achieving high forecasting accuracy.

Guided Deep List: Automating the Generation of Epidemiological Line Lists from Open Sources

arXiv ·

The paper introduces Guided Deep List, a tool for automating the generation of epidemiological line lists from open source reports. The tool uses distributed vector representations and dependency parsing to extract tabular data on disease outbreaks. It was evaluated on MERS outbreak data in Saudi Arabia, demonstrating improved accuracy over baseline methods and enabling epidemiological inferences.