GCC AI Research

Search

Results for "SDAIA"

Proceedings of Symposium on Data Mining Applications 2014

arXiv ·

The Symposium on Data Mining and Applications (SDMA 2014) was organized by MEGDAM to foster collaboration among data mining and machine learning researchers in Saudi Arabia, GCC countries, and the Middle East. The symposium covered areas such as statistics, computational intelligence, pattern recognition, databases, Big Data Mining and visualization. Acceptance was based on originality, significance and quality of contribution.

The Saudi Privacy Policy Dataset

arXiv ·

A new dataset called the Saudi Privacy Policy Dataset is introduced, which contains Arabic privacy policies from various sectors in Saudi Arabia. The dataset is annotated based on the 10 principles of the Personal Data Protection Law (PDPL) and includes 1,000 websites, 4,638 lines of text, and 775,370 tokens. The dataset aims to facilitate research and development in privacy policy analysis, NLP, and machine learning applications related to data protection.

Leveraging Social Media Analytics for Sustainability Trend Detection in Saudi Arabias Evolving Market

arXiv ·

This paper explores the use of AI and social media analytics to detect sustainability trends in Saudi Arabia's evolving market, in line with Vision 2030. The study processes millions of social media posts, news articles, and blogs to understand sustainability trends across various sectors. The AI-driven methodology offers sector-specific and cross-sector insights, providing decision-makers with a snapshot of market shifts, and can be adapted to other regions.

Web-Based Expert System for Civil Service Regulations: RCSES

arXiv ·

The paper introduces a web-based expert system called RCSES for civil service regulations in Saudi Arabia. The system covers 17 regulations and utilizes XML for knowledge representation and ASP.net for rule-based inference. RCSES was validated by domain experts and technical users, and compared favorably to other web-based expert systems.

ArabJobs: A Multinational Corpus of Arabic Job Ads

arXiv ·

The ArabJobs dataset is a new corpus of over 8,500 Arabic job advertisements collected from Egypt, Jordan, Saudi Arabia, and the UAE. The dataset contains over 550,000 words and captures linguistic, regional, and socio-economic variation in the Arab labor market. It is available on GitHub and can be used for fairness-aware Arabic NLP and labor market research.

Utilizing Social Media Analytics to Detect Trends in Saudi Arabias Evolving Market

arXiv ·

This paper explores how AI and social media analytics can identify and track trends in Saudi Arabia across sectors such as construction, food and beverage, tourism, technology, and entertainment. The study analyzed millions of social media posts each month, classifying discussions and calculating scores to track trends. The AI-driven methodology was able to predict the emergence and growth of trends by utilizing social media data.

UI-Level Evaluation of ALLaM 34B: Measuring an Arabic-Centric LLM via HUMAIN Chat

arXiv ·

This paper presents a UI-level evaluation of ALLaM-34B, an Arabic-centric LLM developed by SDAIA and deployed in the HUMAIN Chat service. The evaluation used a prompt pack spanning various Arabic dialects, code-switching, reasoning, and safety, with outputs scored by frontier LLM judges. Results indicate strong performance in generation, code-switching, MSA handling, reasoning, and improved dialect fidelity, positioning ALLaM-34B as a robust Arabic LLM suitable for real-world use.