Skip to content
GCC AI Research

Search

Results for "MASARAT SA"

Mubeen AI: A Specialized Arabic Language Model for Heritage Preservation and User Intent Understanding

arXiv ·

MASARAT SA has developed Mubeen, a proprietary Arabic language model specializing in Arabic linguistics, Islamic studies, and cultural heritage. Mubeen was trained using native Arabic sources, including digitized historical manuscripts processed via a proprietary Arabic OCR engine. The model employs a Practical Closure Architecture to improve user intent understanding and provide decisive guidance. Why it matters: Mubeen addresses the utility gap in current Arabic LLMs by focusing on native Arabic data and cultural authenticity, which is critical for heritage preservation and alignment with Saudi Vision 2030.

Masader Plus: A New Interface for Exploring +500 Arabic NLP Datasets

arXiv ·

Researchers have developed Masader Plus, a web interface for browsing the Masader catalog of Arabic NLP datasets. The interface allows for data exploration, filtration, and API access to examine datasets. User interactions with the website are intended to provide a way to improve the dataset catalog itself. Why it matters: This interface lowers the barrier to entry for researchers seeking Arabic NLP datasets, facilitating more research in the field.

Overview of the Arabic Sentiment Analysis 2021 Competition at KAUST

arXiv ·

KAUST organized an Arabic Sentiment Analysis Challenge where participants developed ML models to classify tweets as positive, negative, or neutral. The competition used the ASAD dataset with 55K tweets for training, 20K for validation, and 20K for final evaluation. The full dataset of 100K labeled tweets has been released for public use.

From Descartes to Morin

KAUST ·

Dominique Sciamma, Managing Director at Strate School of Design in France, gave a presentation at KAUST during Enrichment in the Fall of 2017. The title of the presentation was "From Descartes to Morin." The event was held at King Abdullah University of Science and Technology. Why it matters: While the event is dated, KAUST's ongoing enrichment programs contribute to fostering a culture of innovation and knowledge exchange in Saudi Arabia.

Masader: Metadata Sourcing for Arabic Text and Speech Data Resources

arXiv ·

Researchers created Masader, the largest public catalog for Arabic NLP datasets, containing 200 datasets annotated with 25 attributes. They developed a metadata annotation strategy applicable to other languages. The paper highlights issues within current Arabic NLP datasets and suggests recommendations. Why it matters: This curated dataset directory helps lower the barrier to entry for Arabic NLP research and development.

Monsha'at and KAUST sign MoU to support and empower entrepreneurs

KAUST ·

Monsha'at and KAUST signed a MoU at the Biban 24 forum to support entrepreneurs and SME owners through joint programs. The collaboration aims to remove barriers to entrepreneurship by designing new services and providing specialized support. It also facilitates expertise exchange, joint projects, and training programs like the "Monsha'at Academy." Why it matters: This partnership between a key SME authority and a leading research university can strengthen Saudi Arabia's entrepreneurship ecosystem and contribute to Vision 2030's economic diversification goals.

ASAD: A Twitter-based Benchmark Arabic Sentiment Analysis Dataset

arXiv ·

Researchers introduce ASAD, a new large-scale, high-quality Arabic Sentiment Analysis Dataset based on 95K tweets with positive, negative, and neutral labels. The dataset is launched with a competition sponsored by KAUST offering a total of 17000 USD in prizes. Baseline models are implemented and results reported to provide a reference for competition participants.