Skip to content
GCC AI Research

Search

Results for "coding"

20,000 UAE Students to Learn Coding Under New National Initiative

MBZUAI ·

The UAE's National Programme for Coders will train 20,000 students in coding across eight universities, including MBZUAI and Khalifa University. The program includes 500 training opportunities at local and international companies. Amazon, Huawei, and IBM will launch digital libraries providing resources on AI, data science, and other technologies. Why it matters: This initiative aims to bolster the UAE's AI talent pool and enhance graduates' competitiveness in the job market through practical coding skills.

Thuwal students meet Shaheen

KAUST ·

Students and teachers from Thuwal schools visited KAUST for computer-oriented activities on February 7. The activities included a practical computer coding lesson inspired by "Hour of Code," where participants used Mac computers to work through an online tutorial. Students and teachers also toured the supercomputing facilities in the KAUST Core Labs led by Bilel Hadri of the ECRC. Why it matters: Such outreach programs help promote STEM education and engagement with advanced computing resources among local students.

Can we tell when AI wrote that code? This project thinks so, even when the AI tries to hide it

MBZUAI ·

MBZUAI researchers introduced Droid, a resource suite and detector family, at EMNLP 2025 designed to distinguish between AI-generated and human-written code. The project addresses the challenge of identifying AI-generated code in software development, considering the prevalence of AI-suggested code and the risks of obfuscated backdoors and feedback loops. DroidCollection includes over one million code samples across seven programming languages, three coding domains, and outputs from 43 different code models, including human-AI co-authored code and adversarially humanized machine code. Why it matters: This research is crucial for maintaining software security and integrity in the age of AI-assisted coding, providing a robust tool for detecting AI-generated code across diverse languages and domains.

Exploring bioinformatics

KAUST ·

KAUST researchers organized a week-long workshop on bioinformatics, covering genomics and transcriptomics data analysis. The workshop targeted students, postdocs, and senior researchers, providing hands-on training in coding and analysis using tools like R, Python, and shell scripts. Attendees with little prior computational biology experience were introduced to fundamental concepts and tools for handling large sequencing datasets. Why it matters: The workshop addresses the increasing need for bioinformatics expertise at KAUST and in the region, crucial for advancing research in fields like evolution and complex diseases.

Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs

arXiv ·

MBZUAI researchers introduce Web2Code, a new large-scale dataset and evaluation framework for training and benchmarking multimodal LLMs on webpage understanding and HTML code generation. The dataset includes webpage images, HTML code, and QA pairs about webpage content. Experiments demonstrate the dataset's utility in webpage understanding, code generation, and general visual domain tasks, with code and data available on Github.

Web2Code: A new dataset to enhance multimodal LLM performance presented at NeurIPS

MBZUAI ·

MBZUAI researchers introduced Web2Code, a new dataset suite, at NeurIPS to enhance multimodal LLM performance in web page analysis and HTML generation. The suite includes a fine-tuning dataset and two benchmark datasets. Instruction tuning with Web2Code improved performance on specialized tasks without affecting general capabilities. Why it matters: This contribution addresses a key limitation in current multimodal LLMs, potentially boosting productivity in web design and development by providing targeted training data.

At the forefront of programming models

KAUST ·

KAUST held its second hackathon and third NVIDIA workshop. Attendees listened to lectures from international experts. Participants worked on porting their scientific applications to a GPU accelerator. Why it matters: Such events help build regional expertise in accelerated computing and attract international collaboration.

Generative Artificial Intelligence in RNA Biology

MBZUAI ·

Researchers at the Rosalind Franklin Institute are using generative AI, including GANs, to augment limited biological datasets, specifically mirtron data from mirtronDB. The synthetic data created mimics real-world samples, facilitating more comprehensive training of machine learning models, leading to improved mirtron identification tools. They also plan to apply Large Language Models (LLMs) to predict unknown patterns in sequence and structure biology problems. Why it matters: This research explores AI techniques to tackle data scarcity in biological research, potentially accelerating discoveries in noncoding RNA and transposable elements.