Addressing NLP problems in low resource settings

MBZUAI · Notable

NLP Research Arabic AI LLM Data Augmentation

Summary

Thamar Solorio from the University of Houston will discuss machine learning approaches for spontaneous human language processing. The talk will cover adapting multilingual transformers to code-switching data and using data augmentation for domain adaptation in sequence labeling tasks. Solorio will also provide an overview of other research projects at the RiTUAL lab, focusing on the scarcity of labeled data. Why it matters: This presentation addresses key challenges in Arabic NLP related to data scarcity, which is a persistent obstacle in developing effective AI applications for the region.

Keywords

low resource · code-switching · multilingual transformers · data augmentation · domain adaptation

Read original article →

Get the weekly digest

Top AI stories from the GCC region, every week.