Skip to content
GCC AI Research

Topics

Multilingual

2 articles RSS ↗

LlamaLens: Specialized Multilingual LLM for Analyzing News and Social Media Content

arXiv · · NLP LLM

Researchers have introduced LlamaLens, a specialized multilingual LLM designed for analyzing news and social media content. The model addresses domain specificity and multilinguality, with a focus on news and social media in Arabic, English, and Hindi. LlamaLens was evaluated on 18 tasks represented by 52 datasets, outperforming the state-of-the-art on 23 testing sets. Why it matters: This work contributes a valuable resource for multilingual NLP research, particularly in the context of analyzing news and social media content across diverse languages.

Bactrian-X: Multilingual Replicable Instruction-Following Models with Low-Rank Adaptation

arXiv · · NLP LLM

MBZUAI releases Bactrian-X, a multilingual parallel dataset of 3.4 million instruction-response pairs across 52 languages. They trained low-rank adaptation (LoRA) adapters using this dataset, creating lightweight, replaceable components for large language models. Experiments show the LoRA-based models outperform vanilla and existing instruction-tuned models in multilingual settings.