The article discusses the rise of large language models such as ChatGPT and Gemini, highlighting their role in driving the current wave of AI development. Why it matters: Though light on specifics, the article reflects sustained interest in the impact and future of LLMs, a central area of AI research and development.
MBZUAI's president Eric Xing warns against the unchecked pursuit of ever-larger AI models, likening them to an "atomic bomb" because their behavior is unpredictable. He argues that the field lacks sufficient understanding of what these models learn and whether their outputs are reliable, and advocates for more efficient models instead. Xing emphasizes the need for debuggability and error tracking in AI, on par with established engineering practice. Why it matters: The piece highlights growing concern within the AI community about the scalability and risks of increasingly complex models, particularly regarding transparency and control.
Yanwei Fu from Fudan University will present research on multimodal models, robotic grasping, and fMRI neural decoding. Topics include few-shot learning, object-centric self-supervised learning, image manipulation, and visual-language alignment. The research also covers Transformer compression and the application of large models with multi-view stereo (MVS) 3D modeling to robotic arm grasping. Why it matters: While the talk is not directly about Middle East AI, the topics covered are core to advancing AI research and applications in the region.
Iryna Gurevych from TU Darmstadt presented research on using large language models for real-world fact-checking, focusing on dismantling misleading narratives built on misinterpreted scientific publications and on detecting misinformation in visual content. The research aims to explain why a false claim was believed, why it is false, and why the corrected alternative is right. Why it matters: Addressing misinformation, especially when it cites seemingly credible sources, is critical for public health, conflict resolution, and maintaining trust in institutions in the Middle East and globally.
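The three-part explanation structure the research targets maps naturally onto a structured generation task. Below is a minimal sketch of such a prompt template in Python; the wording and the toy claim/evidence pair are illustrative assumptions, not the authors' actual setup.

```python
# Hedged sketch of the three-part fact-checking explanation described above.
# The template wording and the toy example are assumptions for illustration;
# they are not the authors' actual prompt or data.
EXPLANATION_TEMPLATE = """\
Claim: {claim}
Evidence: {evidence}

Explain, in three parts:
1. Why the claim was believed (e.g., a misread figure in the cited paper).
2. Why the claim is false, grounded in the evidence.
3. Why the corrected alternative is right.
"""

claim = "Study X proves coffee cures insomnia."
evidence = ("Study X reports only a weak correlation in a small sample "
            "and draws no causal conclusion.")

# The filled prompt would be sent to any instruction-tuned LLM.
print(EXPLANATION_TEMPLATE.format(claim=claim, evidence=evidence))
```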
The article provides a basic overview of large language models (LLMs), explaining how they work and where they are applied. LLMs are AI systems built on the transformer architecture that process and generate human-like text, trained on vast text corpora to predict the next token in a sequence. The piece differentiates between general-purpose, task-specific, and multimodal models, as well as closed-source and open-source LLMs. Why it matters: LLMs are foundational for advances in Arabic NLP, as evidenced by models like MBZUAI's Jais, and understanding their mechanics is crucial for regional AI development.
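To make the next-token mechanic concrete, here is a minimal sketch using the Hugging Face transformers library, with GPT-2 standing in for any causal LLM; the model choice and prompt are illustrative, not taken from the article.

```python
# Minimal sketch of next-token prediction with a causal LLM.
# GPT-2 is used as a small, freely available stand-in model.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

prompt = "Large language models generate text by predicting the next"
inputs = tokenizer(prompt, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits  # shape: (batch, seq_len, vocab_size)

# Distribution over the vocabulary for the token that follows the prompt.
next_token_probs = torch.softmax(logits[0, -1], dim=-1)
top_probs, top_ids = next_token_probs.topk(5)
for p, i in zip(top_probs, top_ids):
    print(f"{tokenizer.decode(i.item())!r:>12}  p={p.item():.3f}")
```

Picking (or sampling) a token from this distribution, appending it to the input, and repeating is how generation proceeds one token at a time.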
IFM has released K2-V2, a 70B-class LLM that takes a "360-open" approach, making its weights, data, training details, checkpoints, and fine-tuning recipes publicly available. K2-V2 matches the performance of leading open-weight models while offering full transparency, in contrast with proprietary and semi-open Chinese models. Independent evaluations position K2 as a high-performance, fully open-source alternative in the AI landscape. Why it matters: K2-V2 gives developers a transparent, reproducible foundation model, fostering trust and enabling customization without sacrificing performance, which is crucial for sensitive applications in the region.
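Because the weights and checkpoints are public, the model can in principle be loaded with standard open-source tooling. A minimal sketch using Hugging Face transformers follows; the repo ID LLM360/K2 is an assumption based on the project's earlier releases, so substitute the actual K2-V2 repository.

```python
# Hedged sketch: loading a fully open model with Hugging Face transformers.
# The repo ID "LLM360/K2" is an assumed placeholder; replace it with the
# actual K2-V2 repository. A 70B-class model needs multiple GPUs or offloading.
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "LLM360/K2"  # assumed repo ID, not confirmed for K2-V2
tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(
    repo_id,
    device_map="auto",   # shard across available GPUs (requires accelerate)
    torch_dtype="auto",  # use the checkpoint's native precision
)

inputs = tokenizer("Fully open model releases matter because",
                   return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```

Since intermediate checkpoints and training data are also released, the same tooling can in principle resume training or audit data provenance, which closed releases do not allow.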
A KAUST alumnus presented research on using large language models for complex disease modeling and drug discovery. LLMs were trained on insurance claims from 123 million people in the US to model diseases and predict genetic parameters. Protein language models were developed to discover remote homologs and functional biomolecules, while RNA language models were used for RNA structure prediction and inverse design. Why it matters: This work highlights the potential of LLMs to accelerate computational biology research and drug development, with a KAUST connection.
This study reviews the use of large language models (LLMs) for Arabic language processing, focusing on pre-trained models and their applications. It highlights the challenges in Arabic NLP due to the language's complexity and the relative scarcity of resources. The review also discusses how techniques like fine-tuning and prompt engineering enhance model performance on Arabic benchmarks. Why it matters: This overview helps consolidate research directions and benchmarks in Arabic NLP, guiding future development of LLMs tailored for the Arabic language and its diverse dialects.
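As an illustration of the prompt-engineering techniques the review discusses, here is a hedged sketch of a few-shot prompt for an Arabic sentiment task; the sentences and labels are invented for demonstration and are not drawn from the review's benchmarks.

```python
# Illustrative few-shot prompt for an Arabic task (sentiment classification).
# The examples are invented for demonstration; any instruction-tuned LLM
# could consume the resulting prompt.
FEW_SHOT_PROMPT = """\
Classify the sentiment of each Arabic sentence as positive or negative.

Sentence: الخدمة ممتازة والطعام لذيذ
Sentiment: positive

Sentence: التطبيق بطيء جدا ولا يعمل بشكل جيد
Sentiment: negative

Sentence: {sentence}
Sentiment:"""
# Gloss: example 1 = "The service is excellent and the food is delicious";
# example 2 = "The app is very slow and does not work well."

def build_prompt(sentence: str) -> str:
    """Insert the target sentence into the few-shot template."""
    return FEW_SHOT_PROMPT.format(sentence=sentence)

# "I loved this book a lot" -> the model should answer "positive".
print(build_prompt("أحببت هذا الكتاب كثيرا"))
```

Few-shot prompting like this often lifts performance on lower-resource languages without any training, though fine-tuning on Arabic data typically yields larger gains on the benchmarks the review covers.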