Skip to content
GCC AI Research

Search

Results for "post-training"

LLM Post-Training: A Deep Dive into Reasoning Large Language Models

arXiv ·

A new survey paper provides a deep dive into post-training methodologies for Large Language Models (LLMs), analyzing their role in refining LLMs beyond pretraining. It addresses key challenges such as catastrophic forgetting, reward hacking, and inference-time trade-offs, and highlights emerging directions in model alignment, scalable adaptation, and inference-time reasoning. The paper also provides a public repository to continually track developments in this fast-evolving field.

Fine-tuning Text-to-Image Models: Reinforcement Learning and Reward Over-Optimization

MBZUAI ·

The article discusses research on fine-tuning text-to-image diffusion models, including reward function training, online reinforcement learning (RL) fine-tuning, and addressing reward over-optimization. A Text-Image Alignment Assessment (TIA2) benchmark is introduced to study reward over-optimization. TextNorm, a method for confidence calibration in reward models, is presented to reduce over-optimization risks. Why it matters: Improving the alignment and fidelity of text-to-image models is crucial for generating high-quality content, and addressing over-optimization enhances the reliability of these models in creative applications.

Kuwait Airways, KFAS explore strategic training partnership - Arab Times Kuwait News

Kuwait AI ·

Kuwait Airways and the Kuwait Foundation for the Advancement of Sciences (KFAS) are reportedly exploring a strategic training partnership. This initiative aims to enhance training programs and potentially foster skilled human capital within Kuwait. Why it matters: While the direct relevance to artificial intelligence is not specified in the title, such collaborations can form a foundation for broader technological advancements and workforce development crucial for future innovation in Kuwait.

Which way from here?

KAUST ·

KAUST highlights postdoctoral fellows Yi Jin Liew, Isabelle Schulz, Maren Ziegler and Neus Garcias Bonet outside the University Library. The article mentions King Abdullah bin Abdulaziz Al Saud (1924 – 2015). It encourages applications to KAUST's Discovery Postdoctoral program. Why it matters: This brief announcement signals KAUST's ongoing investment in attracting international research talent to Saudi Arabia.