KAUST computer scientist Mohamed Elhoseiny and his VISION CAIR team developed Creative Walk Adversarial Networks (CWAN) for novel art generation. CWAN learns from existing art styles and deviates using 'random walk deviation' methods. Human evaluators preferred CWAN-generated art compared to other methods like StyleGAN2. Why it matters: The research demonstrates AI's potential as a valuable tool for artists, enabling the creation of unique and meaningful art, and explores more effective emotional language in image captioning.
Dr. James She from Hamad Bin Khalifa University will give a talk on AI for Art, Culture and Heritage, covering history, trends, opportunities, and potential impacts in the Gulf region. His research focuses on AI and multimedia technologies for art, media, culture, and human creativity, with recent artworks using AI for cultural or heritage content in Arabic regions. He was also a visiting artist-in-residency at Fire Station Museum, Qatar Museums in 2020-2021. Why it matters: This lecture highlights the growing interest and applications of AI in preserving and promoting cultural heritage within the Gulf region, potentially fostering innovation in art and culture.
The article discusses research on fine-tuning text-to-image diffusion models, including reward function training, online reinforcement learning (RL) fine-tuning, and addressing reward over-optimization. A Text-Image Alignment Assessment (TIA2) benchmark is introduced to study reward over-optimization. TextNorm, a method for confidence calibration in reward models, is presented to reduce over-optimization risks. Why it matters: Improving the alignment and fidelity of text-to-image models is crucial for generating high-quality content, and addressing over-optimization enhances the reliability of these models in creative applications.
Researchers from Carnegie Mellon University and MBZUAI have developed a new method called ConceptAligner for precise image editing using AI. The system decomposes text embeddings into independent building blocks called atomic concepts, allowing users to make targeted tweaks without generating entirely new images. Their approach ensures that each latent factor maps to a specific user-controllable dial, enabling accurate concept-level modifications. Why it matters: This research addresses a major limitation in AI image generation, enhancing its usefulness in industries where precise control is crucial, such as advertising and medicine, and improving the reliability of AI-driven creative tools.
This paper introduces a method using Stable Diffusion XL (SDXL) fine-tuned with LoRA to generate culturally relevant coloring templates based on Emirati Al-Sadu weaving patterns for mental health therapy. The approach aims to leverage coloring therapy's stress-relieving benefits while embedding cultural resonance, potentially aiding in the treatment of Generalized Anxiety Disorder (GAD). Future research will explore the impact of Emirati heritage art on Emirati individuals using biosignals to assess engagement and effectiveness.
The Director-General of the Abu Dhabi Digital Authority, Mohamed Abdelhameed Al Askar, will give a talk on AI adoption in government. The talk will cover key points related to AI utilization within government organizations. Al Askar has experience managing digital initiatives such as the Abu Dhabi Government Services Program (TAMM). Why it matters: This signals the increasing importance of AI in public sector modernization efforts in the UAE and the broader region.
VinAI Research presented research projects focused on advancing image generation and manipulation using GANs and Diffusion Models. The research aims to improve GANs regarding utility, coverage, and output consistency. For Diffusion Models, the work focuses on improving the models’ speed to approach real-time performance and prevent negative social impact of diffusion-based personalized text-to-image generation. Why it matters: This talk indicates ongoing research and development in generative AI in Southeast Asia, an area of growing interest globally.