Skip to content
GCC AI Research

Search

Results for "image editing"

Create and edit images like a smart artist

MBZUAI ·

Researchers from Carnegie Mellon University and MBZUAI have developed a new method called ConceptAligner for precise image editing using AI. The system decomposes text embeddings into independent building blocks called atomic concepts, allowing users to make targeted tweaks without generating entirely new images. Their approach ensures that each latent factor maps to a specific user-controllable dial, enabling accurate concept-level modifications. Why it matters: This research addresses a major limitation in AI image generation, enhancing its usefulness in industries where precise control is crucial, such as advertising and medicine, and improving the reliability of AI-driven creative tools.

Improving patient care with computer vision

MBZUAI ·

MBZUAI's BioMedIA lab, led by Mohammad Yaqub, is developing AI solutions for healthcare challenges in cardiology, pulmonology, and oncology using computer vision. Yaqub's previous research analyzed fetal ultrasound images to correlate bone development with maternal vitamin D levels. The lab is now applying image analysis to improve the treatment of head and neck cancer using PET and CT scans. Why it matters: This research demonstrates the potential of AI and computer vision to improve diagnostic accuracy and accessibility of healthcare in the region and beyond.

Image generation and manipulation research at VinAI

MBZUAI ·

VinAI Research presented research projects focused on advancing image generation and manipulation using GANs and Diffusion Models. The research aims to improve GANs regarding utility, coverage, and output consistency. For Diffusion Models, the work focuses on improving the models’ speed to approach real-time performance and prevent negative social impact of diffusion-based personalized text-to-image generation. Why it matters: This talk indicates ongoing research and development in generative AI in Southeast Asia, an area of growing interest globally.

Learned Optics — Improving Computational Imaging Systems through Deep Learning and Optimization

MBZUAI ·

KAUST Professor Wolfgang Heidrich is researching computational imaging systems that jointly design optics and image reconstruction algorithms. He focuses on hardware-software co-design for imaging systems with applications in HDR, compact cameras, and hyperspectral imaging. Heidrich's work on HDR displays was the basis for Brightside Technologies, acquired by Dolby in 2007. Why it matters: This research aims to advance imaging technology through AI-driven design, potentially impacting various fields from consumer electronics to scientific research within the region and globally.

Medical Image Computing: Harvesting the Healing Power of AI and Domain Knowledg

MBZUAI ·

MBZUAI hosted a panel discussion in collaboration with the Manara Center for Coexistence and Dialogue. The discussion focused on the intersection of AI and medical image computing. Jiebo Luo, a professor at the University of Rochester, discussed his work on applying AI to healthcare, including moving beyond classification to semantic description and expanding use from hospitals to home telemedicine. Why it matters: This highlights the increasing focus on AI applications in healthcare within the Middle East, particularly at institutions like MBZUAI, which are fostering discussions on the ethical and practical implications of AI in medicine.

Computer vision: Teaching computers how to see the world

KAUST ·

KAUST's Visual Computing Center (VCC) is researching computer vision, image processing, and machine learning, with applications in self-driving cars, surveillance, and security. Professor Bernard Ghanem is working on teaching machines to understand visual data semantically, similar to how humans perceive the world. Self-driving cars use visual sensors to interpret traffic signals and detect obstacles, while computer vision also assists governments and corporations with security applications like facial recognition and detecting unattended luggage. Why it matters: Advancements in computer vision at KAUST can contribute to innovations in autonomous vehicles and enhance security measures in the region.

BRIQA: Balanced Reweighting in Image Quality Assessment of Pediatric Brain MRI

arXiv ·

This paper introduces BRIQA, a new method for automated assessment of artifact severity in pediatric brain MRI, which is important for diagnostic accuracy. BRIQA uses gradient-based loss reweighting and a rotating batching scheme to handle class imbalance in artifact severity levels. Experiments show BRIQA improves average macro F1 score from 0.659 to 0.706, especially for Noise, Zipper, Positioning and Contrast artifacts.

Cross-modal understanding and generation of multimodal content

MBZUAI ·

Nicu Sebe from the University of Trento presented recent work on video generation, focusing on animating objects in a source image using external information like labels, driving videos, or text. He introduced a Learnable Game Engine (LGE) trained from monocular annotated videos, which maintains states of scenes, objects, and agents to render controllable viewpoints. Why it matters: This talk highlights advancements in cross-modal AI, potentially enabling new applications in gaming, simulation, and content creation within the region.