MedMerge: Merging Models for Effective Transfer Learning to Medical Imaging Tasks

arXiv · March 18, 2024 · Significant research

CV Healthcare Research Transfer Learning MBZUAI

Summary

Researchers at MBZUAI have introduced MedMerge, a transfer learning technique that merges weights from independently initialized models to improve performance on medical imaging tasks. MedMerge learns kernel-level weights to combine features from different models into a single model. Experiments across various medical imaging tasks demonstrated performance gains of up to 7% in F1 score.

Keywords

transfer learning · medical imaging · model merging · deep learning · MedMerge

Read original article →

Get the weekly digest

Top AI stories from the GCC region, every week.

FissionFusion: Fast Geometric Generation and Hierarchical Souping for Medical Image Analysis

arXiv · Mar 20

Researchers at MBZUAI introduce FissionFusion, a hierarchical model merging approach to improve medical image analysis performance. The method uses local and global aggregation of models based on hyperparameter configurations, along with a cyclical learning rate scheduler for efficient model generation. Experiments show FissionFusion outperforms standard model souping by approximately 6% on HAM10000 and CheXpert datasets and improves OOD performance.

Interpretable and synergistic deep learning for visual explanation and statistical estimations of segmentation of disease features from medical images

arXiv · Nov 11

The study compares deep learning models trained via transfer learning from ImageNet (TII-models) against those trained solely on medical images (LMI-models) for disease segmentation. Results show that combining outputs from both model types can improve segmentation performance by up to 10% in certain scenarios. A repository of models, code, and over 10,000 medical images is available on GitHub to facilitate further research.

MedNNS: Supernet-based Medical Task-Adaptive Neural Network Search

arXiv · Apr 22

The paper introduces MedNNS, a neural network search framework designed for medical imaging, addressing challenges in architecture selection and weight initialization. MedNNS constructs a meta-space encoding datasets and models based on their performance using a Supernetwork-based approach, expanding the model zoo size by 51x. The framework incorporates rank loss and Fréchet Inception Distance (FID) loss to capture inter-model and inter-dataset relationships, improving alignment in the meta-space and outperforming ImageNet pre-trained DL models and SOTA NAS methods.

DynaMMo: Dynamic Model Merging for Efficient Class Incremental Learning for Medical Images