arxiv:2412.04106

MRGen: Diffusion-based Controllable Data Engine for MRI Segmentation towards Unannotated Modalities

Published on Dec 4, 2024

· Submitted by

Haoning Wu on Dec 6, 2024

Upvote

Authors:

Haoning Wu ,

Ziheng Zhao ,

Abstract

A diffusion-based data engine, MRGen, is proposed to synthesize training samples from unannotated medical modalities using text prompts and masks, effectively extending MRI segmentation.

AI-generated summary

Medical image segmentation has recently demonstrated impressive progress with deep neural networks, yet the heterogeneous modalities and scarcity of mask annotations limit the development of segmentation models on unannotated modalities. This paper investigates a new paradigm for leveraging generative models in medical applications: controllably synthesizing data for unannotated modalities, without requiring registered data pairs. Specifically, we make the following contributions in this paper: (i) we collect and curate a large-scale radiology image-text dataset, MedGen-1M, comprising modality labels, attributes, region, and organ information, along with a subset of organ mask annotations, to support research in controllable medical image generation; (ii) we propose a diffusion-based data engine, termed MRGen, which enables generation conditioned on text prompts and masks, synthesizing MR images for diverse modalities lacking mask annotations, to train segmentation models on unannotated modalities; (iii) we conduct extensive experiments across various modalities, illustrating that our data engine can effectively synthesize training samples and extend MRI segmentation towards unannotated modalities.

View arXiv page View PDF Project page GitHub 35 Add to collection

Community

haoningwu

Paper author Paper submitter Dec 6, 2024

•

edited Sep 8

Project Page: https://haoningwu3639.github.io/MRGen/
Paper: https://arxiv.org/abs/2412.04106
Code: https://github.com/haoningwu3639/MRGen/

Concretely, our contributions are threefold:
(i) we introduce MRGen-DB, a large-scale radiology image-text dataset comprising extensive samples with rich metadata, including modality labels, attributes, regions, and organs information, with a subset having pixelwise mask annotations; (ii) we present MRGen, a diffusion-based data engine for controllable medical image synthesis, conditioned on text prompts and segmentation masks. MRGen can generate realistic images for diverse MRI modalities lacking mask annotations, facilitating segmentation training in low-source domains; (iii) extensive experiments across multiple modalities demonstrate that MRGen significantly improves segmentation performance on unannotated modalities by providing high-quality synthetic data. We believe that our method bridges a critical gap in medical image analysis, extending segmentation capabilities to scenarios that are challenging to acquire manual annotations.

We are organizing our code, data, and checkpoints, and will gradually open-source them in the near future, please stay tuned.

librarian-bot

Dec 7, 2024

This is an automated message from the Librarian Bot. I found the following papers similar to this paper.

The following papers were recommended by the Semantic Scholar API

Please give a thumbs up to this comment if you found it helpful!

If you want recommendations for any Paper on Hugging Face checkout this Space

You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment

Upvote

Models citing this paper 1

Datasets citing this paper 1

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2412.04106 in a Space README.md to link it from this page.