3D Latent Diffusion Model for MR-Only Radiotherapy: Accurate and Consistent Synthetic CT Generation

Mohammed A. Mahdi, Mohammed Al-Shalabi, Ehab T. Alnfrawy, Reda Elbarougy, Muhammad Usman Hadi, Rao Faizan Ali

Research output: Contribution to journalArticlepeer-review

1 Downloads (Pure)

Abstract

Background: The clinical imperative to reduce patient ionizing radiation exposure during diagnosis and treatment planning necessitates robust, high-fidelity synthetic imaging solutions. Current cross-modal synthesis techniques, primarily based on GANs and deterministic CNNs, exhibit instability and critical errors in modeling high-contrast tissues, thereby hindering their reliability for safety-critical applications such as radiotherapy. Objectives: Our primary objective was to develop a stable, high accuracy framework for 3D Magnetic Resonance Imaging (MRI) to Computed Tomography (CT) synthesis capable of generating clinically equivalent synthetic CTs (sCTs) across multiple anatomical sites. Methods: We introduce a novel 3D Latent Diffusion Model (3DLDM) that operates in a compressed latent space, mitigating the computational burden of 3D diffusion while leveraging the stability of the denoising objective. Results: Across the Head & Neck, Thorax, and Abdomen, the 3DLDM achieved a Mean Absolute Error (MAE) of 56.44 Hounsfield Units (HU). This result demonstrates a significant 3.63% reduction in overall error compared to the strongest adversarial baseline, CycleGAN (MAE = 60.07 HU, p < 0.05), a 10.76% reduction compared to NNUNet (MAE = 67.20 HU, p < 0.01), and a 20.79% reduction compared to the transformer-based SwinUNeTr (MAE = 77.23 HU, p < 0.0001). The model also achieved the highest structural similarity (SSIM = 0.885 ± 0.031), significantly exceeding SwinUNeTr ( p < 0.0001), NNUNet ( p < 0.01), and Pix2Pix ( p < 0.0001). Likewise, the 3D-LDM achieved the highest peak signal-to-noise ratio (PSNR = 29.73 ± 1.60 dB), with statistically significant gains over CycleGAN ( p < 0.01), NNUNet ( p < 0.001), and SwinUNeTr ( p < 0.0001). Conclusions: This work validates a scalable, accurate approach for volumetric synthesis, positioning the 3DLDM to enable MR-only radiotherapy planning and accelerate radiation-free multi-modal imaging in the clinic.

Original languageEnglish
Article number3010
Pages (from-to)1-18
Number of pages18
JournalDiagnostics
Volume15
Issue number23
Early online date26 Nov 2025
DOIs
Publication statusPublished online - 26 Nov 2025

Bibliographical note

Publisher Copyright:
© 2025 by the authors.

Data Access Statement

The data supporting the findings of this study are publicly available on Zenodo (https://doi.org/10.5281/zenodo.7260705) under the SynthRAD2023 collection (Accessed: 19 January 2025). No additional data were generated or analyzed beyond those reported in this study.

Funding

This research has been funded by Scientific Research Deanship at University of Ha’il-Saudi Arabia through project number RG-24 182.

Keywords

  • latent diffusion models
  • medical image synthesis
  • MRI-to-CT tanslation
  • 3D volumetric imaging
  • generative models
  • synthetic CT
  • radiotherapy planning
  • 3d Volumetric Imaging
  • Mri-to-ct Translation
  • Latent Diffusion Models
  • Medical Image Synthesis
  • Generative Models
  • Synthetic Ct
  • Radiotherapy Planning

Fingerprint

Dive into the research topics of '3D Latent Diffusion Model for MR-Only Radiotherapy: Accurate and Consistent Synthetic CT Generation'. Together they form a unique fingerprint.

Cite this