Cone-Beam CT to Synthetic CT Translation using Conditional 3D Latent Diffusion-Based Model

  • Mohammed Al-Shalabi
  • , Mohammed A. Mahdi
  • , Reda Elbarougy
  • , Ehab T. Alnfrawy
  • , Muhammad Usman Hadi
  • , Rao Faizan Ali

Research output: Contribution to journalArticlepeer-review

Abstract

Accurate cone-beam CT (CBCT)-to-synthetic CT (sCT) translation is essential for image-guided adaptive radiotherapy (IGART), where Hounsfield unit (HU) fidelity and structural accuracy directly affect dose calculation. We propose a conditional 3D Latent Diffusion Model (3DLDFM) for volumetric CBCT-to-sCT synthesis. The framework comprises two stages: 1) a 3D variational autoencoder with KL regularization that compresses CBCT volumes into a three-channel latent representation, trained with a composite loss combining L1 reconstruction, perceptual, KL, and adversarial terms; and 2) a conditional 3D U-Net diffusion model that performs iterative denoising in latent space using a DDPM-style noise schedule, conditioned on the input CBCT. We evaluated 3DLDFM on the multi-center SynthRAD2023 dataset comprising 955 paired CBCT/CT volumes spanning head-and-neck, thorax, and abdominal sites. Performance is benchmarked against SwinUNETR, nnUNet, CycleGAN, and Pix2Pix using Mean Absolute Error (MAE), Peak Signal-to-Noise Ratio (PSNR), and Structural Similarity Index Measure (SSIM) within body masks. Across all regions, 3DLDFM achieves the lowest overall MAE (51.40 HU) and the highest overall SSIM (0.9124), while maintaining competitive PSNR (30.60 dB), surpassing all baselines in HU accuracy and structural fidelity. These results demonstrate that the proposed latent diffusion framework provides a robust and generalizable solution for CBCT-to-CT synthesis and strengthens the feasibility of simulation-free adaptive radiotherapy workflows.

Original languageEnglish
Pages (from-to)12680-12693
Number of pages14
JournalIEEE Access
Volume14
Early online date12 Jan 2026
DOIs
Publication statusPublished online - 12 Jan 2026

Bibliographical note

Publisher Copyright:
© 2013 IEEE.

Funding

This work was supported by the Scientific Research Deanship at University of Hail, Saudi Arabia, under Project RG-24 182.

Funders
The British Council

    Keywords

    • Cone-beam CT
    • Synthetic CT
    • Latent Diffusion Model
    • Image-guided adaptive radiotherapy
    • Image-to-Image Translation
    • Deep learning
    • deep learning
    • Image-guided adaptive radiotherapy

    Fingerprint

    Dive into the research topics of 'Cone-Beam CT to Synthetic CT Translation using Conditional 3D Latent Diffusion-Based Model'. Together they form a unique fingerprint.

    Cite this