DoseRAD2026 Dataset

Overview

The DoseRAD2026 dataset provides paired multimodal imaging and high-accuracy Monte Carlo–simulated dose distributions for benchmarking fast 3D radiation dose calculation methods.

It supports all four challenge tasks:

  • Photon dose calculation on CT
  • Photon dose calculation on MRI
  • Proton dose calculation on CT
  • Proton dose calculation on MRI

All tasks are based on a unified dataset of paired MRI–CT volumes, enabling consistent cross-modality evaluation.


Cohort

The training dataset consists of:

  • 75 registered MRI–CT pairs
    • 36 abdominal cases
    • 39 thoracic cases

Each case includes:

  • A planning MRI volume
  • A deformably registered CT volume

MRI–CT pairs are spatially aligned to ensure voxel-level correspondence. Imaging data are provided in 3D .mha format with consistent metadata.


Beam Configurations

For each case, multiple independent radiation beams are provided. Each beam must be evaluated separately.

Photon Beams

Photon beams are defined by:

  • Multi-leaf collimator (MLC) apertures
  • Gantry and collimator angles
  • Isocenter location
  • Beam energy and delivery parameters

Beam shapes are generated from arbitrary MLC configurations representative of VMAT-style treatments.

Proton Beams

Proton beams are defined by:

  • Source position and angle
  • Sets of energies originating from the same source (rays)

Configurations reflect idealized proton delivery scenarios.


Ground Truth Dose

For every beam, a corresponding 3D dose distribution is provided.

Ground truth dose distributions are generated using:

  • Full-physics Monte Carlo particle transport simulations
  • Tissue-dependent interaction modeling

All dose grids:

  • Are beam-specific
  • Are spatially aligned with CT and MRI volumes
  • Serve as the reference standard for evaluation

Data Organization

Each case includes:

  • CT volume
  • MRI volume
  • Beam parameter files
  • Corresponding Monte Carlo dose distributions

Data are organized in a standardized folder structure compatible with automated evaluation pipelines. Detailed format specifications are provided in the documentation.


Training and Test Split

The dataset is divided into:

  • A public training set (images, beam definitions, ground truth dose)
  • A private preliminary test set for initial evaluation
  • A private test set for final evaluation

The private test sets will remain inaccessible to participants to ensure objective benchmarking.