Informed Patch Sampling for 3D Medical Image Reconstruction | TU Wien

Information

Publication Type: Master Thesis
Workgroup(s)/Project(s): not specified
Date: 2026
TU Wien Library: AC17859583
Open Access: yes
First Supervisor: Renata Georgia Raidou
Pages: 122
Keywords: Medical Image Analysis, Segmentation, Data Sampling, 3D Data Processing, Deep Learning, Image Reconstruction

Abstract

Efficient analysis and processing of 3D volumes are crucial in clinical practice. A common task in 3D medical image processing is object segmentation, where objects of interest are delineated. However, segmenting large volumetric scans requires substantial memory and computational power, making end-to-end segmentation both memory- and computationally intensive. A potential solution to reduce these costs is to select only a subset of the input, segment this subset, and estimate the values of the remaining regions by reconstructing them from the segmented subset. Our hypothesis is that the choice of subset influences reconstruction performance, with some regions of the input volume being more informative for reconstruction than others. To test this hypothesis, we propose a neural network capable of identifying subsets that contribute most to accurate reconstruction. To simplify the process and focus on the core task, we assume that binary segmentations of the objects of interest are provided and select subsets directly from them, reconstructing the original binary segmentations afterwards. We build our neural network upon an existing point cloud-based network that learns to select representative points, and integrate it into a novel end-to-end pipeline for reconstructing full volumes from limited input data. We modify the original point cloud–based loss function to operate on voxel grid data and introduce conversion and extraction mechanisms that enable transitions between voxel grid and point cloud representations. Our proposed pipeline first converts the input voxel grid into a point cloud representation to enable efficient geometric processing. A neural network architecture then processes the point cloud and predicts a set of candidate centers for volumetric patches. These predicted centers are subsequently used to extract the output patch set, which is then fed to the downstream reconstruction network. We evaluate our pipeline on two datasets of medical shape segmentations with varying geometrical complexity. Our experiments show that the proposed learned sampler identifies informative regions, which support reconstruction performance, especially for complex shapes and limited spatial context. We further evaluate the effect of reconstruction network quality across different input configurations, varying patch size and number of patches, and show that our approach is effective when reconstruction accuracy is poor or when the input shape has complex geometry. Finally, we analyze the computations and memory demands of the proposed pipeline, showing that the additional overhead remains under 1 GB of memory and 0.5 s of extra computation, making the method suitable for deployment in resource-limited environments.

Additional Files and Images

thesis

Weblinks

BibTeX

@mastersthesis{peterka-2026-ips,
  title =      "Informed Patch Sampling for 3D Medical Image Reconstruction",
  author =     "Ond\v{r}ej Peterka",
  year =       "2026",
  abstract =   "Efficient analysis and processing of 3D volumes are crucial
               in clinical practice. A common task in 3D medical image
               processing is object segmentation, where objects of interest
               are delineated. However, segmenting large volumetric scans
               requires substantial memory and computational power, making
               end-to-end segmentation both memory- and computationally
               intensive. A potential solution to reduce these costs is to
               select only a subset of the input, segment this subset, and
               estimate the values of the remaining regions by
               reconstructing them from the segmented subset. Our
               hypothesis is that the choice of subset influences
               reconstruction performance, with some regions of the input
               volume being more informative for reconstruction than
               others. To test this hypothesis, we propose a neural network
               capable of identifying subsets that contribute most to
               accurate reconstruction. To simplify the process and focus
               on the core task, we assume that binary segmentations of the
               objects of interest are provided and select subsets directly
               from them, reconstructing the original binary segmentations
               afterwards. We build our neural network upon an existing
               point cloud-based network that learns to select
               representative points, and integrate it into a novel
               end-to-end pipeline for reconstructing full volumes from
               limited input data. We modify the original point
               cloud–based loss function to operate on voxel grid data
               and introduce conversion and extraction mechanisms that
               enable transitions between voxel grid and point cloud
               representations. Our proposed pipeline first converts the
               input voxel grid into a point cloud representation to enable
               efficient geometric processing. A neural network
               architecture then processes the point cloud and predicts a
               set of candidate centers for volumetric patches. These
               predicted centers are subsequently used to extract the
               output patch set, which is then fed to the downstream
               reconstruction network. We evaluate our pipeline on two
               datasets of medical shape segmentations with varying
               geometrical complexity. Our experiments show that the
               proposed learned sampler identifies informative regions,
               which support reconstruction performance, especially for
               complex shapes and limited spatial context. We further
               evaluate the effect of reconstruction network quality across
               different input configurations, varying patch size and
               number of patches, and show that our approach is effective
               when reconstruction accuracy is poor or when the input shape
               has complex geometry. Finally, we analyze the computations
               and memory demands of the proposed pipeline, showing that
               the additional overhead remains under 1 GB of memory and 0.5
               s of extra computation, making the method suitable for
               deployment in resource-limited environments.",
  pages =      "122",
  address =    "Favoritenstrasse 9-11/E193-02, A-1040 Vienna, Austria",
  school =     "Research Unit of Computer Graphics, Institute of Visual
               Computing and Human-Centered Technology, Faculty of
               Informatics, TU Wien",
  keywords =   "Medical Image Analysis, Segmentation, Data Sampling, 3D Data
               Processing, Deep Learning, Image Reconstruction",
  URL =        "https://www.cg.tuwien.ac.at/research/publications/2026/peterka-2026-ips/",
}