Decoder-Free Supervoxel GNN for Accurate Brain-Tumor Localization in Multi-modal MRI
Modern vision backbones for 3D medical imaging typically process dense voxel grids through parameter-heavy encoder-decoder structures, a design that allocates a significant portion of its parameters to spatial reconstruction rather than feature learning. Our approach introduces SVGFormer, a decoder-free pipeline built upon a content-aware grouping stage that partitions the volume into a semantic graph of supervoxels. Its hierarchical encoder learns rich node representations by combining a patch-level Transformer with a supervoxel-level Graph Attention Network, jointly modeling fine-grained intra-region features and broader inter-regional dependencies. This design concentrates all learnable capacity on feature encoding and provides inherent, dual-scale explainability from the patch to the region level. To validate the framework’s flexibility, we trained two specialized models on the BraTS dataset: one for node-level classification and one for tumor proportion regression. Both models achieved strong performance, with the classification model achieving a F1-score of 0.875 and the regression model a MAE of 0.028, confirming the encoder’s ability to learn discriminative and localized features. Our results establish that a graph-based, encoder-only paradigm offers an accurate and inherently interpretable alternative for 3D medical image representation.
2-s2.0-105019649100
EPFL
Organisation Européenne pour la Recherche Nucléaire
Organisation Européenne pour la Recherche Nucléaire
Organisation Européenne pour la Recherche Nucléaire
Organisation Européenne pour la Recherche Nucléaire
Organisation Européenne pour la Recherche Nucléaire
Organisation Européenne pour la Recherche Nucléaire
2025-10-01
Lecture Notes in Computer Science; 16150 LNCS
1611-3349
0302-9743
162
171
REVIEWED
EPFL
| Event name | Event acronym | Event place | Event date |
Daejeon, Korea, Republic of | 2025-09-27 - 2025-09-27 | ||