Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. ThermoNeRF: A multimodal Neural Radiance Field for joint RGB-thermal novel view synthesis of building facades
 
research article

ThermoNeRF: A multimodal Neural Radiance Field for joint RGB-thermal novel view synthesis of building facades

Hassan, Mariam  
•
Forest, Florent  
•
Fink, Olga  
Show more
May 1, 2025
Advanced Engineering Informatics

Thermal scene reconstruction holds great potential for various applications, such as building energy analysis and non-destructive infrastructure testing. However, existing methods rely on dense scene measurements and use RGB images for 3D reconstruction, incorporating thermal data only through a post-hoc projection. Due to the lower resolution of thermal cameras and the challenges of RGB/Thermal camera calibration, this post-hoc projection often results in spatial discrepancies between temperatures projected onto the 3D model and real temperatures at the surface. We propose ThermoNeRF, a novel multimodal Neural Radiance Fields (NerF) that renders new RGB and thermal views of a scene with joint optimization of the geometry and thermal information while preventing cross-modal interference. To compensate for the lack of texture in thermal images, ThermoNeRF leverages paired RGB and thermal images to learn scene geometry while maintaining separate networks for reconstructing RGB color and temperature values, ensuring accurate and modality-specific representations. We also introduce ThermoScenes, a dataset of paired RGB+thermal images comprising 8 scenes of building facades and 8 scenes of everyday objects enabling evaluation in diverse scenarios. On ThermoScenes, ThermoNeRF achieves an average mean absolute error of 1.13 °C for buildings and 0.41 °C for other scenes when predicting temperatures of previously unobserved views. This improves accuracy by over 50% compared to concatenated RGB+thermal input in standard NeRF. While ThermoNeRF performs well on aligned RGB-thermal images, future work could address misaligned or unpaired data for better generalization. Code and dataset are available online.

  • Details
  • Metrics
Type
research article
DOI
10.1016/j.aei.2025.103345
Scopus ID

2-s2.0-105003540039

Author(s)
Hassan, Mariam  

École Polytechnique Fédérale de Lausanne

Forest, Florent  

École Polytechnique Fédérale de Lausanne

Fink, Olga  

École Polytechnique Fédérale de Lausanne

Mielle, Malcolm

Schindler EPFL Lab

Date Issued

2025-05-01

Published in
Advanced Engineering Informatics
Volume

65

Article Number

103345

Subjects

Building information modeling

•

NeRF

•

Novel view synthesis

•

Thermal imaging

•

Thermography

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
VITA  
IMOS  
FunderFunding(s)Grant NumberGrant URL

Innosuisse - Swiss Innovation Agency

105.237.1 IP-ICT

Available on Infoscience
May 12, 2025
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/250060
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés