A New Deep Learning Process For Postoperative Corneal Topography Prediction Based On Raw Multimodal Data From The Pentacam Hr System

Published 2025 - 43rd Congress of the ESCRS

Reference: PO339 | Type: Free paper | DOI: 10.82333/ab34-kj21

Authors: Dineshkumar NATHALAL Maru* ¹

¹Opthalmology,Aastha Hospital,Anand,India

Purpose

This work addresses the limitations of existing GAN models, particularly their inability to effectively capture global feature relationships due to the locality of convolutional neural network (CNN)-based approaches, while mitigating challenges such as image noise and geometric information loss during training.

Setting

Shenzhen Eye Hospital, Shenzhen Eye Center, Southern Medical University

Methods

The MCSTransWnet framework integrates a generator and a discriminator. By fusing transformer-based global feature extraction with U-net’s local feature preservation, the model simultaneously captures contextual relationships and fine-grained image details. This marks the first integration of transformers, U-nets, and multi-conditional modules in a GAN framework for medical topography prediction.

Results

Experimental evaluations demonstrate the model’s effectiveness, achieving a structural similarity index (SSIM) of 0.77, peak signal-to-noise ratio (PSNR) of 16.03 dB, and Fréchet inception distance (FID) of 9.11. These metrics confirm the model’s capability to generate clinically relevant postoperative corneal topographies with high structural fidelity and low distortion.

Conclusions

The MCSTransWnet model successfully bridges the gap between global feature modeling and local detail preservation in medical image prediction tasks. Its novel fusion of transformers and U-nets within a conditional GAN framework outperforms conventional CNN-based approaches, offering a robust solution for surgical outcome simulation. This work establishes a new paradigm for multimodal medical data integration and holds potential for enhancing preoperative planning in refractive surgeries.