Transformer-Integrated Hybrid Convolutional Neural Network for Dose Prediction in Nasopharyngeal Carcinoma Radiotherapy

Radiotherapy is recognized as the major treatment of nasopharyngeal carcinoma. Rapid and accurate dose prediction can improve the efficiency of the treatment planning process and the quality of radiotherapy plans. Currently, deep learning-based methods have been widely applied to dose prediction for radiotherapy treatment planning. However, it is important to note that existing models based on Convolutional Neural Networks (CNN) often overlook long-distance information. Although some studies try to use Transformer to solve the problem, it lacks the ability of CNN to process the spatial information inherent in images. Therefore, we propose a novel CNN and Transformer hybrid dose prediction model. To enhance the transmission ability of features between CNN and Transformer, we design a hierarchical dense recurrent encoder with a channel attention mechanism. Additionally, we propose a progressive decoder that preserves richer texture information through layer-wise reconstruction of high-dimensional feature maps. The proposed model also introduces object-driven skip connections, which facilitate the flow of information between the encoder and decoder. Experiments are conducted on in-house datasets, and the results show that the proposed model is superior to baseline methods in most dosimetric criteria. In addition, the image analysis metrics including PSNR, SSIM, and NRMSE demonstrate that the proposed model is consistent with ground truth and produces promising visual effects compared to other advanced methods. The proposed method could be taken as a powerful clinical guidance tool for physicists, significantly enhancing the efficiency of radiotherapy planning. The source code is available at https://github.com/CDUTJ102/THCN-Net .

© 2024. The Author(s) under exclusive licence to Society for Imaging Informatics in Medicine.
Journal of imaging informatics in medicine, 2024-10-20