Purpose: To present and evaluate Dafne (deep anatomical federated network), a freely available decentralized, collaborative deep learning system for the semantic segmentation of radiologic images through federated incremental learning. Materials and Methods: Dafne is free software with a client-server architecture. The client side is an advanced user interface that applies the deep learning models stored on the server to the user’s data and allows the user to check and refine the prediction. Incremental learning is then performed on the client’s side and sent back to the server, where it is integrated into the root model. Dafne was evaluated locally by assessing the performance gain across model generations on 38 MRI datasets of the lower legs and through the analysis of real-world usage statistics (639 use cases). Results: Dafne demonstrated a statistical improvement in the accuracy of semantic segmentation over time (average increase of the Dice similarity coeffi-cient by 0.007 points per generation on the local validation set, P < .001). Qualitatively, the models showed enhanced performance on various radiologic image types, including those not present in the initial training sets, indicating good model generalizability. Conclusion: Dafne showed improvement in segmentation quality over time, demonstrating potential for learning and generalization.
Deep Anatomical Federated Network (Dafne): An Open Client-Server Framework for Continuous, Collaborative Improvement of Deep Learning–based Medical Image Segmentation
Agosti, Abramo;Pichiecchio, Anna
2025-01-01
Abstract
Purpose: To present and evaluate Dafne (deep anatomical federated network), a freely available decentralized, collaborative deep learning system for the semantic segmentation of radiologic images through federated incremental learning. Materials and Methods: Dafne is free software with a client-server architecture. The client side is an advanced user interface that applies the deep learning models stored on the server to the user’s data and allows the user to check and refine the prediction. Incremental learning is then performed on the client’s side and sent back to the server, where it is integrated into the root model. Dafne was evaluated locally by assessing the performance gain across model generations on 38 MRI datasets of the lower legs and through the analysis of real-world usage statistics (639 use cases). Results: Dafne demonstrated a statistical improvement in the accuracy of semantic segmentation over time (average increase of the Dice similarity coeffi-cient by 0.007 points per generation on the local validation set, P < .001). Qualitatively, the models showed enhanced performance on various radiologic image types, including those not present in the initial training sets, indicating good model generalizability. Conclusion: Dafne showed improvement in segmentation quality over time, demonstrating potential for learning and generalization.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


