-
[pdf]
[supp]
[bibtex]@InProceedings{Son_2025_WACV, author = {Son, Ju Heon and Choi, Jang-Hwan}, title = {FMD: Comprehensive Data Compression in Medical Domain via Fused Matching Distillation}, booktitle = {Proceedings of the Winter Conference on Applications of Computer Vision (WACV)}, month = {February}, year = {2025}, pages = {3406-3415} }
FMD: Comprehensive Data Compression in Medical Domain via Fused Matching Distillation
Abstract
Medical datasets are often large and contain sensitive information presenting significant challenges for data sharing and storage. To address these issues this paper introduces a novel method called Fused Matching Distillation (FMD) which combines multiple dataset distillation techniques to achieve both data compression and enhanced privacy. FMD synthesizes representative subsets that capture the essential information from the original dataset while anonymizing sensitive details during the distillation process. The proposed approach integrates two complementary methods: parameter matching a technique that aligns the training trajectories of a teacher network trained on real data with those of a student network trained on synthetic data and feature distribution matching which ensures that the synthetic dataset closely approximates the feature distribution of the original data. By fusing these techniques FMD maximizes the information density within each pixel of the distilled dataset achieving a balance between compression and performance. Experimental evaluations on medical datasets including COVID chest X-ray and Pancreas cancer CT demonstrate that FMD achieves superior accuracy and privacy compared to existing methods. Furthermore the proposed method is evaluated using metrics for both model performance and anonymity showing that FMD not only maintains high diagnostic accuracy but also effectively anonymizes the data. This makes FMD a promising tool for secure and efficient medical data sharing.
Related Material