ML-JET: A Benchmark Dataset for Classifying Heavy Ion Collisions

Haydar Mehryar, Chengzhi Mao, Loren Schwiebert; Proceedings of the Winter Conference on Applications of Computer Vision (WACV) Workshops, 2025, pp. 1103-1112

Abstract


Understanding relativistic heavy ion collisions is important for studying the evolution of the universe. Models of these collisions require hyperparameter tuning to accurately reproduce experimental results. Traditionally Bayesian analysis which is costly and non-scalable has been used for this tuning. Deep learning is being explored as a potential alternative although it is still in the early stages. In this work we propose a novel benchmark dataset for anomaly detection in visual data generated from relativistic heavy ion collisions using the JETSCAPE framework. This dataset consists of 10.8 million jet event images enabling the application of computer vision techniques to this domain. Our dataset converts complex physics simulations into event images which are compatible with standard vision classifiers. Using the standard Convolutional Neural Networks (CNN) our initial results attain a 92% accuracy in energy loss module classification while concurrently accelerating the tuning process by an order of magnitude and saving millions of CPU/GPU hours over Bayesian analysis. This work demonstrates the potential of leveraging AI-based anomaly detection techniques to advance both physics and computer vision offering a scalable solution for identifying anomalies in particle collision data.

Related Material


[pdf] [supp]
[bibtex]
@InProceedings{Mehryar_2025_WACV, author = {Mehryar, Haydar and Mao, Chengzhi and Schwiebert, Loren}, title = {ML-JET: A Benchmark Dataset for Classifying Heavy Ion Collisions}, booktitle = {Proceedings of the Winter Conference on Applications of Computer Vision (WACV) Workshops}, month = {February}, year = {2025}, pages = {1103-1112} }