Popularity analysis approaches in high energy physics on example of ATLAS Experiment at LHC
Seminars
Laboratory of Information Technologies
Joint Laboratory Seminar
Date and Time: Tuesday, 20 May 2025, at 3:00 PM
Venue: room 310, Meshcheryakov Laboratory of Information Technologies, online on Webinar
Seminar topic: “Study of data popularity analysis approaches in high energy physics on the example of the ATLAS Experiment at LHC”
Speaker: Mikhail Shubin (Faculty of Computational Mathematics and Cybernetics of Moscow State University, Russia)
Authors: Mikhail Shubin, Nina Popova (Faculty of Computational Mathematics and Cybernetics of Moscow State University, Russia), Maria Grigorieva (Research Computing Centre of Moscow State University)
Modern large-scale scientific experiments, such as those conducted at LHC (CERN) and the NICA Project, generate massive volumes of data that require efficient storage and processing. Monitoring systems in distributed computing environments accumulate valuable information about data access patterns, which can be leveraged to optimise computational workflows. One promising approach is popularity-based data management, where frequently accessed data is cached or replicated, while rarely used data is archived. However, the chaotic and irregular nature of access patterns poses challenges for traditional statistical analysis. This work explores the application of modern machine learning methods for predicting data popularity and identifying groups of datasets with similar access behaviour, enabling more efficient data caching, replication, and archiving strategies.