Published May 15, 2024 | Version v1
Thesis Open

Anomaly Detection, Prognostics, and Diagnostics: Machine Learning for the Hadron Calorimeter at the CMS Experiment

  • 1. U Agder Kristiansand

Contributors

  • 1. U Agder Kristiansand

Description

Machine Learning (ML) tools have gained immense popularity due to the proliferation of sensor data for monitoring, prognostic, and diagnostic applications in various industrial domains. The growing system complexity and monitoring data volumes of the Large Hadron Collider (LHC) at CERN accentuate the need for automation through advanced ML tools. Detecting, identifying, and resolving anomalies is essential to generate more physics collision data of the highest quality. Developing ML tools for complex systems often involves expensive data curation and modeling efforts: it requires adequate, cleaned, and annotated data sets, and it must address the heterogeneity and curse of dimensionality of large data sets. The Compact Muon Solenoid (CMS) experiment, one of the large general-purpose experiments at the LHC, has dedicated substantial monitoring efforts to detector systems and particle data quality: the control and safety systems (DCS/DSS) actively monitor safety-critical problems, and the data quality monitoring (DQM) system mitigates data loss by identifying and diagnosing physics data problems. The existing monitoring systems need to incorporate a wide range of monitoring variables and adapt to the evolving conditions of the detectors. This dissertation focuses on the development of unsupervised anomaly detection (AD), anomaly prediction (AP), and root-cause analysis (RCA) on multivariate time series data sets. We have developed deep learning models for the frontend electronics of the Hadron Calorimeter (HCAL) of the CMS detector using diagnostic sensors and high-dimensional particle acquisition channel-monitoring data sets. We have employed subsystem-granularity modeling using a divide-and-conquer approach to monitor the complex HCAL systems with thousands of sensors. Our monitoring tools have detected and identified previously unknown and hard-to-monitor anomalies, and have extended the monitoring, diagnostics, and prognostics automation of the HCAL.
The developed tools are deployed at CERN and currently provide essential real-time and offline anomaly monitoring and diagnostics for the frontend electronics of the HCAL and the online DQM system. Our scientific contributions to tackling the challenges of complex system monitoring include: 1) enhancing multivariate sensor AD, 2) a promising AP approach, 3) context-aware high-dimensional spatio-temporal AD, 4) transfer learning on multi-network deep learning models, 5) lightweight interconnection and divergence discovery for multi-systems with multivariate sensors, and 6) enhancing the computational efficiency of anomaly causality discovery on binary anomaly data.

Files

TS2024_028.pdf

Files (121.8 MB)

Checksum (MD5)                          Size
md5:57f27eab937b83b26edf4836b429433a    40.2 MB
md5:c36e7408382bbd1c06e32dfb23db9c36    40.8 MB
md5:9f0fdd3d5da22eca97c4a86555a9c7d3    40.8 MB

Additional details

Identifiers

CDS
2920461
CDS Report Number
CERN-THESIS-2024-282
CDS Report Number
CMS-TS-2024-028

Related works

Is variant form of
Other: 2867435 (Inspire)
Is version of
978-82-8427-194-1 (ISBN)

CERN

Department
PH
Programme
No program participation
Accelerator
CERN LHC
Experiment
CMS

Linked records