Orthos, an alarm system for the ALICE DAQ operations

Sylvain Chapeland, Franco Carena, Wisla Carena, Vasco Chibante Barroso, Filippo Costa, E. Dénes, Roberto Divia, Ulrich Fuchs, Alexandru Grigore, Giuseppe Simonetti, Csaba Soos, Adriana Telesca, Pierre Vande Vyvre, Barthelemy Von Haller

Research output: Contribution to journalArticle

2 Citations (Scopus)

Abstract

ALICE (A Large Ion Collider Experiment) is the heavy-ion detector studying the physics of strongly interacting matter and the quark-gluon plasma at the CERN LHC (Large Hadron Collider). The DAQ (Data Acquisition System) facilities handle the data flow from the detectors electronics up to the mass storage. The DAQ system is based on a large farm of commodity hardware consisting of more than 600 devices (Linux PCs, storage, network switches), and controls hundreds of distributed hardware and software components interacting together. This paper presents Orthos, the alarm system used to detect, log, report, and follow-up abnormal situations on the DAQ machines at the experimental area. The main objective of this package is to integrate alarm detection and notification mechanisms with a full-featured issues tracker, in order to prioritize, assign, and fix system failures optimally. This tool relies on a database repository with a logic engine, SQL interfaces to inject or query metrics, and dynamic web pages for user interaction. We describe the system architecture, the technologies used for the implementation, and the integration with existing monitoring tools.

Original languageEnglish
Article number012013
JournalJournal of Physics: Conference Series
Volume396
Issue numberPART 1
DOIs
Publication statusPublished - 2012

Fingerprint

warning systems
data acquisition
hardware
system failures
commodities
ions
detectors
fixing
logic
engines
heavy ions
switches
quarks
computer programs
physics
electronics
interactions

ASJC Scopus subject areas

  • Physics and Astronomy(all)

Cite this

Chapeland, S., Carena, F., Carena, W., Chibante Barroso, V., Costa, F., Dénes, E., ... Von Haller, B. (2012). Orthos, an alarm system for the ALICE DAQ operations. Journal of Physics: Conference Series, 396(PART 1), [012013]. https://doi.org/10.1088/1742-6596/396/1/012013

Orthos, an alarm system for the ALICE DAQ operations. / Chapeland, Sylvain; Carena, Franco; Carena, Wisla; Chibante Barroso, Vasco; Costa, Filippo; Dénes, E.; Divia, Roberto; Fuchs, Ulrich; Grigore, Alexandru; Simonetti, Giuseppe; Soos, Csaba; Telesca, Adriana; Vande Vyvre, Pierre; Von Haller, Barthelemy.

In: Journal of Physics: Conference Series, Vol. 396, No. PART 1, 012013, 2012.

Research output: Contribution to journalArticle

Chapeland, S, Carena, F, Carena, W, Chibante Barroso, V, Costa, F, Dénes, E, Divia, R, Fuchs, U, Grigore, A, Simonetti, G, Soos, C, Telesca, A, Vande Vyvre, P & Von Haller, B 2012, 'Orthos, an alarm system for the ALICE DAQ operations', Journal of Physics: Conference Series, vol. 396, no. PART 1, 012013. https://doi.org/10.1088/1742-6596/396/1/012013
Chapeland S, Carena F, Carena W, Chibante Barroso V, Costa F, Dénes E et al. Orthos, an alarm system for the ALICE DAQ operations. Journal of Physics: Conference Series. 2012;396(PART 1). 012013. https://doi.org/10.1088/1742-6596/396/1/012013
Chapeland, Sylvain ; Carena, Franco ; Carena, Wisla ; Chibante Barroso, Vasco ; Costa, Filippo ; Dénes, E. ; Divia, Roberto ; Fuchs, Ulrich ; Grigore, Alexandru ; Simonetti, Giuseppe ; Soos, Csaba ; Telesca, Adriana ; Vande Vyvre, Pierre ; Von Haller, Barthelemy. / Orthos, an alarm system for the ALICE DAQ operations. In: Journal of Physics: Conference Series. 2012 ; Vol. 396, No. PART 1.
@article{d78e71aa04c5474c8c4cf00e64bf15c0,
title = "Orthos, an alarm system for the ALICE DAQ operations",
abstract = "ALICE (A Large Ion Collider Experiment) is the heavy-ion detector studying the physics of strongly interacting matter and the quark-gluon plasma at the CERN LHC (Large Hadron Collider). The DAQ (Data Acquisition System) facilities handle the data flow from the detectors electronics up to the mass storage. The DAQ system is based on a large farm of commodity hardware consisting of more than 600 devices (Linux PCs, storage, network switches), and controls hundreds of distributed hardware and software components interacting together. This paper presents Orthos, the alarm system used to detect, log, report, and follow-up abnormal situations on the DAQ machines at the experimental area. The main objective of this package is to integrate alarm detection and notification mechanisms with a full-featured issues tracker, in order to prioritize, assign, and fix system failures optimally. This tool relies on a database repository with a logic engine, SQL interfaces to inject or query metrics, and dynamic web pages for user interaction. We describe the system architecture, the technologies used for the implementation, and the integration with existing monitoring tools.",
author = "Sylvain Chapeland and Franco Carena and Wisla Carena and {Chibante Barroso}, Vasco and Filippo Costa and E. D{\'e}nes and Roberto Divia and Ulrich Fuchs and Alexandru Grigore and Giuseppe Simonetti and Csaba Soos and Adriana Telesca and {Vande Vyvre}, Pierre and {Von Haller}, Barthelemy",
year = "2012",
doi = "10.1088/1742-6596/396/1/012013",
language = "English",
volume = "396",
journal = "Journal of Physics: Conference Series",
issn = "1742-6588",
publisher = "IOP Publishing Ltd.",
number = "PART 1",

}

TY - JOUR

T1 - Orthos, an alarm system for the ALICE DAQ operations

AU - Chapeland, Sylvain

AU - Carena, Franco

AU - Carena, Wisla

AU - Chibante Barroso, Vasco

AU - Costa, Filippo

AU - Dénes, E.

AU - Divia, Roberto

AU - Fuchs, Ulrich

AU - Grigore, Alexandru

AU - Simonetti, Giuseppe

AU - Soos, Csaba

AU - Telesca, Adriana

AU - Vande Vyvre, Pierre

AU - Von Haller, Barthelemy

PY - 2012

Y1 - 2012

N2 - ALICE (A Large Ion Collider Experiment) is the heavy-ion detector studying the physics of strongly interacting matter and the quark-gluon plasma at the CERN LHC (Large Hadron Collider). The DAQ (Data Acquisition System) facilities handle the data flow from the detectors electronics up to the mass storage. The DAQ system is based on a large farm of commodity hardware consisting of more than 600 devices (Linux PCs, storage, network switches), and controls hundreds of distributed hardware and software components interacting together. This paper presents Orthos, the alarm system used to detect, log, report, and follow-up abnormal situations on the DAQ machines at the experimental area. The main objective of this package is to integrate alarm detection and notification mechanisms with a full-featured issues tracker, in order to prioritize, assign, and fix system failures optimally. This tool relies on a database repository with a logic engine, SQL interfaces to inject or query metrics, and dynamic web pages for user interaction. We describe the system architecture, the technologies used for the implementation, and the integration with existing monitoring tools.

AB - ALICE (A Large Ion Collider Experiment) is the heavy-ion detector studying the physics of strongly interacting matter and the quark-gluon plasma at the CERN LHC (Large Hadron Collider). The DAQ (Data Acquisition System) facilities handle the data flow from the detectors electronics up to the mass storage. The DAQ system is based on a large farm of commodity hardware consisting of more than 600 devices (Linux PCs, storage, network switches), and controls hundreds of distributed hardware and software components interacting together. This paper presents Orthos, the alarm system used to detect, log, report, and follow-up abnormal situations on the DAQ machines at the experimental area. The main objective of this package is to integrate alarm detection and notification mechanisms with a full-featured issues tracker, in order to prioritize, assign, and fix system failures optimally. This tool relies on a database repository with a logic engine, SQL interfaces to inject or query metrics, and dynamic web pages for user interaction. We describe the system architecture, the technologies used for the implementation, and the integration with existing monitoring tools.

UR - http://www.scopus.com/inward/record.url?scp=84873283780&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84873283780&partnerID=8YFLogxK

U2 - 10.1088/1742-6596/396/1/012013

DO - 10.1088/1742-6596/396/1/012013

M3 - Article

AN - SCOPUS:84873283780

VL - 396

JO - Journal of Physics: Conference Series

JF - Journal of Physics: Conference Series

SN - 1742-6588

IS - PART 1

M1 - 012013

ER -