_images/autodqm_logo.png

AutoDQM is a statistical tool for Data Quality Management (DQM) at the Large Hadron Particle Collider.

  1. Motivation

  2. The Statistical Arsenal

  3. Example Output

Motivation

_images/D_plus_R_equals.png

Before AutoDQM, DQM shifters were tasked with looking at hundreds of DQM histograms, looking for hard-to-spot issues in data collection. AutoDQM runs long established statistical tests using ROOT on these graphs and outputs outliers on a simple, but effective, GUI that expedites the shifters’ task.

The Statistical Arsenal

AutoDQM uses a variety of long-established statistical tests to determine how similar a “data” run is to a reference run selected by the DQM shifter.

Bin-by-Bin Pull Values

_images/pull_values.png

For 2D histograms AutoDQM goes bin-by-bin through both the data and reference histograms, then plots the difference between each corresponding bin onto a new, identical histogram, taking proper Poisson errors into account. The equation for this calculation is fairly simple:

_images/pull_eq.png

where “x” is the bin value and epsilon is the error of that bin value. The code for this can be found here.

Kolmogorov-Smirnov Test

_images/ks_test.png

The Kolmogorov-Smirnov test compares the distance between points on a sample distribution and some reference distribution and qualifies how close or far they are. It is too complex to discuss here, but Wikipedia has a good article on it.