rawdatautils

General Utilities

`file_quality_checker.py`

Jun-18-2026: this script is deprecated as it works with the now-removed WIB2 frames

Usage:

file_quality_checker.py <FILENAME1> [FILENAME2 ...]

This script simply takes a list of HDF5 files produced by the DAQ (globs of course would work on the command line), loops over their records (whether trigger records or timeslices) and performs a few sanity checks. The output of the script when run on one file looks something like the following:

Processing /data2/np04_hd_run018000_0009_dataflow0_datawriter_0_20221118T091751.hdf5.copied...
Will process 27 trigger records.
Processed 6 of 27 records...
Processed 12 of 27 records...
Processed 18 of 27 records...
Processed 27 of 27 records...

Progression of record ids over records looks ok (expected change of 1 for each new record)
Progression of sequence ids over records looks ok (expected change of 0 for each new record)

 FragType         |min # in rec|max # in rec|smallest (B)|largest (B)|min err in rec|max err in rec|
----------------------------------------------------------------------------------------------------
kWIB              |40          |40          |3866696     |3866696    |0             |0             |
----------------------------------------------------------------------------------------------------
kHardwareSignal   |1           |1           |96          |96         |1             |1             |
----------------------------------------------------------------------------------------------------
kTriggerCandidate |1           |1           |128         |128        |1             |1             |
----------------------------------------------------------------------------------------------------

Concerning the progression of record ids and sequence ids: the script will look at the change in record id and sequence id between the first and second records in the file, and extrapolate that as the normal step size. If this step size is violated, then the full list of id values gets printed; otherwise simple one-sentence summaries such as you see above get printed.

Concerning the table: each row corresponds to a fragment type found in at least one record in the file. The first and second columns tell you the fewest instances of such a fragment found in a single record, and the most. Of course, these are typically the same value. The third and fourth columns tell you what the smallest and largest examples of this fragment were in the entire file. The fifth and sixth columns tell you the fewest instances of a fragment with a nonzero set of error bits, and the most, in a single record. Of course for these last two columns you'd ideally see all 0's.

Unpack utilities for Python

There are fast unpackers of data for working in python. These unpackers will take a Fragment and put the resulting values (ADCs or timestamps) in a numpy array with shape (number of frames, number of channels) at a similar speed compared to doing that in C++. This is much faster than doing a similar thing frame by frame in a loop in python.

To use it import the functions first:

from rawdatautils.unpack.<format> import *

where <format> is one of the supported formats: wibeth, tde, daphneeth, or daphne for the corresponding WIBEthFrame, TDEEthFrame, DAPHNEEthFrame and DAPHNEFrame frame formats. Then there are several functions available:

# assumming frag is a fragment
adc = np_array_adc(frag)
timestamp = np.array_timestamp(frag)
print(adc.shape)       # (number of frames in frag)
print(timestamp.shape) # (number of frames in frag)

np_array_adc and np_array_timestamp will unpack the whole fragment. It is also possible to unpack only a part of it:

# assumming frag is a fragment
adc = np_array_adc_data(frag.get_data(), 100)             # first 100 frames
timestamp = np_array_timestamp_data(frag.get_data(), 100) # first 100 frames
print(adc.shape)       # (100)
print(timestamp.shape) # (100)

Warning: np_array_adc_data and np_array_timestamp_data do not make any checks on the number of frames so if passed a value larger than the actual number of frames in the fragment it will try to read out of bounds. np_array_adc and np_array_timestamp call np_array_adc_data and np_array_timestamp_data under the hood with the correct checks on the number of frames.

Name		Name	Last commit message	Last commit date
Latest commit History 400 Commits
.github		.github
cmake		cmake
docs		docs
include/rawdatautils		include/rawdatautils
pybindsrc		pybindsrc
python/rawdatautils		python/rawdatautils
scripts		scripts
src		src
CMakeLists.txt		CMakeLists.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

rawdatautils

General Utilities

`file_quality_checker.py`

Unpack utilities for Python

About

Uh oh!

Releases 1

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

rawdatautils

General Utilities

file_quality_checker.py

Unpack utilities for Python

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

`file_quality_checker.py`

Packages