Professional Documents
Culture Documents
-----
this document
~1Hz power readings, whole home and circuits
aligned and group current/voltage waveforms
raw current/voltage waveforms
labels.dat
channel_{1..k}.dat
102.964
103.125
104.001
102.994
102.361
102.589
house_{1..n}/
current_1.dat
current_2.dat
voltage.dat
-----
The data files are text files, where each line contains:
1) A decimal UTC timestamp, in the same format as the timestamps for
the low frequency data, but allowing for fractional parts
2) A cycle count. Although this is represented in the file as a
double, it is in fact an integer that indicates for how many AC cycles
this particular waveform remains.
3) 275 decimal values, indicating the value of the waveform (in amps or
volts), at equally-spaced portions of the cycle.
Thus, an example file might be:
1297340206.597013 135.000000 0.000000 3.623859 7.254136 10.949398 ...
1297340208.844086 722.000000 0.000000 3.638527 7.249567 10.929027 ...
....
Indicating that the waveform in the first line occurred first at
timestamp 1297340206.597013 and lasted for 135 cycles.
--------------------------------------------------------------------------High Frequency Raw Data
--------------------------------------------------------------------------Finally, the high_freq_raw/ directory contains raw current and voltage
waveforms (unaligned and without compression), for a small number of
sample points throughout the data. This is main intended for those
who wish to test different compression/filtering methods beyond what
we do in the high_freq/ data. Although for practicality we are not
planning to broadly distribute raw data for the entire data set (this
would consist of more than a terabyte of data), if other groups are
able to develop substantially better compression/filtering techniques
then we'd be happy to share the full data or run these proposed
algorithms on the full uncompressed data set.
The high_freq_raw/ directory is organized similar to the high_freq/
directory, except that each current_1, current_2, and voltage files
are instead themselves directories that contain raw binary data:
high_freq_raw/
house_{1..n}/
current_1/
<timestamp>.bz2
...
current_2/
voltage/