Professional Documents
Culture Documents
Home About EVRI Applications Consulting Software Training Resources Contact Us Search Site Search
Data Sets > NIR Spectra
There are three formats of these data: Matlab DataSet objects, Standard Matlab variables, and CSV files. This data was obtained at Soutwest
Research Institute (SWRI) on a project sponsored by the U.S. Army. Many thanks to them for letting us post it here!
The file "SWRI_Diesel_NIR.zip" contains a .mat file which can be loaded into MATLAB. This .mat file contains two dataset objects: One includes all the
raw unpreprocessed spectra (diesel_spec) and another that is all the properties (diesel_prop). Some of the properties are not measured on some of
the samples, so diesel_prop has some missing values (NaNs) in it. The wavelength axis is included as axisscale in the diesel_spec. If you don't have
PLS_Toolbox or our freeware for the DataSet Object, these two variables should turn into structures when you load them into MATLAB.
The following are .zip files of separate .mat files, each with standard Matlab variables containing the same data as above. There are 6 workspace
variables in each file, 3 for the spectra and 3 matching ones for the property value. In each case the data includes 20 high leverage samples (_hl) and
the remaining samples are split into two random groups (_ll_a and _ll_b). These spectra can be used to test variable selection and calibration
algorithms. For instance, you can use the high leverage samples and one of the other sets to make a calibration model (say the _hl and _ll_a), then
test it on the third set (the _ll_b). In all cases the data have been pretty thoroughly weeded: outliers removed, and all samples belong to the same
class (all summer fuels, no winter fuels).
All of the files end in GATEST because we've used the data to test genetic algorithms for variable selection.
The file "SWRI_Diesel_NIR_CSV.zip" contains two .csv files. One includes all the raw unpreprocessed spectra (diesel_spec) and another that is all
the properties (diesel_prop). Some of the properties are not measured on some of the samples, so diesel_prop has some missing values (NaNs) in
it. The wavelength axis is included as axisscale in the diesel_spec.
http://www.eigenvector.com/data/SWRI/index.html 1/1