View on GitHub

pyPreprocessing

Especially useful for preprocessing of datasets like Raman spectra, infrared spectra, UV/Vis spectra, but also HPLC data and many other types of data. pyPreprocessing includes baseline correction, smoothing, filtering, normalization and transformation.

pyPreprocessing documentation

Baseline correction (in baseline_correction.py)

Before baseline correction, data can be smoothed or transformed by methods described below. Implemented baseline correction methods are:

Smoothing (in smooting.py)

Data can be extended on the edges by point mirroring to reduce smoothing artifacts. Data output has the same dimensions like the input. In case of unevenly spaced data, a possibility to interpolate the data to even spacing is given that should be used, and either the interpolated dataset is returned or the dataset is brought back to its original dimensions. Currently implemented methods are:

Filtering (in smoothing.py)

In contrast to smoothing, filtering mehods identify certain data points that need to be dropped without any change of the remaining data. Current methods are:

Transformation (in transform.py)

Contains methods to apply or undo a transformation on the input data. Currently, the only implemented method is:

Normalization (in transform.py)

Normalize the input data. Currently only one method is implemented: