Override Defaults via Config File¶
For more complex configurations, you will probably prefer using a YAML config file to drive the preprocessor.
Examples¶
Creating a Custom Training Dataset¶
Create a custom configuration file configs/preprocess/train_demo_dataset.yaml:
- Always define defaults in the header of your custom config file.
- Uses the default preprocess config within the canari-ml codebase as base config.
- Override the above defaults with values from this file. (The order matters,
_self_should be defined last to override previous configs in this list). - If you do not want anomaly variables, you can set
anomaly: null.
You can now run the preprocess command and point to this custom config file (just like in the download section).
Create a Prediction Dataset¶
Note
This step is only needed after a trained model is generated or made available.
The approach to creating a prediction dataset is very similar to creating a training dataset. The main difference is that it needs to use the same normalisation parameters as the training dataset used to train the model. And, there is no cached Zarr dataset generated since it would not be worth it for the prediction step.
Using config file¶
| configs/preprocess/predict_trial_dataset.yaml | |
|---|---|
To generate the prediction dataset, run: