The toolkit provides the ability to use its optimization algorithms through an API. This means that the user embeds the optimization code into their own inference pipeline, which is usually a model validation script for the full-precision model. This section describes a sample that shows how to do this embedding for the ImageNet classification task.
In order to use the optimization features, one should implement the following interfaces required by the optimization process:

- An inference engine. An example implementation can be found in the `engines` folder in the `compression` directory.
- A data loader for the validation dataset. An example implementation for ImageNet can be found in the `sample` folder.
- An accuracy metric, which is required by the `AccuracyAwareQuantization` algorithm and implements an accuracy metric calculation. An example of the Accuracy Top-1 metric can be found in the `sample` folder.

The sample demonstrates quantization of the classification model, uses the API implementations described above, and can be found in the `sample` folder.
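To make the required interfaces concrete, here is a minimal, self-contained sketch of a data loader and a Top-1 accuracy metric. This is an illustration only: the class names, the index-based access pattern (`__len__`/`__getitem__`), and the `update`/`value`/`reset` methods are assumptions modeled on typical post-training-optimization APIs, not the toolkit's exact base classes or signatures.

```python
# Illustrative sketch only: structured after the interfaces the optimization
# process expects; the real API's base classes and signatures may differ.

class ImageNetDataLoader:
    """Index-based dataset access: the optimizer iterates samples by index."""

    def __init__(self, annotations):
        # annotations: list of (image_path, label) pairs
        self._annotations = annotations

    def __len__(self):
        return len(self._annotations)

    def __getitem__(self, index):
        image_path, label = self._annotations[index]
        # A real loader would read and preprocess the image here;
        # this sketch returns the path as a stand-in for the image data.
        return label, image_path


class AccuracyTop1:
    """Running Top-1 accuracy, updated batch by batch during validation."""

    def __init__(self):
        self._matches = 0
        self._total = 0

    def update(self, predicted_labels, target_labels):
        # Accumulate exact-match counts over one batch of predictions.
        for predicted, target in zip(predicted_labels, target_labels):
            self._matches += int(predicted == target)
            self._total += 1

    @property
    def value(self):
        # Current accuracy over everything seen so far.
        return self._matches / self._total if self._total else 0.0

    def reset(self):
        self._matches = 0
        self._total = 0
```

A validation loop would call `update` after each inferred batch and read `value` once the dataset is exhausted; the accuracy-aware algorithm uses this value to decide whether the quantized model stays within the accuracy drop budget.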
In the instructions below, the Post-Training Optimization Tool directory `<INSTALL_DIR>/deployment_tools/tools/post_training_optimization_toolkit` is referred to as `<POT_DIR>`. `<INSTALL_DIR>` is the directory where the Intel® Distribution of OpenVINO™ toolkit is installed.
To run the sample, follow the steps below:

1. Launch the `downloader` tool to download a model from the Open Model Zoo repository.
2. Launch the `converter` tool to generate the IRv10 model.
3. Go to the `sample` folder and launch the sample script. Optionally, the weights file of the model can be specified directly using the `-w`/`--weights` options.

WARNING: The sample works with a predefined central crop and resize. In other words, it suits only models with TensorFlow* preprocessing.