Hello NV12 Input Classification C Sample¶
This sample demonstrates how to execute an inference of image classification networks like AlexNet with images in NV12 color format using Synchronous Inference Request API.
Hello NV12 Input Classification C Sample demonstrates how to use the NV12 automatic input pre-processing API in your applications:
Feature |
API |
Description |
---|---|---|
Node Operations |
|
Get a layer name |
Infer Request Operations |
|
Operate with tensors |
Preprocessing |
|
Change the color format of the input data |
Basic Inference Engine API is covered by Hello Classification C sample.
Options |
Values |
---|---|
Validated Models |
alexnet |
Model Format |
Inference Engine Intermediate Representation (*.xml + *.bin), ONNX (*.onnx) |
Validated images |
An uncompressed image in the NV12 color format - *.yuv |
Supported devices |
|
Other language realization |
How It Works¶
Upon the start-up, the sample application reads command-line parameters, loads specified network and an image in the NV12 color format to an Inference Engine plugin. Then, the sample creates an synchronous inference request object. When inference is done, the application outputs data to the standard output stream.
You can see the explicit description of each sample step at Integration Steps section of “Integrate OpenVINO™ Runtime with Your Application” guide.
Building¶
To build the sample, please use instructions available at Build the Sample Applications section in Inference Engine Samples guide.
Running¶
To run the sample, you need specify a model and image:
you can use public or Intel’s pre-trained models from the Open Model Zoo. The models can be downloaded using the Model Downloader.
you can use images from the media files collection available at https://storage.openvinotoolkit.org/data/test_data.
The sample accepts an uncompressed image in the NV12 color format. To run the sample, you need to convert your BGR/RGB image to NV12. To do this, you can use one of the widely available tools such as FFmpeg* or GStreamer*. The following command shows how to convert an ordinary image into an uncompressed NV12 image using FFmpeg:
ffmpeg -i cat.jpg -pix_fmt nv12 cat.yuv
NOTES :
Because the sample reads raw image files, you should provide a correct image size along with the image path. The sample expects the logical size of the image, not the buffer size. For example, for 640x480 BGR/RGB image the corresponding NV12 logical image size is also 640x480, whereas the buffer size is 640x720.
By default, this sample expects that network input has BGR channels order. If you trained your model to work with RGB order, you need to reconvert your model using the Model Optimizer tool with
--reverse_input_channels
argument specified. For more information about the argument, refer to When to Reverse Input Channels section of Embedding Preprocessing Computation.Before running the sample with a trained model, make sure the model is converted to the Inference Engine format (*.xml + *.bin) using the Model Optimizer tool.
The sample accepts models in ONNX format (.onnx) that do not require preprocessing.
Example¶
Download a pre-trained model using Model Downloader :
python <path_to_omz_tools>/downloader.py --name alexnet
If a model is not in the Inference Engine IR or ONNX format, it must be converted. You can do this using the model converter script:
python <path_to_omz_tools>/converter.py --name alexnet
Perform inference of NV12 image using
alexnet
model on aCPU
, for example:<path_to_sample>/hello_nv12_input_classification_c <path_to_model>/alexnet.xml <path_to_image>/cat.yuv 300x300 CPU
Sample Output¶
The application outputs top-10 inference results.
Top 10 results:
Image ./cat.yuv
classid probability
------- -----------
435 0.091733
876 0.081725
999 0.069305
587 0.043726
666 0.038957
419 0.032892
285 0.030309
700 0.029941
696 0.021628
855 0.020339
This sample is an API example, for any performance measurements please use the dedicated benchmark_app tool