How It Works

The demo application expects an image retrieval model in the Intermediate Representation (IR) format.

As input, the demo application takes:

a path to a list of images represented by textfile with following format 'path_to_image' 'ID' --images
a path to a video file or a device node of a web-camera specified with a command line argument --video

The demo workflow is the following:

The demo application reads video frames one by one, runs ROI detector that extracts ROI (moving area).
Extracted ROI is passed to artificial neural network that computes embedding vector for extracted frame area.
Then the demo application searches computed embedding in gallery of images in order to determine which image in the gallery is the most similar to what one can see on video frame.
The app visualizes results of it work as graphical window where following objects are shown.
- Input frame with detected ROI.
- Top-10 most similar images from the gallery.
- Performance characteristics.

NOTE: By default, Open Model Zoo demos expect input with BGR channels order. If you trained your model to work with RGB order, you need to manually rearrange the default channels order in the demo application or reconvert your model using the Model Optimizer tool with --reverse_input_channels argument specified. For more information about the argument, refer to When to Reverse Input Channels section of Converting a Model Using General Conversion Parameters.

Running

Run the application with the -h option to see the following usage message:

usage: image_retrieval_demo.py [-h] -m MODEL -i I -g GALLERY
                               [-gt GROUND_TRUTH] [-d DEVICE]
                               [-l CPU_EXTENSION] [--no_show]
                               [-u UTILIZATION_MONITORS]
Options:
  -h, --help            Show this help message and exit.
  -m MODEL, --model MODEL
                        Required. Path to an .xml file with a trained model.
  -i I                  Required. Path to a video file or a device node of a
                        web-camera.
  -g GALLERY, --gallery GALLERY
                        Required. Path to a file listing gallery images.
  -gt GROUND_TRUTH, --ground_truth GROUND_TRUTH
                        Optional. Ground truth class.
  -d DEVICE, --device DEVICE
                        Optional. Specify the target device to infer on: CPU,
                        GPU, FPGA, HDDL or MYRIAD. The demo will look for a
                        suitable plugin for device specified (by default, it
                        is CPU).
  -l CPU_EXTENSION, --cpu_extension CPU_EXTENSION
                        Optional. Required for CPU custom layers. Absolute
                        path to a shared library with the kernels
                        implementations.
  --no_show             Optional. Do not visualize inference results.
  -u UTILIZATION_MONITORS, --utilization_monitors UTILIZATION_MONITORS
                        Optional. List of monitors to show initially.

Running the application with an empty list of options yields the short version of the usage message and an error message.

To run the demo, you can use public or pre-trained models. To download the pre-trained models, use the OpenVINO Model Downloader or go to https://download.01.org/opencv/.

NOTE: Before running the demo with a trained model, make sure the model is converted to the Inference Engine format (*.xml + *.bin) using the Model Optimizer tool.

To run the demo, please provide paths to the model in the IR format, to a file with class labels, and to an input video, image, or folder with images:

python image_retrieval_demo.py \
-m /home/user/image-retrieval-0001.xml \
-i /home/user/video.dav.mp4 \
-g /home/user/list.txt \
--ground_truth text_label

An example of file listing gallery images can be found here.

Examples of videos can be found here.

Demo Output

The application uses OpenCV to display gallery searching result and current inference performance.

How It Works

Running

Demo Output

See Also