Model Representation in OpenVINO™ Runtime¶
In OpenVINO™ Runtime, a model is represented by the ov::Model
class.
The ov::Model
object stores shared pointers to ov::op::v0::Parameter
, ov::op::v0::Result
, and ov::op::Sink
operations, which are inputs, outputs, and sinks of the graph. Sinks of the graph have no consumers and are not included in the results vector. All other operations hold each other via shared pointers, in which a child operation holds its parent via a hard link. If an operation has no consumers and is neither the Result
nor the Sink
operation whose shared pointer counter is zero, the operation will be destructed and not be accessible anymore.
Each operation in ov::Model
has the std::shared_ptr<ov::Node>
type.
How OpenVINO Runtime Works with Models¶
OpenVINO™ Runtime enables you to use different approaches to work with model inputs/outputs:
The
ov::Model::inputs()
/ov::Model::outputs()
methods are used to get vectors of all input/output ports.For a model that has only one input or output, you can use the
ov::Model::input()
orov::Model::output()
methods without any arguments to get input or output port respectively.The
ov::Model::input()
andov::Model::output()
methods can be used with the index of inputs or outputs from the framework model to get specific ports by index.You can use the tensor name of input or output from the original framework model together with the
ov::Model::input()
orov::Model::output()
methods to get specific ports. It means that you do not need to have any additional mapping of names from framework to OpenVINO as it was before. OpenVINO™ Runtime allows the usage of native framework tensor names, for example:
For details on how to build a model in OpenVINO™ Runtime, see the Build a Model in OpenVINO™ Runtime section.
OpenVINO™ Runtime model representation uses special classes to work with model data types and shapes. The ov::element::Type
is used for data types. See the section below for representation of shapes.
Representation of Shapes¶
OpenVINO™ Runtime provides two types for shape representation:
ov::Shape
- Represents static (fully defined) shapes.ov::PartialShape
- Represents dynamic shapes. This means that the rank or some of dimensions are dynamic (dimension defines an interval or undefined).
ov::PartialShape
can be converted to ov::Shape
by using the get_shape()
method if all dimensions are static; otherwise, the conversion will throw an exception. For example:
ov::Shape static_shape;
ov::PartialShape partial_shape = node->output(0).get_partial_shape(); // get zero output partial shape
if (!partial_shape.is_dynamic() /\* or partial_shape.is_static() \*/) {
static_shape = partial_shape.get_shape();
}
partial_shape = node.output(0).get_partial_shape() # get zero output partial shape
if not partial_shape.is_dynamic: # or partial_shape.is_static
static_shape = partial_shape.get_shape()
However, in most cases, before getting static shape using the get_shape()
method, you need to check if that shape is static.
Representation of Operations¶
The ov::Op
class represents any abstract operation in the model representation. Use this class to create custom operations.
Representation of Operation Sets¶
An operation set (opset) is a collection of operations that can be used to construct a model. The ov::OpSet
class provides the functionality to work with operation sets. For each operation set, OpenVINO™ Runtime provides a separate namespace, for example opset8
.
Each OpenVINO™ Release introduces new operations and adds them to new operation sets, within which the new operations would change the behavior of previous operations. Using operation sets helps you avoid changing your application when new operations are introduced. For a complete list of operation sets supported in OpenVINO™ toolkit, see the Available Operations Sets. To add the support for custom operations, see OpenVINO Extensibility Mechanism.
Building a Model in OpenVINO™ Runtime¶
You can create a model from source. This section illustrates how to construct a model composed of operations from an available operation set.
Operation set opsetX
integrates a list of pre-compiled operations that work for this purpose. In other words, opsetX
defines a set of operations for building a graph.
To build an ov::Model
instance from opset8
operations, include the following files:
#include <openvino/core/model.hpp>
#include <openvino/opsets/opset8.hpp>
import openvino.runtime as ov
The following code demonstrates how to create a simple model:
std::shared_ptr<ov::Model> create_simple_model() {
// This example shows how to create ov::Model
//
// Parameter--->Multiply--->Add--->Result
// Constant---' /
// Constant---'
// Create opset8::Parameter operation with static shape
auto data = std::make_shared<ov::opset8::Parameter>(ov::element::f32, ov::Shape{3, 1, 2});
auto mul_constant = ov::opset8::Constant::create(ov::element::f32, ov::Shape{1}, {1.5});
auto mul = std::make_shared<ov::opset8::Multiply>(data, mul_constant);
auto add_constant = ov::opset8::Constant::create(ov::element::f32, ov::Shape{1}, {0.5});
auto add = std::make_shared<ov::opset8::Add>(mul, add_constant);
// Create opset8::Result operation
auto res = std::make_shared<ov::opset8::Result>(mul);
// Create nGraph function
return std::make_shared<ov::Model>(ov::ResultVector{res}, ov::ParameterVector{data});
}
def create_simple_model():
# This example shows how to create ov::Function
#
# Parameter--->Multiply--->Add--->Result
# Constant---' /
# Constant---'
data = ov.opset8.parameter([3, 1, 2], ov.Type.f32)
mul_constant = ov.opset8.constant([1.5], ov.Type.f32)
mul = ov.opset8.multiply(data, mul_constant)
add_constant = ov.opset8.constant([0.5], ov.Type.f32)
add = ov.opset8.add(mul, add_constant)
res = ov.opset8.result(add)
return ov.Model([res], [data], "model")
The following code creates a model with several outputs:
std::shared_ptr<ov::Model> create_advanced_model() {
// Advanced example with multi output operation
//
// Parameter->Split---0-->Result
// | `--1-->Relu-->Result
// `----2-->Result
auto data = std::make_shared<ov::opset8::Parameter>(ov::element::f32, ov::Shape{1, 3, 64, 64});
// Create Constant for axis value
auto axis_const = ov::opset8::Constant::create(ov::element::i64, ov::Shape{} /\*scalar shape\*/, {1});
// Create opset8::Split operation that splits input to three slices across 1st dimension
auto split = std::make_shared<ov::opset8::Split>(data, axis_const, 3);
// Create opset8::Relu operation that takes 1st Split output as input
auto relu = std::make_shared<ov::opset8::Relu>(split->output(1) /\*specify explicit output\*/);
// Results operations will be created automatically based on provided OutputVector
return std::make_shared<ov::Model>(ov::OutputVector{split->output(0), relu, split->output(2)},
ov::ParameterVector{data});
}
def create_advanced_model():
# Advanced example with multi output operation
#
# Parameter->Split---0-->Result
# | `--1-->Relu-->Result
# `----2-->Result
data = ov.opset8.parameter(ov.Shape([1, 3, 64, 64]), ov.Type.f32)
# Create Constant for axis value
axis_const = ov.opset8.constant(ov.Type.i64, ov.Shape({}), [1])
# Create opset8::Split operation that splits input to three slices across 1st dimension
split = ov.opset8.split(data, axis_const, 3)
# Create opset8::Relu operation that takes 1st Split output as input
relu = ov.opset8.relu(split.output(1))
# Results operations will be created automatically based on provided OutputVector
return ov.Model([split.output(0), relu, split.output[2]], [data], "model")
Model Debugging Capabilities¶
OpenVINO™ provides several debug capabilities:
To receive additional messages about applied model modifications, rebuild the OpenVINO™ Runtime library with the
-DENABLE_OPENVINO_DEBUG=ON
option.Model can be visualized to image from the xDot format:
void visualize_example(const std::shared_ptr<ov::Model>& m) { // Need include: // \* openvino/pass/manager.hpp // \* openvino/pass/visualize_tree.hpp ov::pass::Manager manager; // Serialize ov::Model to before.svg file before transformation manager.register_pass<ov::pass::VisualizeTree>("image.svg"); manager.run_passes(m); }
def visualize_example(m : ov.Model): # Need import: # \* import openvino.runtime.passes as passes pass_manager = passes.Manager() pass_manager.register_pass(pass_name="VisualTree", file_name='image.svg') pass_manager.run_passes(m)
`ov::pass::VisualizeTree` can be parametrized via environment variables: OV_VISUALIZE_TREE_OUTPUT_SHAPES=1 - visualize shapes OV_VISUALIZE_TREE_OUTPUT_TYPES=1 - visualize types OV_VISUALIZE_TREE_MIN_MAX_DENORMAL=1 - pretty denormal values OV_VISUALIZE_TREE_RUNTIME_INFO=1 - print runtime information OV_VISUALIZE_TREE_IO=1 - print I/O ports OV_VISUALIZE_TREE_MEMBERS_NAME=1 - print member names
Also model can be serialized to IR:
void serialize_example(const std::shared_ptr<ov::Model>& model) { ov::serialize(model, "/path/to/file/model.xml", "/path/to/file/model.bin"); }
def serialize_example(m : ov.Model): from openvino.runtime import serialize serialize(m, xml_path='model.xml', bin_path='model.bin')