Converting a PyTorch Model#
You can download a PyTorch model from PyTorch Hub or
Hugging Face.
To convert the model, use the openvino.convert_model
function.
Here is the simplest example of PyTorch model conversion using a model from torchvision
:
import torchvision
import torch
import openvino as ov
model = torchvision.models.resnet50(weights='DEFAULT')
ov_model = ov.convert_model(model)
openvino.convert_model
function supports the following PyTorch model object types:
torch.nn.Module
derived classestorch.jit.ScriptModule
torch.jit.ScriptFunction
torch.export.ExportedProgram
When using torch.nn.Module
as an input model, openvino.convert_model
often requires the
example_input
parameter to be specified. Internally, it triggers the model tracing during
the model conversion process, using the capabilities of the torch.jit.trace
function.
The use of example_input
can lead to a better quality OpenVINO model in terms of correctness
and performance compared to converting the same original model without specifying
example_input
. While the necessity of example_input
depends on the implementation
details of a specific PyTorch model, it is recommended to always set the example_input
parameter when it is available.
The value for the example_input
parameter can be easily derived from knowing the input
tensor’s element type and shape. While it may not be suitable for all cases, random numbers can
frequently serve this purpose effectively:
import torchvision
import torch
import openvino as ov
model = torchvision.models.resnet50(weights='DEFAULT')
ov_model = ov.convert_model(model, example_input=torch.rand(1, 3, 224, 224))
In practice, the code to evaluate or test the PyTorch model is usually provided with the model
itself and can be used to generate a proper example_input
value. A modified example of using
resnet50
model from torchvision
is presented below. It demonstrates how to switch
inference in the existing PyTorch application to OpenVINO and how to get value for
example_input
:
from torchvision.io import read_image
from torchvision.models import resnet50, ResNet50_Weights
import requests, PIL, io, torch
# Get a picture of a cat from the web:
img = PIL.Image.open(io.BytesIO(requests.get("https://placekitten.com/200/300").content))
# Torchvision model and input data preparation from https://pytorch.org/vision/stable/models.html
weights = ResNet50_Weights.DEFAULT
model = resnet50(weights=weights)
model.eval()
preprocess = weights.transforms()
batch = preprocess(img).unsqueeze(0)
# PyTorch model inference and post-processing
prediction = model(batch).squeeze(0).softmax(0)
class_id = prediction.argmax().item()
score = prediction[class_id].item()
category_name = weights.meta["categories"][class_id]
print(f"{category_name}: {100 * score:.1f}% (with PyTorch)")
# OpenVINO model preparation and inference with the same post-processing
import openvino as ov
compiled_model = ov.compile_model(ov.convert_model(model, example_input=batch))
prediction = torch.tensor(compiled_model(batch)[0]).squeeze(0).softmax(0)
class_id = prediction.argmax().item()
score = prediction[class_id].item()
category_name = weights.meta["categories"][class_id]
print(f"{category_name}: {100 * score:.1f}% (with OpenVINO)")
Check out more examples in interactive Python tutorials.
Note
In the examples above the openvino.save_model
function is not used because there are no
PyTorch-specific details regarding the usage of this function. In all examples, the converted
OpenVINO model can be saved to IR by calling ov.save_model(ov_model, 'model.xml')
as usual.
Supported Input Parameter Types#
If the model has a single input, the following input types are supported in example_input
:
openvino.Tensor
torch.Tensor
tuple
or any nested combination of tuples
If a model has multiple inputs, the input values are combined in a list
, a tuple
, or a
dict
:
values in a
list
ortuple
should be passed in the same order as the original model specifies,dict
has keys from the names of the original model argument names.
Enclosing in list
, tuple
or dict
can be used for a single input as well as for
multiple inputs.
If a model has a single input parameter and the type of this input is a tuple
, it should be
always passed enclosed into an extra list
, tuple
or dict
as in the case of multiple
inputs. It is required to eliminate ambiguity between model((a, b))
and model(a, b)
in
this case.
Non-tensor Data Types#
When a non-tensor data type, such as a tuple
or dict
, appears in a model input or output,
it is flattened. The flattening means that each element within the tuple
will be represented
as a separate input or output. The same is true for dict
values, where the keys of the
dict
are used to form a model input/output name. The original non-tensor input or output is
replaced by one or multiple new inputs or outputs resulting from this flattening process. This
flattening procedure is applied recursively in the case of nested tuples
, lists
, and
dicts
until it reaches the assumption that the most nested data type is a tensor.
For example, if the original model is called with example_input=(a, (b, c, (d, e)))
, where
a
, b
, … e
are tensors, it means that the original model has two inputs. The first
is a tensor a
, and the second is a tuple (b, c, (d, e))
, containing two tensors b
and c
and a nested tuple (d, e)
. Then the resulting OpenVINO model will have signature
(a, b, c, d, e)
, which means it will have five inputs, all of type tensor, instead of two in
the original model.
If your model has a dict
input, such as, {"x": a, "y": b, "z": c}
, it will be decomposed
into multiple inputs of the OpenVINO model signature: (a, b, c)
, where inputs assume the
names of x
, y
, and z
respectively.
Note
An important consequence of flattening is that only tuple
and dict
with a fixed number
of elements and key values are supported. The structure of such inputs should be fully
described in the example_input
parameter of convert_model
. The flattening on outputs
should be reproduced with the given example_input
and cannot be changed once the
conversion is done.
Check out more examples of model conversion with non-tensor data types in the following tutorials:
Input and output names of the model#
PyTorch doesn’t produce relevant names for model inputs and outputs in the TorchScript
representation. OpenVINO will assign input names based on the signature of models’s forward
method or dict
keys provided in the example_input
. Output names will be assigned if
there is a dict
at the output or when there is some internal name available in the
TorchScript model representation. In general, the output name is not assigned and stays empty.
It is recommended to address model outputs by the index rather then the name.
Support for torch.export#
torch.export is the current way to get a graph
representation of a model (since PyTorch 2.1). It produces ExportedProgram
which includes
the graph representation in the FX format. To see why it has an advantage over the TorchScript
representation, refer to PyTorch documentation.
Here is an example of how to convert a model obtained with torch.export
:
from torchvision.models import resnet50, ResNet50_Weights
from torch.export import export
from openvino import convert_model
model = resnet50(weights=ResNet50_Weights.DEFAULT)
model.eval()
exported_model = export(model, (torch.randn(1, 3, 224, 224),))
ov_model = convert_model(exported_model)
Converting a PyTorch Model from Disk#
PyTorch can save models in two formats: torch.jit.ScriptModule
and torch.export.ExportedProgram
.
Both formats may be saved to drive as standalone files and reloaded later, independently of the
original Python code.
ExportedProgram Format#
You can save the ExportedProgram
format using
torch.export.save().
Here is an example of how to convert it:
import openvino as ov
ov_model = ov.convert_model('exported_program.pt2')
ovc exported_program.pt2
ScriptModule Format#
torch.jit.save() serializes
the ScriptModule
object on a drive. To convert the serialized ScriptModule
format, run
the convert_model
function with example_input
parameter as follows:
from openvino import convert_model
import torch
convert_model(input_model='script_module.pt', example_input=torch.rand(1, 10))
example_input
is the required parameter for the conversion because torch.jit.ScriptModule
object is always saved in an untraced state on disk.
Exporting a PyTorch Model to ONNX Format#
An alternative method of converting a PyTorch models is to export it to ONNX first
(with torch.onnx.export
) and then convert the resulting .onnx
file to the OpenVINO IR
model (with openvino.convert_model
). It should be considered a backup solution, if a model
cannot be converted directly, as described previously. Converting through ONNX can be more
expensive in terms of code overhead, conversion time, and allocated memory.
Refer to the Exporting PyTorch models to ONNX format guide to learn how to export models from PyTorch to ONNX.
Follow the Convert an ONNX model guide to produce OpenVINO IR.
Here is an illustration of using these two steps together:
import torchvision
import torch
import openvino as ov
model = torchvision.models.resnet50(weights='DEFAULT')
# 1. Export to ONNX
torch.onnx.export(model, (torch.rand(1, 3, 224, 224), ), 'model.onnx')
# 2. Convert to OpenVINO
ov_model = ov.convert_model('model.onnx')
Note
As of version 1.8.1, not all PyTorch operations can be exported to ONNX opset 9 which is
used by default. It is recommended to export models to opset 11 or higher when export to
default opset 9 is not working. In that case, use opset_version
option of the
torch.onnx.export
. For more information about ONNX opset, refer to the
Operator Schemas page.