Class ov::op::internal::DynamicQuantize#

class DynamicQuantize : public ov::op::Op#

Operator performing Dynamic Quantize.

Public Types

enum class QuantizationType#

Configuration for the type of quantization applied to the data:

  • Symmetric: Quantization where the zero point is fixed at zero, and the range is symmetric around zero.

  • Asymmetric: Quantization where the zero point is not fixed at zero.

Values:

enumerator Symmetric#
enumerator Asymmetric#
enum class OutputStorageType#

Configuration for how Activations, Scales and Zero Points will be stored in output buffers:

  • Planar: Activations, Scales, and Zero Points are stored in independent buffers.

  • InterleavedScalesZP: Activations are stored in an independent buffer, while Scales and Zero Points (if any) are combined in a separate buffer.

Values:

enumerator Planar#
enumerator InterleavedScalesZP#

Public Functions

DynamicQuantize(const Output<Node> &data, const Attributes &attrs)#

Constructs an DynamicQuantize operation.

Parameters:
  • dataInput tensor with data

  • config – Dynamic quantization configuration

virtual void validate_and_infer_types() override#

Verifies that attributes and inputs are consistent and computes output shapes and element types. Must be implemented by concrete child classes so that it can be run any number of times.

Throws if the node is invalid.

struct Attributes#

Structure that specifies attributes for interpolation.