Class ov::op::internal::DynamicQuantize
-
class DynamicQuantize : public ov::op::Op
Operator that performs dynamic quantization.
Public Types
-
enum class QuantizationType
Configuration for the type of quantization applied to the data:
Symmetric: Quantization where the zero point is fixed at zero, and the range is symmetric around zero.
Asymmetric: Quantization where the zero point is not fixed at zero.
Values:
-
enumerator Symmetric
-
enumerator Asymmetric
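The difference between the two quantization types can be illustrated with a minimal, self-contained sketch. It is not OpenVINO's implementation: the helper names (`QuantParams`, `symmetric_params`, `asymmetric_params`, `quantize`, `dequantize`), the int8 target type, and the per-tensor granularity are all assumptions made for illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstdint>
#include <vector>

// Hypothetical helper type for illustration; not part of OpenVINO.
struct QuantParams {
    float scale;         // real-value step per quantized unit
    int32_t zero_point;  // quantized value that represents real 0.0
};

// Symmetric: zero point is fixed at 0 and [-max|x|, +max|x|] maps to [-127, 127].
inline QuantParams symmetric_params(const std::vector<float>& x) {
    assert(!x.empty());
    float max_abs = 0.0f;
    for (float v : x) {
        max_abs = std::max(max_abs, std::fabs(v));
    }
    const float scale = max_abs > 0.0f ? max_abs / 127.0f : 1.0f;
    return {scale, 0};
}

// Asymmetric: [min, max] maps to [-128, 127]; the zero point shifts the grid.
inline QuantParams asymmetric_params(const std::vector<float>& x) {
    assert(!x.empty());
    const auto mm = std::minmax_element(x.begin(), x.end());
    const float mn = *mm.first;
    const float mx = *mm.second;
    const float scale = mx > mn ? (mx - mn) / 255.0f : 1.0f;
    const int32_t zp = static_cast<int32_t>(std::lround(-128.0f - mn / scale));
    return {scale, zp};
}

// Map a real value onto the int8 grid described by p, saturating at the limits.
inline int8_t quantize(float v, const QuantParams& p) {
    const long q = std::lround(v / p.scale) + p.zero_point;
    return static_cast<int8_t>(std::clamp<long>(q, -128, 127));
}

// Recover an approximate real value from a quantized one.
inline float dequantize(int8_t q, const QuantParams& p) {
    return static_cast<float>(static_cast<int32_t>(q) - p.zero_point) * p.scale;
}
```

For data that is entirely non-negative, the symmetric variant in this sketch leaves the negative half of the int8 range unused, while the asymmetric variant spreads the observed range across all 256 levels at the cost of storing a zero point.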
-
enum class OutputStorageType
Configuration for how Activations, Scales and Zero Points will be stored in output buffers:
Planar: Activations, Scales, and Zero Points are stored in independent buffers.
InterleavedScalesZP: Activations are stored in an independent buffer, while Scales and Zero Points (if any) are combined in a separate buffer.
Values:
-
enumerator Planar
-
enumerator InterleavedScalesZP
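The two storage layouts can be pictured with the following sketch. The container types and the `to_interleaved` helper are hypothetical, and the element order inside the combined buffer (scale, zero point, scale, zero point, ...) as well as the use of `float` for zero points are assumptions; the text above only states that Scales and Zero Points share a buffer.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical containers for illustration only; not OpenVINO types.
struct PlanarOutputs {
    std::vector<signed char> activations;  // quantized data
    std::vector<float> scales;             // one scale per quantization group
    std::vector<float> zero_points;        // one zero point per group
};

struct InterleavedOutputs {
    std::vector<signed char> activations;  // still an independent buffer
    std::vector<float> scales_zp;          // ASSUMED order: scale0, zp0, scale1, zp1, ...
};

// Repack planar scales and zero points into a single combined buffer.
inline InterleavedOutputs to_interleaved(const PlanarOutputs& p) {
    assert(p.scales.size() == p.zero_points.size());
    InterleavedOutputs out;
    out.activations = p.activations;
    out.scales_zp.reserve(2 * p.scales.size());
    for (std::size_t i = 0; i < p.scales.size(); ++i) {
        out.scales_zp.push_back(p.scales[i]);
        out.scales_zp.push_back(p.zero_points[i]);
    }
    return out;
}
```

In this sketch the activations buffer is identical in both layouts; only the placement of the scales and zero points changes.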
Public Functions
-
DynamicQuantize(const Output<Node> &data, const Attributes &attrs)
Constructs a DynamicQuantize operation.
- Parameters:
data – Input tensor with data
attrs – Dynamic quantization attributes
-
virtual void validate_and_infer_types() override
Verifies that attributes and inputs are consistent and computes output shapes and element types. Must be implemented by concrete child classes so that it can be run any number of times.
Throws if the node is invalid.
-
struct Attributes
Structure that specifies attributes for dynamic quantization.
-
enum class QuantizationType