openvino.runtime.opset15.scaled_dot_product_attention#

Return a node which implements Scaled Dot Product Attention.

Parameters:

query – Query tensor of shape [N, …, L, E] and floating-point datatype.
key – Key tensor of shape [N, …, S, E] and floating-point datatype.
value – Value tensor of shape [N, …, S, Ev] and floating-point datatype.
attention_mask – Optional attention mask tensor of shape [N, …, L, S] or scalar float type zero value. Refer to the operation specification for a complete description.
scale – Optional alternative scale, a floating-point type scalar.
causal – If true, then autogenerates causal attention mask instead of using attention_mask input. In this case attention_mask input is ignored.
name – The optional new name for output node.

Returns:

The new node performing Scaled Dot Product Attention operation.