Gather¶
Versioned name: Gather-7
Category: Data movement
- Short description: Gather operation takes slices of data of the first input tensor according to the indices
specified with the second input tensor and axis from the third input. Semantics of this operation is identical to
TensorFlow Gather operation.
Detailed description
output[p_0, p_1, ..., p_{axis-1}, i_b, ..., i_{M-1}, p_{axis+1}, ..., p_{N-1}] =
data[p_0, p_1, ..., p_{axis-1}, indices[p_0, p_1, ..., p_{b-1}, i_b, ..., i_{M-1}], p_{axis+1}, ..., p_{N-1}]
Where data, indices and axis are tensors from first, second and third inputs correspondingly, b is
the number of batch dimensions. N and M are numbers of dimensions of data and indices tensors, respectively.
Attributes:
batch_dims
Description: batch_dims (also denoted as
b) is a leading number of dimensions ofdatatensor andindicesrepresenting the batches, and Gather starts to gather from thebdimension. It requires the firstbdimensions in data and indices tensors to be equal. Ifbatch_dimsis less than zero, the normalized value is usedbatch_dims = indices.rank + batch_dims.Range of values:
[-min(data.rank, indices.rank); min(data.rank, indices.rank)]andbatch_dims' <= axis'. Wherebatch_dims'andaxis'stand for normalizedbatch_dimsandaxisvalues.Type: T_AXIS
Default value: 0
Required: no
Example 1 with default batch_dims value:
batch_dims = 0
axis = 0
indices = [0, 0, 4]
data = [1, 2, 3, 4, 5]
output = [1, 1, 5]
Example 2 with non-default batch_dims value:
batch_dims = 1
axis = 1
indices = [[0, 0, 4], <-- this is applied to the first batch
[4, 0, 0]] <-- this is applied to the second batch
indices_shape = (2, 3)
data = [[1, 2, 3, 4, 5], <-- the first batch
[6, 7, 8, 9, 10]] <-- the second batch
data_shape = (2, 5)
output = [[ 1, 1, 5],
[10, 6, 6]]
output_shape = (2, 3)
Example 3 with non-default batch_dims value:
batch_dims = 2
axis = 2
indices = [[[0, 0, 4], <-- this is applied to the first batch, index = (0, 0)
[4, 0, 0]], <-- this is applied to the second batch, index = (0, 1)
[[1, 2, 4], <-- this is applied to the third batch, index = (1, 0)
[4, 3, 2]]] <-- this is applied to the fourth batch, index = (1, 1)
indices_shape = (2, 2, 3)
data = [[[1, 2, 3, 4, 5], <-- the first batch, index = (0, 0)
[6, 7, 8, 9, 10]], <-- the second batch, index = (0, 1)
[[11, 12, 13, 14, 15], <-- the third batch, index = (1, 0)
[16, 17, 18, 19, 20]]] <-- the fourth batch, index = (1, 1)
data_shape = (2, 2, 5)
output = [[[ 1, 1, 5],
[10, 6, 6]],
[[12, 13, 15],
[20, 19, 18]]]
output_shape = (2, 2, 3)
Example 4 with axis > batch_dims:
batch_dims = 1
axis = 2
indices = [[1, 2, 4], <-- this is applied to the first batch
[4, 3, 2]] <-- this is applied to the second batch
indices_shape = (2, 3)
data = [[[[ 1, 2, 3, 4], <-- first batch
[ 5, 6, 7, 8],
[ 9, 10, 11, 12],
[13, 14, 15, 16],
[17, 18, 19, 20]]],
[[[21, 22, 23, 24], <-- second batch
[25, 26, 27, 28],
[29, 30, 31, 32],
[33, 34, 35, 36],
[37, 38, 39, 40]]]]
data_shape = (2, 1, 5, 4)
output = [[[[ 5, 6, 7, 8],
[ 9, 10, 11, 12],
[17, 18, 19, 20]]],
[[[37, 38, 39, 40],
[33, 34, 35, 36],
[29, 30, 31, 32]]]]
output_shape = (2, 1, 3, 4)
Example 5 with negative batch_dims value:
batch_dims = -1 <-- normalized value will be indices.rank + batch_dims = 2 - 1 = 1
axis = 1
indices = [[0, 0, 4], <-- this is applied to the first batch
[4, 0, 0]] <-- this is applied to the second batch
indices_shape = (2, 3)
data = [[1, 2, 3, 4, 5], <-- the first batch
[6, 7, 8, 9, 10]] <-- the second batch
data_shape = (2, 5)
output = [[ 1, 1, 5],
[10, 6, 6]]
output_shape = (2, 3)
Inputs
1:
datatensor of type T with arbitrary data. Required.2:
indicestensor of type T_IND with indices to gather. 0D tensor (scalar) for indices is also allowed. The values for indices are in the range[0, data[axis] - 1]. Required.3: Scalar or 1D tensor
axisof T_AXIS type is a dimension index to gather data from. For example, axis equal to 1 means that gathering is performed over the first dimension. Negativeaxismeans reverse indexing and will be normalized to valueaxis = data.rank + axis. Allowed values are from[-len(data.shape), len(data.shape) - 1]andaxis' >= batch_dims'. Whereaxis'andbatch_dims'stand for normalizedbatch_dimsandaxisvalues. Required.
Outputs
1: The resulting tensor of type T that consists of elements from
datatensor gathered byindices. The shape of the output tensor isdata.shape[:axis] + indices.shape[batch_dims:] + data.shape[axis + 1:]
Types
T: any supported type.
T_IND: any supported integer types.
T_AXIS: any supported integer types.
Example
<layer ... type="Gather" version="opset7">
<data batch_dims="1" />
<input>
<port id="0">
<dim>2</dim>
<dim>64</dim>
<dim>128</dim>
</port>
<port id="1">
<dim>2</dim>
<dim>32</dim>
<dim>21</dim>
</port>
<port id="2"/> <!-- axis = 1 -->
</input>
<output>
<port id="2">
<dim>2</dim>
<dim>32</dim>
<dim>21</dim>
<dim>128</dim>
</port>
</output>
</layer>