jax.lax.conv_general_dilated

jax.lax.conv_general_dilated(lhs, rhs, window_strides, padding, lhs_dilation=None, rhs_dilation=None, dimension_numbers=None, feature_group_count=1, batch_group_count=1, precision=None)[source]

General n-dimensional convolution operator, with optional dilation.

Wraps XLA’s Conv operator.

Parameters
  • lhs (Any) – a rank n+2 dimensional input array.

  • rhs (Any) – a rank n+2 dimensional array of kernel weights.

  • window_strides (Sequence[int]) – a sequence of n integers, representing the inter-window strides.

  • padding (Union[str, Sequence[Tuple[int, int]]]) – either the string ‘SAME’, the string ‘VALID’, or a sequence of n (low, high) integer pairs that give the padding to apply before and after each spatial dimension.

  • lhs_dilation (Optional[Sequence[int]]) – None, or a sequence of n integers, giving the dilation factor to apply in each spatial dimension of lhs. LHS dilation is also known as transposed convolution.

  • rhs_dilation (Optional[Sequence[int]]) – None, or a sequence of n integers, giving the dilation factor to apply in each spatial dimension of rhs. RHS dilation is also known as atrous convolution.

  • dimension_numbers (Union[None, ConvDimensionNumbers, Tuple[str, str, str]]) – either None, a ConvDimensionNumbers object, or a 3-tuple (lhs_spec, rhs_spec, out_spec), where each element is a string of length n+2.

  • feature_group_count (int) – integer, default 1. See XLA HLO docs.

  • batch_group_count (int) – integer, default 1. See XLA HLO docs.

  • precision (Optional[Any]) – Optional. Either None, which means the default precision for the backend, or a lax.Precision enum value (Precision.DEFAULT, Precision.HIGH or Precision.HIGHEST).

Return type

Any

Returns

An array containing the convolution result.

In the string case of dimension_numbers, each character identifies by position:

  • the batch dimensions in lhs, rhs, and the output with the character ‘N’,

  • the feature dimensions in lhs and the output with the character ‘C’,

  • the input and output feature dimensions in rhs with the characters ‘I’ and ‘O’ respectively, and

  • spatial dimension correspondences between lhs, rhs, and the output using any distinct characters.

For example, to indicate dimension numbers consistent with the conv function with two spatial dimensions, one could use (‘NCHW’, ‘OIHW’, ‘NCHW’). As another example, to indicate dimension numbers consistent with the TensorFlow Conv2D operation, one could use (‘NHWC’, ‘HWIO’, ‘NHWC’). When using the latter form of convolution dimension specification, window strides are associated with spatial dimension character labels according to the order in which the labels appear in the rhs_spec string, so that window_strides[0] is matched with the dimension corresponding to the first character appearing in rhs_spec that is not ‘I’ or ‘O’.

If dimension_numbers is None, the default is (‘NCHW’, ‘OIHW’, ‘NCHW’) (for a 2D convolution).