Commit e7e2525

highlight the usage same as regular BF16 Autocast
1 parent c7398d2 commit e7e2525


prototype_source/pt2e_quant_ptq_x86_inductor.rst

Lines changed: 2 additions & 0 deletions
@@ -175,6 +175,8 @@ In a more advanced scenario, int8-mixed-bf16 quantization comes into play. In th
 a Convolution or GEMM operator produces BFloat16 output data type instead of Float32 in the absence
 of a subsequent quantization node. Subsequently, the BFloat16 tensor seamlessly propagates through
 subsequent pointwise operators, effectively minimizing memory usage and potentially enhancing performance.
+Using this feature mirrors regular BFloat16 Autocast usage: it is as simple as wrapping the
+script within the BFloat16 Autocast context.
 
 ::
 
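The literal block that follows the ``::`` marker in the RST source is not part of this hunk. For context, here is a minimal sketch of what wrapping the script in the BFloat16 Autocast context looks like. The names `converted_model` and `example_inputs` are stand-ins, not taken from this diff; in the tutorial's real flow they come from the earlier prepare_pt2e/convert_pt2e quantization steps.

    import torch
    import torch.nn as nn

    # Stand-ins (assumptions for this sketch): in the actual tutorial,
    # `converted_model` is produced by prepare_pt2e/convert_pt2e with the
    # X86InductorQuantizer, and `example_inputs` are the calibration inputs.
    converted_model = nn.Sequential(nn.Conv2d(3, 8, 3), nn.ReLU()).eval()
    example_inputs = (torch.randn(1, 3, 32, 32),)

    # Wrap the script in the BFloat16 Autocast context, same as regular
    # BF16 Autocast, to enable int8-mixed-bf16 quantization.
    with torch.autocast(device_type="cpu", dtype=torch.bfloat16), torch.no_grad():
        # Lower through torch.compile (Inductor backend by default);
        # Convolution/GEMM ops without a subsequent quantization node
        # then produce BFloat16 output instead of Float32.
        optimized_model = torch.compile(converted_model)
        optimized_model(*example_inputs)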