prototype_source/pt2e_quant_ptq_static.rst
Lines changed: 8 additions & 5 deletions
@@ -436,11 +436,13 @@ Convert the Calibrated Model to a Quantized Model
 print(quantized_model)
 
 .. note::
-At this step, we currently have two representations that you can choose from, but what exact representation
-we offer in the long term might change based on feedbacks from users.
+At this step, we currently have two representations that you can choose from, but exact representation
+we offer in the long term might change based on feedback from PyTorch users.
 
 * Q/DQ Representation (default)
-Previous documentation for `representations <https://github.com/pytorch/rfcs/blob/master/RFC-0019-Extending-PyTorch-Quantization-to-Custom-Backends.md>`_ all quantized operators are represented as ``dequantize -> fp32_op -> qauntize``.
+
+Previous documentation for `representations <https://github.com/pytorch/rfcs/blob/master/RFC-0019-
+Extending-PyTorch-Quantization-to-Custom-Backends.md>`_ all quantized operators are represented as ``dequantize -> fp32_op -> qauntize``.
 
 .. code-block:: python
 
@@ -455,11 +457,12 @@ Convert the Calibrated Model to a Quantized Model
-* Reference Quantized Model Representation (WIP, expected to be ready at end of August): we have special representation for selected ops (for example, quantized linear), other ops are represented as (dq -> float32_op -> q), and q/dq are decomposed into more primitive operators.
+* Reference Quantized Model Representation (WIP, expected to be ready at end of August): we have special representation for selected ops (for example, quantized linear), other ops are represented as (``dq -> float32_op -> q``), and``q/dq`` are decomposed into more primitive operators.
 
-You can get this representation by: ``convert_pt2e(..., use_reference_representation=True)``
+You can get this representation by using ``convert_pt2e(..., use_reference_representation=True)``.
 
 .. code-block:: python
+
 # Reference Quantized Pattern for quantized linear
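
For readers of this diff, the ``dequantize -> fp32_op -> quantize`` pattern that the changed text describes can be sketched in plain Python. This is only an illustration of the arithmetic, assuming per-tensor affine int8 quantization. The helper names (``quantize``, ``dequantize``, ``qdq_add``) are hypothetical and are not PyTorch APIs, and the sketch does not reproduce what ``convert_pt2e`` actually emits.

```python
# Hypothetical helpers for illustration only; these are not PyTorch APIs.
# Assumption: per-tensor affine quantization to the int8 range [-128, 127].

def quantize(x, scale, zero_point, qmin=-128, qmax=127):
    """Map a float to an integer code, saturating at the int8 limits."""
    q = round(x / scale) + zero_point
    return max(qmin, min(qmax, q))


def dequantize(q, scale, zero_point):
    """Map an integer code back to the float value it represents."""
    return (q - zero_point) * scale


def qdq_add(qa, qb, scale, zero_point):
    """Quantized add written as dequantize -> fp32 op -> quantize."""
    a = dequantize(qa, scale, zero_point)     # dq
    b = dequantize(qb, scale, zero_point)     # dq
    return quantize(a + b, scale, zero_point)  # fp32 op, then q


scale, zero_point = 0.5, 0
qa = quantize(1.0, scale, zero_point)
qb = quantize(2.5, scale, zero_point)
qc = qdq_add(qa, qb, scale, zero_point)
print(qa, qb, qc, dequantize(qc, scale, zero_point))
```

The operator itself still runs in floating point here; only its inputs and outputs are integer codes. A reference representation, as described in the second hunk, would instead express selected ops directly in integer arithmetic.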