You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -441,8 +439,7 @@ we offer in the long term might change based on feedback from PyTorch users.
441
439
442
440
* Q/DQ Representation (default)
443
441
444
-
Previous documentation for `representations <https://github.com/pytorch/rfcs/blob/master/RFC-0019-
445
-
Extending-PyTorch-Quantization-to-Custom-Backends.md>`_ all quantized operators are represented as ``dequantize -> fp32_op -> qauntize``.
442
+
Previous documentation for `representations <https://github.com/pytorch/rfcs/blob/master/RFC-0019-Extending-PyTorch-Quantization-to-Custom-Backends.md>`_ all quantized operators are represented as ``dequantize -> fp32_op -> qauntize``.
446
443
447
444
.. code-block:: python
448
445
@@ -457,7 +454,9 @@ we offer in the long term might change based on feedback from PyTorch users.
* Reference Quantized Model Representation (WIP, expected to be ready at end of August): we have special representation for selected ops (for example, quantized linear), other ops are represented as (``dq -> float32_op -> q``), and ``q/dq`` are decomposed into more primitive operators.
457
+
* Reference Quantized Model Representation (available in nightlies)
458
+
459
+
We will have special representation for selected ops (for example, quantized linear), other ops are represented as (``dq -> float32_op -> q``), and ``q/dq`` are decomposed into more primitive operators.
461
460
462
461
You can get this representation by using ``convert_pt2e(..., use_reference_representation=True)``.
463
462
@@ -515,7 +514,7 @@ Now we can compare the size and model accuracy with baseline model.
515
514
If you want to get better accuracy or performance, try configuring
516
515
``quantizer`` in different ways, and each ``quantizer`` will have its own way
517
516
of configuration, so please consult the documentation for the
518
-
quantization you are using to learn more about how you can have more control
517
+
quantizer you are using to learn more about how you can have more control
0 commit comments