Commit eded873

some small fixes
Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:
1 parent f887283 commit eded873

1 file changed, with 2 additions and 2 deletions

prototype_source/gpu_quantization_torchao_tutorial.py

Lines changed: 2 additions & 2 deletions
@@ -157,8 +157,8 @@ def get_sam_model(only_one_block=False, batchsize=1):
 # weight data, rather than doing less computation. The torchao APIs:
 #
 # ``int8_dynamic_activation_int8_weight()``,
-# ``int8_dynamic_activation_int8_semi_sparse_weight`` or
-# ``int8_dynamic_activation_int4_weight``
+# ``int8_weight_only()`` or
+# ``int4_weight_only()``
 #
 # can be used to easily apply the desired quantization technique and then
 # once the model is compiled with ``torch.compile`` with ``max-autotune``, quantization is
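
For readers unfamiliar with the weight-only variants this commit now names, the idea can be sketched in plain Python: weights are stored as int8 values plus one floating-point scale per row, while activations stay in floating point. This is only an illustrative sketch and not how torchao implements ``int8_weight_only()``. The helper names ``quantize_row_int8`` and ``int8_weight_only_linear`` are hypothetical, and symmetric per-row scaling is an assumption made for simplicity.

```python
def quantize_row_int8(row):
    """Symmetric int8 quantization of one weight row.

    Returns the quantized integers and the float scale needed to recover
    approximate weights (w is approximately q * scale).
    """
    max_abs = max(abs(w) for w in row)
    # Guard against an all-zero row, where any scale works.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    q = [max(-127, min(127, round(w / scale))) for w in row]
    return q, scale


def int8_weight_only_linear(x, q_weight, scales):
    """Compute y = W x with W held as int8 rows and one scale per row.

    The activations x are NOT quantized; only the stored weights are.
    """
    return [
        s * sum(qw * xi for qw, xi in zip(q_row, x))
        for q_row, s in zip(q_weight, scales)
    ]


# Toy 2x3 weight matrix and a float input vector (illustrative values).
weight = [[0.5, -1.0, 0.25], [2.0, 0.0, -0.5]]
quantized = [quantize_row_int8(r) for r in weight]
q_weight = [q for q, _ in quantized]
scales = [s for _, s in quantized]

x = [1.0, 2.0, 3.0]
y_ref = [sum(w * xi for w, xi in zip(row, x)) for row in weight]
y_q = int8_weight_only_linear(x, q_weight, scales)
```

The number of multiply-adds is the same in both paths. What changes is the size of the stored weights, which matches the diff's remark that the benefit comes from loading less weight data rather than from doing less computation.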
