We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 677c1b6 commit 4773257Copy full SHA for 4773257
_static/img/pruning_flow.jpg
29.3 KB
index.rst
@@ -544,6 +544,13 @@ What's new in PyTorch tutorials?
544
:link: advanced/static_quantization_tutorial.html
545
:tags: Quantization
546
547
+.. customcarditem::
548
+ :header: (prototype) Accelerating BERT with semi-structured (2:4) sparsity
549
+ :card_description: Prune BERT to be 2:4 sparse and accelerate for inference.
550
+ :image: _static/img/thumbnails/cropped/generic-pytorch-logo.png
551
+ :link: prototype/semi_structured_sparse.html
552
+ :tags: Model-Optimziation
553
+
554
.. customcarditem::
555
:header: Grokking PyTorch Intel CPU Performance from First Principles
556
:card_description: A case study on the TorchServe inference framework optimized with Intel® Extension for PyTorch.
0 commit comments