Skip to content

Commit 8eca060

Browse files
committed
better images
1 parent f8ac1c5 commit 8eca060

File tree

5 files changed

+2
-2
lines changed

5 files changed

+2
-2
lines changed

_posts/2024-10-23-torchrec-fbgemm-1.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -30,13 +30,13 @@ FBGEMM has been empowering TorchRec through its backend high-performance kernel
3030

3131
[DLRM (Deep Learning Recommendation Model)](https://ai.meta.com/blog/dlrm-an-advanced-open-source-deep-learning-recommendation-model/) is the standard neural network architecture for powering recommendations at Meta, with categorical features being processed through embeddings, while continuous (dense) features are processed with a bottom multilayer perceptron. The following diagram depicts the basic architecture of DLRM, with a second order interaction layer between the dense and sparse features and a top MLP for generating the prediction.
3232

33-
![flow diagram](/assets/images/torchrec-fbgemm-1/fg1.jpg){:style="width:100%"}
33+
![flow diagram](/assets/images/torchrec-1.png){:style="width:100%"}
3434

3535

3636

3737
TorchRec provides standardized modules with significant optimizations in fusing embedding lookups. EBC is a traditional PyTorch embedding module implementation, containing a collection of `torch.nn.EmbeddingBags.` FusedEBC, powered by FBGEMM for high performance operations on embedding tables with a fused optimizer and UVM caching/management for alleviating memory constraints, is the optimized version present in sharded TorchRec modules for distributed training and inference. The below benchmark demonstrates the vast performance improvements of FusedEBC in comparison to a traditional PyTorch embedding module implementation (EBC) and the ability for FusedEBC to handle much larger embeddings than what is available on GPU memory with UVM caching.
3838

39-
![performance chart](/assets/images/torchrec-fbgemm-1/fg2.png){:style="width:100%"}
39+
![performance chart](/assets/images/torchrec-2.png){:style="width:100%"}
4040

4141

4242

Loading
Loading

assets/images/torchrec-1.png

182 KB
Loading

assets/images/torchrec-2.png

992 KB
Loading

0 commit comments

Comments
 (0)