forked from pytorch/tutorials
Issue 2338 #1
onurtore wants to merge 68 commits into main from issue-2338
Conversation
Fix to "perhaps there is a misprint at line 40 pytorch#2111"; review of referenced paper https://arxiv.org/pdf/1706.03762.pdf section 3.2.3 suggests: "Similarly, self-attention layers in the decoder allow each position in the decoder to attend to all positions in the decoder up to and including that position. We need to prevent leftward information flow in the decoder to preserve the auto-regressive property. We implement this inside of scaled dot-product attention by masking out (setting to −∞) all values in the input of the softmax which correspond to illegal connections. See Figure 2." Thus the suggested change in reference from nn.Transform.Encoder to nn.Transform.Decoder seems reasonable.
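The masking the paper describes can be sketched in plain PyTorch; this is a minimal illustration of a causal (subsequent-position) mask, not the tutorial's exact code:

```python
import torch

def causal_mask(sz: int) -> torch.Tensor:
    # Future positions (strict upper triangle) are set to -inf so that
    # softmax assigns them zero attention weight; allowed positions are 0.0.
    return torch.triu(torch.full((sz, sz), float("-inf")), diagonal=1)

m = causal_mask(4)
# Row i is the mask for query position i: position 0 may attend only to
# itself, while position 3 may attend to positions 0 through 3.
```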
As per suggestion in pytorch#1114
Fix for pytorch#1781. Rather than manually updating the version number to the current stable release (e.g., 2.0.1), as long as ONNX maintains compatibility with the latest stable version, that reference should be sufficient and stay up to date.
* Update nn_tutorial.py Fix for pytorch#1303 "add pyplot.show() in beginner tutorial." Comments on the issue suggested manually commenting out pyplot.show() for users not using Colab. --------- Co-authored-by: Svetlana Karslioglu <svekars@fb.com>
* refactored train loop in trainingyt.py, resolves issue pytorch#2230 * Simplified numpy function call, resolves issue pytorch#1038
* Added matplotlib dependency to blitz tutorial. * Removed a modified file from pull request --------- Co-authored-by: Carl Parker <carljparker@meta.com>
* removed ### lines and numbering in headlines * removed numbering from titles * added blank lines to show code * removed the empty TODO placeholder --------- Co-authored-by: Svetlana Karslioglu <svekars@fb.com>
…tion Finetuning Tutorial" (pytorch#2378)
Co-authored-by: Carl Parker <carljparker@meta.com>
Co-authored-by: sekyondaMeta <127536312+sekyondaMeta@users.noreply.github.com>
* Add temporary fix for embeddings bug Co-authored-by: Svetlana Karslioglu <svekars@fb.com>
…rch#2401) Co-authored-by: Svetlana Karslioglu <svekars@fb.com>
* Updates pytorch#836 as suggested in pytorch/pytorch#16885 (comment)
* address bug; do a little editing Signed-off-by: Mike Brown <brownwm@us.ibm.com> * Update intermediate_source/char_rnn_classification_tutorial.py Signed-off-by: Mike Brown <brownwm@us.ibm.com> Co-authored-by: Svetlana Karslioglu <svekars@fb.com>
Co-authored-by: Svetlana Karslioglu <svekars@fb.com>
…ansforms.Normalize (pytorch#2405) * Fixes pytorch#2083 - explain model.eval, torch.no_grad * set norm to mean & std of CIFAR10 (pytorch#1818) --------- Co-authored-by: Svetlana Karslioglu <svekars@fb.com>
… Tutorial, resolves issue pytorch#2332 (pytorch#2404)
Co-authored-by: noqqaqq <noqqaqq@users.noreply.github.com> Co-authored-by: Nicolas Hug <contact@nicolas-hug.com>
…orch#1971 (pytorch#2403) Co-authored-by: Carl Parker <carljparker@meta.com>
…n_tutorial.py (pytorch#2380) * changed the loss init to make it less confusing --------- Co-authored-by: Nicolas Hug <contact@nicolas-hug.com> Co-authored-by: Svetlana Karslioglu <svekars@fb.com>
…h#2402) * Update transformer_tutorial.py Add description for positional encoding calculation for Transformers * Update Positional Encoding description in transformer_tutorial.py * Update transformer_tutorial.py --------- Co-authored-by: Carl Parker <carljparker@meta.com>
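The positional-encoding calculation that commit documents follows the sinusoidal scheme from the paper; a self-contained sketch (function name and sizes are illustrative):

```python
import math
import torch

def positional_encoding(max_len: int, d_model: int) -> torch.Tensor:
    # PE[pos, 2i]   = sin(pos / 10000^(2i / d_model))
    # PE[pos, 2i+1] = cos(pos / 10000^(2i / d_model))
    position = torch.arange(max_len).unsqueeze(1)            # (max_len, 1)
    div_term = torch.exp(torch.arange(0, d_model, 2)
                         * (-math.log(10000.0) / d_model))   # (d_model/2,)
    pe = torch.zeros(max_len, d_model)
    pe[:, 0::2] = torch.sin(position * div_term)
    pe[:, 1::2] = torch.cos(position * div_term)
    return pe

pe = positional_encoding(50, 16)
```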
In the function demo_model_parallel, dev0 and dev1 are computed so that each process is assigned two distinct GPUs. This is achieved by doubling the rank and applying a modulus with twice the world_size. Assuming 8 GPUs, world_size is set to 4, leading to the creation of 4 processes, each allocated two distinct GPUs. For instance, the first process (process 0) is assigned GPUs 0 and 1, the second process (process 1) GPUs 2 and 3, and so forth.
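The indexing described above can be sketched as follows (the helper name `device_pair` is illustrative, not from the tutorial):

```python
def device_pair(rank: int, world_size: int) -> tuple[int, int]:
    # Double the rank and wrap by twice the world size, so each of the
    # world_size processes gets two distinct GPU indices.
    n_gpus = 2 * world_size
    dev0 = (rank * 2) % n_gpus
    dev1 = (rank * 2 + 1) % n_gpus
    return dev0, dev1

# With 8 GPUs and world_size = 4:
# rank 0 -> (0, 1), rank 1 -> (2, 3), rank 2 -> (4, 5), rank 3 -> (6, 7)
```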
* Update captum dependencies (matplotlib and flask-compress) * Use resnet18 due to RAM limitation Google Colab crashes due to insufficient RAM (more than 12 GB is required) if resnet101 or resnet50 are used. Thus, resnet18 is used instead (approximately 6 GB is used).
Remove `global_rng` and use `torch.randint` to fill the tensor of shape `shape` with values in range `[0, vocab_size)` Co-authored-by: Svetlana Karslioglu <svekars@fb.com>
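The replacement pattern is a one-liner; the shape and vocabulary size below are illustrative:

```python
import torch

vocab_size = 100
shape = (2, 8)  # illustrative shape

# torch.randint draws integers uniformly from [low, high), so this fills
# a tensor of the given shape with token ids in [0, vocab_size) without
# any separate global RNG helper.
ids = torch.randint(low=0, high=vocab_size, size=shape)
```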
I noticed when reading through these docs that the two examples did not use the parameter 'y'. I assume it was meant to be used, so I updated the code in the examples. Alternatively, parameter 'y' may be unnecessary and only 'x' is needed; let me know if that is the case and I will fix this :)
* Update mario_rl_tutorial.py Fixes pytorch#1620 --------- Co-authored-by: Vincent Moens <vincentmoens@gmail.com> Co-authored-by: Svetlana Karslioglu <svekars@fb.com>
Update the github username of an author
To make tutorial builds predictable, but still keep randomness when one runs it on Colab. Also, reset default_device after every tutorial run. Co-authored-by: Nikita Shulga <nshulga@meta.com>
* created original copy of the model by loading from disk * Update fx_graph_mode_ptq_dynamic.py --------- Co-authored-by: Svetlana Karslioglu <svekars@fb.com>
"evaluating and training ResNet-18 on random data" --> "evaluating and training a ``torchvision`` model on random data", since speedups are no longer demonstrated on resnet18.
Fixes pytorch#1642 Signed-off-by: BJ Hargrave <hargrave@us.ibm.com> Co-authored-by: sekyondaMeta <127536312+sekyondaMeta@users.noreply.github.com>
We also fix the code to use the scripted_cell just created. Fixes pytorch#1449 Signed-off-by: BJ Hargrave <hargrave@us.ibm.com>
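The pattern of that fix, calling the scripted module just created rather than the eager original, can be sketched as follows (the `Cell` module here is illustrative, not the tutorial's):

```python
import torch
from torch import nn

class Cell(nn.Module):
    # Minimal RNN-style cell: combines input and hidden state.
    def forward(self, x: torch.Tensor, h: torch.Tensor):
        new_h = torch.tanh(x + h)
        return new_h, new_h

scripted_cell = torch.jit.script(Cell())  # compile the module to TorchScript

x = torch.rand(3, 4)
h = torch.rand(3, 4)
# Use the scripted_cell just created, not the eager Cell instance.
out, new_h = scripted_cell(x, h)
```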
Co-authored-by: Svetlana Karslioglu <svekars@fb.com>
Co-authored-by: NM512 <morihira3513@gmailcom> Co-authored-by: Svetlana Karslioglu <svekars@fb.com>
…#2452) * replace old decoder diagram with new one * remove 1 from encoder1 and decoder1 * fix attention in AttnDecoderRNN * Fix formatting going over max character count --------- Co-authored-by: Svetlana Karslioglu <svekars@fb.com>
* Image prediction using trained model * Inference on custom images * Updated the PR following the PEP8 guidelines and made the requested changes --------- Co-authored-by: Svetlana Karslioglu <svekars@fb.com>
Fixes: pytorch#800 Co-authored-by: Svetlana Karslioglu <svekars@fb.com>
* add quantization 2.0 document --------- Co-authored-by: Svetlana Karslioglu <svekars@fb.com>
Signed-off-by: Onur Berk Töre <onurberk_t@hotmail.com>