Add torchmultimodal tutorial for flava finetuning #2054
Conversation
✅ Deploy Preview for pytorch-tutorials-preview ready!
######################################################################
# Installations
#
Suggested change:
#
# -----------------
#
# Installations
#
# We will use TextVQA dataset from HuggingFace for this
# tutorial. So we install datasets in addition to TorchMultimodal
Suggested change:
# tutorial. So we install datasets in addition to TorchMultimodal
# tutorial. We install datasets in addition to TorchMultimodal.
#
!wget http://dl.fbaipublicfiles.com/pythia/data/vocab.tar.gz
!tar xf vocab.tar.gz
You added this to the Makefile above I believe
# TODO: replace with install from pip when binary is ready
!git clone https://github.com/facebookresearch/multimodal.git
!pip install -r multimodal/requirements.txt
This should go to requirements.txt - I see only two items in that file, so you can add them to the requirements.txt.
sys.path.append(os.path.join(os.getcwd(),"multimodal"))
sys.path.append(os.getcwd())
!pip install datasets
!pip install transformers
Let's add this instead of lines 30 - 34:
# .. note::
#
#    When running this tutorial in Google Colab, install the required packages by
#    creating a new cell and running the following commands:
#
#    .. code-block::
#
#       !pip install torchmultimodal-nightly
#       !pip install datasets
#       !pip install transformers
Yes, but we want it to be present in the notebook
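One way to satisfy both points - sketched here purely as an illustration, not as part of the PR - is to run the installs inside the notebook but skip them when the packages are already available (for example, when they were installed from requirements.txt):

# Illustrative sketch only: install inside the notebook, but only when a package is missing.
import importlib.util
import subprocess
import sys

for package, module in [
    ("torchmultimodal-nightly", "torchmultimodal"),
    ("datasets", "datasets"),
    ("transformers", "transformers"),
]:
    if importlib.util.find_spec(module) is None:
        subprocess.check_call([sys.executable, "-m", "pip", "install", package])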
!tar xf vocab.tar.gz
with open("vocabs/answers_textvqa_more_than_1.txt") as f: |
This should point to where you have downloaded your data - probably 'data/'.
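For illustration, a minimal sketch of what that line could look like after the move (the data/ prefix follows the suggestion above; the variable names are assumptions, not taken from the PR):

# Sketch only: read the answer vocabulary from the extracted archive under data/
# and build the answer-to-index mapping used for the classification labels.
with open("data/vocabs/answers_textvqa_more_than_1.txt") as f:
    vocab = f.readlines()

answer_to_idx = {word.strip(): idx for idx, word in enumerate(vocab)}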
requirements.txt (outdated)
@@ -45,3 +48,7 @@ wget
gym==0.24.0
gym-super-mario-bros==7.3.0
timm

# flava tutorial - multimodal
packaging
I don't think we need packaging anymore.
I removed it in this PR.
# which is a multimodal model for object detection and
# `Omnivore <https://github.com/facebookresearch/multimodal/blob/main/torchmultimodal/models/omnivore.py>`__
# which is multitask model spanning image, video and 3d classification.
#
I can add it in a follow-up PR.
Left some feedback for more detail, and some suggested changes
for _ in range(epochs):
    for idx, batch in enumerate(train_dataloader):
        optimizer.zero_grad()
        out = model(text = batch["input_ids"], image = batch["image"], labels = batch["answers"], required_embedding="mm")
What is the required_embedding arg doing? It is not as obvious as the other params, maybe add a note in the plaintext above.
Removed, it's not required.
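For reference, a minimal sketch of the loop after dropping that argument, with the backward and optimizer steps filled in (the assumption that the model output exposes a loss attribute is mine, not something shown in the snippet above):

# Sketch only: finetuning loop without the required_embedding argument.
for _ in range(epochs):
    for idx, batch in enumerate(train_dataloader):
        optimizer.zero_grad()
        out = model(text=batch["input_ids"], image=batch["image"], labels=batch["answers"])
        loss = out.loss  # assumes the classification output carries a loss field
        loss.backward()
        optimizer.step()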
Does this need retraining the encoders too, or just the head?
It finetunes the encoders as well.
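For contrast, a hypothetical sketch of head-only finetuning (the parameter-name filter below is an assumption about the model's attribute names, not something confirmed by the PR):

# Sketch only: freeze everything except the classifier so only the head is trained.
import torch

for name, param in model.named_parameters():
    if "classifier" not in name:  # assumed naming; check model.named_parameters() first
        param.requires_grad = False

optimizer = torch.optim.AdamW(
    (p for p in model.parameters() if p.requires_grad), lr=5e-5  # lr chosen arbitrarily
)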
Co-authored-by: Nikita Shulga <nshulga@fb.com>
@ankitade what's the status on this? Can you resolve the merge conflict?
LGTM, just a couple nits
# end examples, aiming to enable and accelerate research in
# multimodality**.
#
# In this tutorial, we will demonstrate how to use a **pretrained SoTA
nit: can we just say state-of-the-art here?
# TorchMultimodal library to finetune on a multimodal task i.e. visual
# question answering** (VQA). The model consists of two unimodal transformer
# based encoders for text and image and a multimodal encoder to combine
# the two embeddings. It is pretrained using contrastive, image text matching and
Can the losses be enumerated in a different way here? I feel the comma placement makes this kinda confusing
requirements.txt (outdated)
@@ -27,6 +26,9 @@ pytorch-lightning
torchx
ax-platform
nbformat>=4.2.0
datasets
transformers
torchmultimodal-nightly
Can this be updated to use stable?
Adding the first tutorial for TorchMultimodal, showing how to finetune FLAVA for VQA.
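For context, a rough sketch of the setup the tutorial builds toward (flava_model_for_classification is the TorchMultimodal builder the tutorial relies on, but the exact import path, variable names, and learning rate here are assumptions rather than the PR's final code):

# Sketch only: build a FLAVA classification model sized to the answer vocabulary
# and an optimizer for full finetuning.
import torch
from torchmultimodal.models.flava.model import flava_model_for_classification  # path may differ by release

# Assumed: `vocab` is the list of answer strings loaded from the vocab file.
model = flava_model_for_classification(num_classes=len(vocab))
optimizer = torch.optim.AdamW(model.parameters(), lr=5e-5)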