[maskedtensor] Add overview tutorial #2042
Conversation
[ghstack-poisoned]
Please do not use stacks; just submit a regular PR (which otherwise looks good to me).
As a noob, I'm pretty confused by this tutorial's focus on comparisons to NumPy.
IMO, the first overview tutorial should not speak of other libraries. There can be a whole separate tutorial covering "comparisons to NumPy". It's just too confusing for a new user to think about alternative semantics that they don't need.
I would personally pull all of the NumPy conversation out into a separate tutorial, and focus this one on building up knowledge: what is an MT, how are they instantiated, how are they visualized, how do you do reductions, how do you combine MTs via ops like sum, and then you can touch on the requirement that masks have the same shape (and that you can get around this with an &). If you want, you can end the whole tutorial with a single sentence: "for those familiar with NumPy, we recognize the semantics deviate from NumPy's. Details on why are addressed in "
Indexing and slicing
--------------------

:class:`MaskedTensor` is a Tensor subclass, which means that it inherits the same semantics for indexing and slicing
Can you write a docstring on MaskedTensor? Currently:

    Init signature: MaskedTensor(data, mask, requires_grad=False)
    Docstring:      <no docstring>
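For reference, a minimal sketch of the kind of docstring being requested. The class body below is an illustrative stand-in, not the real MaskedTensor implementation; only the signature `(data, mask, requires_grad=False)` comes from the thread above.

```python
# Illustrative stub only -- the real MaskedTensor is a torch.Tensor
# subclass; this sketch just shows a possible docstring shape.
class MaskedTensor:
    """A tensor paired with a boolean mask of the same shape.

    Elements where the mask is True are "specified" (valid); elements
    where it is False are masked out and ignored by most operations,
    such as reductions.

    Args:
        data: the underlying values.
        mask: a boolean tensor of the same shape as ``data``; True marks
            a valid element.
        requires_grad: whether autograd should record operations on the
            result (default: False).
    """

    def __init__(self, data, mask, requires_grad=False):
        self.data = data
        self.mask = mask
        self.requires_grad = requires_grad
```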
:class:`MaskedTensor` is a Tensor subclass, which means that it inherits the same semantics for indexing and slicing
as :class:`torch.Tensor`. Below are some examples of common indexing and slicing patterns:

>>> data = torch.arange(60).reshape(3, 4, 5)
can you copy the output of this call and the mask call below as well? It helps the user see the original data before masking.
Done!
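As an aside, the values that call produces are easy to preview. The NumPy equivalent below (a sketch; the tutorial itself uses `torch.arange`) arranges the same integers 0..59 into a 3×4×5 array:

```python
import numpy as np

# NumPy analog of torch.arange(60).reshape(3, 4, 5): consecutive
# integers 0..59 arranged into 3 blocks of 4 rows x 5 columns.
data = np.arange(60).reshape(3, 4, 5)

print(data.shape)    # (3, 4, 5)
print(data[0, 0])    # first row of the first block: [0 1 2 3 4]
print(data[-1, -1])  # last row of the last block: [55 56 57 58 59]
```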
Semantics
+++++++++

MaskedTensor vs NumPy's MaskedArray
I feel like this can be moved to the end? The flow of the tutorial should be:

- what is an MT
- how do you access/slice
- what operations can you do / reductions
- how does this compare with alternatives to MaskedTensor
Discussed offline - this will include "what is a MT and why is an MT useful"
Reduction semantics
-------------------

The basis for reduction semantics `has been documented and discussed at length <https://github.com/pytorch/rfcs/pull/27>`__,
I feel like this can be rephrased. The purpose of a tutorial is to be a self-contained, helpful, end-to-end showcase of a new feature; it is not assumed that the reader has seen RFCs.
I would start off by explaining what reduction semantics are (we ignore masked values for your favorite functions!), then give the code examples, and conclude with "for more details on the basis for reduction semantics, see this RFC".
adding one more comment
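To make the suggested framing concrete, this is what "reductions ignore masked values" looks like with NumPy's MaskedArray (shown as an analogy, not MaskedTensor code; note that `np.ma` marks masked-out elements with True, the opposite of MaskedTensor's convention):

```python
import numpy as np

# In np.ma, mask=True means "masked out" (invalid).
x = np.ma.array([1.0, 2.0, 3.0, 4.0], mask=[False, True, False, True])

# Reductions skip the masked entries: only 1.0 and 3.0 participate.
print(x.sum())   # 4.0
print(x.mean())  # 2.0
print(x.max())   # 3.0
```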
>>> t0 = mt0.to_tensor(0)
>>> t1 = mt1.to_tensor(0)
>>> mt2 = masked_tensor(t0 + t1, mt0.get_mask() & mt1.get_mask())
Why `&` instead of `|`? It feels to me that combining two masks should be the union operator of those two masks, not the `&`?
NumPy uses `logical_or`, and our mask is the inverse of theirs, so ours is `&` if you want the same behavior.
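A quick check of that inversion, sketched with plain NumPy booleans: `masked0`/`masked1` play the role of np.ma's masks (True = masked out), while `valid0`/`valid1` play the role of MaskedTensor's (True = valid).

```python
import numpy as np

# np.ma convention: True means "masked out".
masked0 = np.array([True, False, False, True])
masked1 = np.array([False, False, True, True])

# MaskedTensor convention: True means "valid" -- the inverse.
valid0, valid1 = ~masked0, ~masked1

# np.ma combines masks with logical_or: an element is masked out if it
# is masked out in either input.
combined_masked = np.logical_or(masked0, masked1)

# Under the inverted convention the same rule becomes &: an element is
# valid only if it is valid in both inputs (De Morgan's law).
combined_valid = valid0 & valid1

print(combined_valid.tolist())      # [False, True, False, False]
print((~combined_masked).tolist())  # identical
```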
>>> t0 = mt0.to_tensor(0)
>>> t1 = mt1.to_tensor(0)
>>> mt2 = masked_tensor(t0 + t1, mt0.get_mask() & mt1.get_mask())
Is there a way to union two masks and return another mask, without returning a plain boolean tensor? Right now:

>>> mt0.get_mask() & mt1.get_mask()
tensor([[False, False, False, False],
        [False, False, False, False],
        [False, False, False, False]])

# it would be nice if union ops on two raw MTs returned an MT
>>> mt0 | mt1
MaskedTensor(
  [
    [ --, --, --, --],
    [ --, --, --, --],
    [ --, --, --, --]
  ]
)
Yeah, we could add this in!
Are you tracking features anywhere?
Stack from ghstack (oldest at bottom):