`RuntimeError: grad_input must be contiguous` when tensor size is large

## Reproduce
```python
import intel_pytorch_extension as ipex
import torch
import torch.nn.functional as F

# Runs successfully when the tensors are small.

out = torch.randn(10, 10, requires_grad=True, device=ipex.DEVICE)
mask = torch.randint(5, (10,), dtype=torch.long, device=ipex.DEVICE)

loss = F.cross_entropy(out, mask, ignore_index=1)
loss.backward()

# Fails with "RuntimeError: grad_input must be contiguous" when the tensors are large.

out = torch.randn(10, 10, 500, 1000, requires_grad=True, device=ipex.DEVICE)
mask = torch.randint(5, (10, 500, 1000,), dtype=torch.long, device=ipex.DEVICE)

loss = F.cross_entropy(out, mask, ignore_index=1)
loss.backward()
```
## Traceback
```
Traceback (most recent call last):
  File "reproduce.py", line 19, in <module>
    loss.backward()
  File "/opt/conda/envs/torch_env/lib/python3.7/site-packages/torch/tensor.py", line 245, in backward
    torch.autograd.backward(self, gradient, retain_graph, create_graph, inputs=inputs)
  File "/opt/conda/envs/torch_env/lib/python3.7/site-packages/torch/autograd/__init__.py", line 147, in backward
    allow_unreachable=True, accumulate_grad=True)  # allow_unreachable flag
RuntimeError: grad_input must be contiguous
```
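
## CPU baseline (sketch)
To help separate the extension's backward path from the script itself, the same forward/backward pair can be run on stock PyTorch. This is only a sketch: the helper name `check_cross_entropy_backward` and its `device` parameter are hypothetical and not part of the original report. It defaults to `"cpu"`, so it does not import `intel_pytorch_extension`.

```python
import torch
import torch.nn.functional as F


def check_cross_entropy_backward(logits_shape, device="cpu"):
    """Run the cross_entropy forward/backward from the report; return out.grad.

    Hypothetical helper for illustration only.
    """
    # Logits are (N, C) or (N, C, d1, d2, ...); targets drop the class dim.
    target_shape = (logits_shape[0],) + tuple(logits_shape[2:])

    out = torch.randn(*logits_shape, requires_grad=True, device=device)
    # Same target range as the report: class indices in [0, 5).
    mask = torch.randint(5, target_shape, dtype=torch.long, device=device)

    loss = F.cross_entropy(out, mask, ignore_index=1)
    loss.backward()
    return out.grad
```

Calling it with `(10, 10)` and then `(10, 10, 500, 1000)` mirrors the two cases above. If both shapes pass on the CPU, that would suggest the failure is specific to the `ipex.DEVICE` backward path rather than to the script. Printing `grad.is_contiguous()` on the result may also be informative. Passing `device=ipex.DEVICE` after importing the extension reruns the same check on the extension device.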


