Many groupby tests depend on a bug in DataFrameGroupBy.apply

Discovered while fixing an issue in #28541 

A large number of tests relating to `DataFrameGroupBy.apply` depend on a bug in `apply` where the grouped column is not removed from the returned DataFrame.

Here are some examples:

- [`test_apply_chunk_view`](https://github.com/pandas-dev/pandas/blob/4edf938aedf55b9e6fbfb3199f70f857e8ec7e41/pandas/tests/groupby/test_apply.py#L366)
- [`test_apply_multiindex_fail`](https://github.com/pandas-dev/pandas/blob/4edf938aedf55b9e6fbfb3199f70f857e8ec7e41/pandas/tests/groupby/test_apply.py#L366)

There are around ~20 cases of this.

---

In general, they expect the result of something like this:

```
df = pd.DataFrame({"key": [1, 1, 1, 2, 2, 2, 3, 3, 3], "value": range(9)})
df.groupby('key').apply(lambda x: x.sum())
```

To include the "key" column in the result, which should not be the case.

```
     key  value
key
1      3      3
2      6     12
3      9     21
```

The underlying issue with `apply` is a straightforward fix, all that needs to be done is change [this return](https://github.com/pandas-dev/pandas/blob/master/pandas/core/groupby/groupby.py#L729) to use the `_group_selection_context`, but the PR will have to include updates to many tests.




Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Many groupby tests depend on a bug in DataFrameGroupBy.apply #28549

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

Many groupby tests depend on a bug in DataFrameGroupBy.apply #28549

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions