-
-
Notifications
You must be signed in to change notification settings - Fork 18.5k
PERF: Implement groupby idxmax/idxmin in Cython #54234
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes from 2 commits
5acec90
60407c5
73927a7
6958c95
f1b9ad1
ec6a33b
cede8b4
03f30f6
9e4aabc
3e3f14b
fa941b2
bfc9cc9
212247b
bb65902
590b6d4
929d5e9
8043f29
d5f328b
056c590
c0c79ef
b2641a4
cbcca6d
03d4510
6e727b4
b56d49f
51432ec
6a4f51e
1268eac
448581a
f9bb55e
ff32210
57d7b81
75bde4b
587a054
f1d2b5c
00e4347
6658a98
1539925
dadd01e
9d7d082
0bfd131
0d9d54c
363212d
52a3413
30bc4c7
95f35a4
97a52f8
ff00e20
df282d8
3d2d8a0
ad07653
b07e9ba
da8088c
38b3f38
5c416fe
75638a5
b666563
a8a5412
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -416,7 +416,7 @@ Performance improvements | |
- Performance improvement when doing various reshaping operations on :class:`arrays.IntegerArrays` & :class:`arrays.FloatingArray` by avoiding doing unnecessary validation (:issue:`53013`) | ||
- Performance improvement when indexing with pyarrow timestamp and duration dtypes (:issue:`53368`) | ||
- Performance improvement when passing an array to :meth:`RangeIndex.take`, :meth:`DataFrame.loc`, or :meth:`DataFrame.iloc` and the DataFrame is using a RangeIndex (:issue:`53387`) | ||
- | ||
- Performance improvement in :meth:`SeriesGroupBy.idxmax`, :meth:`SeriesGroupBy.idxmin`, :meth:`DataFrameGroupBy.idxmax`, :meth:`DataFrameGroupBy.idxmin` (PR#??) | ||
jbrockmendel marked this conversation as resolved.
Show resolved
Hide resolved
|
||
|
||
.. --------------------------------------------------------------------------- | ||
.. _whatsnew_210.bug_fixes: | ||
|
@@ -574,6 +574,7 @@ Groupby/resample/rolling | |
- Bug in :meth:`GroupBy.var` failing to raise ``TypeError`` when called with datetime64, timedelta64 or :class:`PeriodDtype` values (:issue:`52128`, :issue:`53045`) | ||
- Bug in :meth:`DataFrameGroupby.resample` with ``kind="period"`` raising ``AttributeError`` (:issue:`24103`) | ||
- Bug in :meth:`Resampler.ohlc` with empty object returning a :class:`Series` instead of empty :class:`DataFrame` (:issue:`42902`) | ||
- Bug in :meth:`SeriesGroupBy.idxmax`, :meth:`SeriesGroupBy.idxmin`, :meth:`DataFrameGroupBy.idxmax`, :meth:`DataFrameGroupBy.idxmin` would fail if the groupings have unobserved categories (:issue:`10694`) | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. im back to being confused here. why isn't raising correct? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. We now consistently raise on empty groups from unobserved categories. We warn on empty groups from NA values in parallel with DataFrame/Series. |
||
- Bug in :meth:`SeriesGroupBy.nth` and :meth:`DataFrameGroupBy.nth` after performing column selection when using ``dropna="any"`` or ``dropna="all"`` would not subset columns (:issue:`53518`) | ||
- Bug in :meth:`SeriesGroupBy.nth` and :meth:`DataFrameGroupBy.nth` raised after performing column selection when using ``dropna="any"`` or ``dropna="all"`` resulted in rows being dropped (:issue:`53518`) | ||
- Bug in :meth:`SeriesGroupBy.sum` and :meth:`DataFrameGroupby.sum` summing ``np.inf + np.inf`` and ``(-np.inf) + (-np.inf)`` to ``np.nan`` (:issue:`53606`) | ||
|
Uh oh!
There was an error while loading. Please reload this page.