ENH: In pd.cut(), allow bins='auto' (leveraging np.histogram_bin_edges) #59165

Closed
@Hari-Shankar-Karthik

Description

Feature Type

  • Adding new functionality to pandas

  • Changing existing functionality in pandas

  • Removing existing functionality in pandas

Problem Description

pd.cut() is very useful for converting a quantitative variable into a qualitative one. However, it requires the user to specify bins as either an integer or a list of bin edges. I would like to be able to pass bins='auto', as np.histogram already allows. The edges themselves could be computed internally with np.histogram_bin_edges. Thank you.

Expectation

Instead of writing
pd.cut(df['x1'], bins=np.histogram_bin_edges(df['x1'], bins='auto'))
it should be possible to write
pd.cut(df['x1'], bins='auto')

Additional Context

Calculation of bin edges is already done via np.histogram_bin_edges. Reference: https://numpy.org/doc/stable/reference/generated/numpy.histogram_bin_edges.html#numpy-histogram-bin-edges
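
Until something like this exists, the workaround can be wrapped in a small user-side helper. The following is only a rough sketch of the requested behaviour, not a proposed pandas implementation: the name cut_with_auto_bins is hypothetical, and dropping NaN before computing the edges and defaulting include_lowest to True are my own assumptions about what a sensible default would be.

```python
import numpy as np
import pandas as pd


def cut_with_auto_bins(x, bins="auto", **kwargs):
    """Hypothetical helper (not part of pandas): pd.cut that also accepts
    the string bin rules understood by np.histogram_bin_edges."""
    if isinstance(bins, str):
        # Assumption: ignore missing values when estimating the edges,
        # since np.histogram_bin_edges cannot derive a range from NaN.
        values = np.asarray(x, dtype=float)
        values = values[~np.isnan(values)]
        bins = np.histogram_bin_edges(values, bins=bins)
        # Assumption: the first edge equals the data minimum, so include it;
        # otherwise the smallest value would fall outside the first interval.
        kwargs.setdefault("include_lowest", True)
    return pd.cut(x, bins=bins, **kwargs)


df = pd.DataFrame({"x1": [1.0, 2.0, 2.5, 3.0, 10.0, np.nan]})
binned = cut_with_auto_bins(df["x1"])
```

Integer bin counts and explicit edge lists are passed through to pd.cut unchanged. Note that with explicit edges and the default right=True, pd.cut leaves the first interval open on the left, which is why the sketch sets include_lowest; the one-liner above would otherwise map the minimum of df['x1'] to NaN.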
