Skip to content

BUG: prod and groupby.prod behave differently #46573

Closed
@tf401

Description

@tf401

Pandas version checks

  • I have checked that this issue has not already been reported.

  • I have confirmed this bug exists on the latest version of pandas.

  • I have confirmed this bug exists on the main branch of pandas.

Reproducible Example

import pandas as pd

df = pd.DataFrame(
    data=[
        [5,	2, 17, 11, 7, 2, 3],
        [5,	2, 17, 41, 7, 2, 3],
        [5,	2, 17, 17, 7, 2, 3],
        [5,	2, 17, 37, 7, 2, 3],
        [5,	2, 17, 7, 7, 2, 3],
        [5,	2, 17, 29, 7, 2, 3],
        [5,	2, 17, 31, 7, 2, 3],
        [5,	2, 17, 2, 7, 2, 3],
        [5,	2, 17, 3, 7, 2, 3],
        [5,	2, 17, 43, 7, 2, 3],
        [5,	2, 17, 5, 7, 2, 3],
        [5,	2, 17, 47, 7, 2, 3],
        [5,	2, 17, 19, 7, 2, 3],
        [5,	2, 17, 23, 7, 2, 3],

    ],
    columns=['A','B','C','D','E','F','G']
)

df.prod()
#A            6103515625
#B                 16384
#C    168377826559400929
#D     47299214045268570 <-- correct value
#E          678223072849
#F                 16384
#G               4782969
#dtype: int64

df.groupby(['A','B','C','E','F','G']).prod()

#						D
#A	B	C	E	F	G	
#5	2	17	7	2	3	47299214045268568 <-- wrong value

Issue Description

During a project that uses pandas groupby prod method I noticed possible bug.

The reproducible example should be clear but the core issue is that is seems that pandas prod() method behaves different
than pandas groupby prod method. The result is inconsistent. In the example, please inspect column "D"

Expected Behavior

Correct output of column D during the groupby operation

Installed Versions

'1.4.1'

Metadata

Metadata

Type

No type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions