Open
Description
I have spent a lot of time trying to understand users and their behaviors in order to optimize for them. As a part of this work, I have done numerous studies on what gets used in pandas.
This will be extremely useful when it comes to defining a dataframe standard, because what people are using can help inform us on what behaviors to support.
For this study, we scraped the top 6000 notebooks from Kaggle by upvote.
Repo here, reproduction script included: https://github.com/modin-project/study_kaggle_usage
Results here: results.csv
Metadata
Metadata
Assignees
Labels
No labels