Description
This topic came up in a PyArrow issue raised by Polars developers working on their native Dataframe Interchange Protocol implementation. Note that in the PyArrow implementation of the protocol we decided to cast bit-packed boolean values to `uint8` when producing the interchange object, and to cast `uint8` back to bit-packed boolean when consuming an interchange object.
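
For anyone not familiar with the difference, here is a minimal pure-Python sketch of the two layouts. It assumes Arrow-style least-significant-bit ordering for the bit-packed form. The helper names (`pack_bits`, `unpack_bits`, `to_byte_packed`) are made up for illustration and are not part of PyArrow or the protocol.

```python
def pack_bits(values):
    """Bit-packed layout: 8 booleans per byte, least-significant bit first."""
    packed = bytearray((len(values) + 7) // 8)
    for i, v in enumerate(values):
        if v:
            packed[i // 8] |= 1 << (i % 8)
    return bytes(packed)


def unpack_bits(buf, length):
    """Read `length` booleans back out of a bit-packed buffer."""
    return [bool((buf[i // 8] >> (i % 8)) & 1) for i in range(length)]


def to_byte_packed(values):
    """Byte-packed layout: one uint8 (0 or 1) per boolean."""
    return bytes(1 if v else 0 for v in values)


values = [True, False, True, True, False, False, False, False, True, False]

bit_packed = pack_bits(values)        # 2 bytes for 10 values
byte_packed = to_byte_packed(values)  # 10 bytes for 10 values

# Both layouts carry the same logical values.
assert unpack_bits(bit_packed, len(values)) == values
assert [bool(b) for b in byte_packed] == values
```

A consumer that only receives the buffer and a dtype needs to know which of the two layouts it is looking at, which is why the question below matters.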
As this topic has come up again, and pandas has added support for bitmask conversion in pandas-dev/pandas#52824, it would make sense to try to support bit-packed boolean dtypes in the PyArrow implementation as well (without converting to `uint8`), but I haven't found any information in the protocol specification about bit-packed versus byte-packed boolean values.
Are both bit-packed and byte-packed booleans supported by the Dataframe Interchange Protocol?