Skip to content

Commit d0063ba

Browse files
committed
DOC: ecosystem.rst: Koalas
1 parent 421ae9d commit d0063ba

File tree

1 file changed

+9
-0
lines changed

1 file changed

+9
-0
lines changed

doc/source/ecosystem.rst

Lines changed: 9 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -313,6 +313,15 @@ Increasingly, packages are being built on top of pandas to address specific need
313313
* vaex.from_pandas
314314
* vaex.to_pandas_df
315315

316+
`Koalas <https://koalas.readthedocs.io/en/latest/>`__
317+
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
318+
319+
The Koalas project implements the pandas DataFrame API on top of Apache Spark so that data scientists can be more productive when interacting with big data.
320+
pandas is the de facto standard (single-node) DataFrame implementation in Python, while Spark is the de facto standard for big data processing.
321+
With this package, you can:
322+
323+
* Be immediately productive with Spark, with no learning curve, if you are already familiar with pandas.
324+
* Have a single codebase that works both with pandas (tests, smaller datasets) and with Spark (distributed datasets).
316325

317326
.. _ecosystem.data_validation:
318327

0 commit comments

Comments
 (0)