You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: doc/source/install.rst
+3-1Lines changed: 3 additions & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -286,7 +286,9 @@ Optional Dependencies
286
286
`xsel <http://www.vergenet.net/~conrad/software/xsel/>`__, or
287
287
`xclip <https://github.com/astrand/xclip/>`__: necessary to use
288
288
:func:`~pandas.read_clipboard`. Most package managers on Linux distributions will have ``xclip`` and/or ``xsel`` immediately available for installation.
289
-
* `pandas-gbq <https://pandas-gbq.readthedocs.io/en/latest/install.html#dependencies>`__: for Google BigQuery I/O.
* `Backports.lzma <https://pypi.org/project/backports.lzma/>`__: Only for Python 2, for writing to and/or reading from an xz compressed DataFrame in CSV; Python 3 support is built into the standard library.
Copy file name to clipboardExpand all lines: doc/source/whatsnew/v0.24.0.rst
+73-10Lines changed: 73 additions & 10 deletions
Original file line number
Diff line number
Diff line change
@@ -24,7 +24,8 @@ New features
24
24
the user to override the engine's default behavior to include or omit the
25
25
dataframe's indexes from the resulting Parquet file. (:issue:`20768`)
26
26
- :meth:`DataFrame.corr` and :meth:`Series.corr` now accept a callable for generic calculation methods of correlation, e.g. histogram intersection (:issue:`22684`)
27
-
27
+
- :func:`DataFrame.to_string` now accepts ``decimal`` as an argument, allowing
28
+
the user to specify which decimal separator should be used in the output. (:issue:`23614`)
:func:`Datafame.merge` and :func:`Dataframe.join` can now be used to join multi-indexed ``Dataframe`` instances on the overlaping index levels (:issue:`6360`)
- New method :meth:`HDFStore.walk` will recursively walk the group hierarchy of an HDF5 file (:issue:`10932`)
225
270
- :func:`read_html` copies cell data across ``colspan`` and ``rowspan``, and it treats all-``th`` table rows as headers if ``header`` kwarg is not given and there is no ``thead`` (:issue:`17054`)
226
271
- :meth:`Series.nlargest`, :meth:`Series.nsmallest`, :meth:`DataFrame.nlargest`, and :meth:`DataFrame.nsmallest` now accept the value ``"all"`` for the ``keep`` argument. This keeps all ties for the nth largest/smallest value (:issue:`16818`)
@@ -246,6 +291,7 @@ Other Enhancements
246
291
- :meth:`read_excel()` now accepts ``usecols`` as a list of column names or callable (:issue:`18273`)
247
292
- :meth:`MultiIndex.to_flat_index` has been added to flatten multiple levels into a single-level :class:`Index` object.
248
293
- :meth:`DataFrame.to_stata` and :class:` pandas.io.stata.StataWriter117` can write mixed sting columns to Stata strl format (:issue:`23633`)
294
+
- :meth:`DataFrame.between_time` and :meth:`DataFrame.at_time` have gained the an ``axis`` parameter (:issue: `8839`)
249
295
250
296
.. _whatsnew_0240.api_breaking:
251
297
@@ -271,17 +317,19 @@ If installed, we now require:
271
317
+-----------------+-----------------+----------+
272
318
| bottleneck | 1.2.0 ||
273
319
+-----------------+-----------------+----------+
320
+
| fastparquet | 0.1.2 ||
321
+
+-----------------+-----------------+----------+
274
322
| matplotlib | 2.0.0 ||
275
323
+-----------------+-----------------+----------+
276
324
| numexpr | 2.6.1 ||
277
325
+-----------------+-----------------+----------+
278
-
| pytables | 3.4.2 ||
279
-
+-----------------+-----------------+----------+
280
-
| scipy | 0.18.1 ||
326
+
| pandas-gbq | 0.8.0 ||
281
327
+-----------------+-----------------+----------+
282
328
| pyarrow | 0.7.0 ||
283
329
+-----------------+-----------------+----------+
284
-
| fastparquet | 0.1.2 ||
330
+
| pytables | 3.4.2 ||
331
+
+-----------------+-----------------+----------+
332
+
| scipy | 0.18.1 ||
285
333
+-----------------+-----------------+----------+
286
334
287
335
Additionally we no longer depend on `feather-format` for feather based storage
@@ -960,7 +1008,10 @@ Other API Changes
960
1008
- Slicing a single row of a DataFrame with multiple ExtensionArrays of the same type now preserves the dtype, rather than coercing to object (:issue:`22784`)
961
1009
- :class:`DateOffset` attribute `_cacheable` and method `_should_cache` have been removed (:issue:`23118`)
962
1010
- Comparing :class:`Timedelta` to be less or greater than unknown types now raises a ``TypeError`` instead of returning ``False`` (:issue:`20829`)
1011
+
- :meth:`Categorical.searchsorted`, when supplied a scalar value to search for, now returns a scalar instead of an array (:issue:`23466`).
1012
+
- :meth:`Categorical.searchsorted` now raises a ``KeyError`` rather that a ``ValueError``, if a searched for key is not found in its categories (:issue:`23466`).
963
1013
- :meth:`Index.hasnans` and :meth:`Series.hasnans` now always return a python boolean. Previously, a python or a numpy boolean could be returned, depending on circumstances (:issue:`23294`).
1014
+
- The order of the arguments of :func:`DataFrame.to_html` and :func:`DataFrame.to_string` is rearranged to be consistent with each other. (:issue:`23614`)
964
1015
965
1016
.. _whatsnew_0240.deprecations:
966
1017
@@ -981,10 +1032,14 @@ Deprecations
981
1032
- The ``fastpath`` keyword of the different Index constructors is deprecated (:issue:`23110`).
982
1033
- :meth:`Timestamp.tz_localize`, :meth:`DatetimeIndex.tz_localize`, and :meth:`Series.tz_localize` have deprecated the ``errors`` argument in favor of the ``nonexistent`` argument (:issue:`8917`)
983
1034
- The class ``FrozenNDArray`` has been deprecated. When unpickling, ``FrozenNDArray`` will be unpickled to ``np.ndarray`` once this class is removed (:issue:`9031`)
1035
+
- The methods :meth:`DataFrame.update` and :meth:`Panel.update` have deprecated the ``raise_conflict=False|True`` keyword in favor of ``errors='ignore'|'raise'`` (:issue:`23585`)
984
1036
- Deprecated the `nthreads` keyword of :func:`pandas.read_feather` in favor of
985
1037
`use_threads` to reflect the changes in pyarrow 0.11.0. (:issue:`23053`)
986
1038
- :func:`pandas.read_excel` has deprecated accepting ``usecols`` as an integer. Please pass in a list of ints from 0 to ``usecols`` inclusive instead (:issue:`23527`)
987
1039
- Constructing a :class:`TimedeltaIndex` from data with ``datetime64``-dtyped data is deprecated, will raise ``TypeError`` in a future version (:issue:`23539`)
1040
+
- The ``keep_tz=False`` option (the default) of the ``keep_tz`` keyword of
1041
+
:meth:`DatetimeIndex.to_series` is deprecated (:issue:`17832`).
1042
+
- Timezone converting a tz-aware ``datetime.datetime`` or :class:`Timestamp` with :class:`Timestamp` and the ``tz`` argument is now deprecated. Instead, use :meth:`Timestamp.tz_convert` (:issue:`23579`)
- Improved performance of :func:`IndexEngine.get_indexer_non_unique` for sorted, non-unique indexes (:issue:`9466`)
1089
1144
- Improved performance of :func:`PeriodIndex.unique` (:issue:`23083`)
1090
1145
- Improved performance of :func:`pd.concat` for `Series` objects (:issue:`23404`)
1146
+
- Improved performance of :meth:`DatetimeIndex.normalize` and :meth:`Timestamp.normalize` for timezone naive or UTC datetimes (:issue:`23634`)
1147
+
- Improved performance of :meth:`DatetimeIndex.tz_localize` and various ``DatetimeIndex`` attributes with dateutil UTC timezone (:issue:`23772`)
1091
1148
1092
1149
1093
1150
.. _whatsnew_0240.docs:
@@ -1217,8 +1274,8 @@ Numeric
1217
1274
Strings
1218
1275
^^^^^^^
1219
1276
1220
-
-
1221
-
-
1277
+
- Bug in :meth:`Index.str.partition` was not nan-safe (:issue:`23558`).
1278
+
- Bug in :meth:`Index.str.split` was not nan-safe (:issue:`23677`).
1222
1279
-
1223
1280
1224
1281
Interval
@@ -1315,16 +1372,20 @@ Notice how we now instead output ``np.nan`` itself instead of a stringified form
1315
1372
- :func:`read_csv()` and func:`read_table()` will throw ``UnicodeError`` and not coredump on badly encoded strings (:issue:`22748`)
1316
1373
- :func:`read_csv()` will correctly parse timezone-aware datetimes (:issue:`22256`)
1317
1374
- Bug in :func:`read_csv()` in which memory management was prematurely optimized for the C engine when the data was being read in chunks (:issue:`23509`)
1375
+
- Bug in :func:`read_csv()` in unnamed columns were being improperly identified when extracting a multi-index (:issue:`23687`)
1318
1376
- :func:`read_sas()` will parse numbers in sas7bdat-files that have width less than 8 bytes correctly. (:issue:`21616`)
1319
1377
- :func:`read_sas()` will correctly parse sas7bdat files with many columns (:issue:`22628`)
1320
1378
- :func:`read_sas()` will correctly parse sas7bdat files with data page types having also bit 7 set (so page type is 128 + 256 = 384) (:issue:`16615`)
1321
1379
- Bug in :meth:`detect_client_encoding` where potential ``IOError`` goes unhandled when importing in a mod_wsgi process due to restricted access to stdout. (:issue:`21552`)
1380
+
- Bug in :func:`to_html()` with ``index=False`` misses truncation indicators (...) on truncated DataFrame (:issue:`15019`, :issue:`22783`)
1322
1381
- Bug in :func:`DataFrame.to_string()` that broke column alignment when ``index=False`` and width of first column's values is greater than the width of first column's header (:issue:`16839`, :issue:`13032`)
1323
1382
- Bug in :func:`DataFrame.to_string()` that caused representations of :class:`DataFrame` to not take up the whole window (:issue:`22984`)
1324
1383
- Bug in :func:`DataFrame.to_csv` where a single level MultiIndex incorrectly wrote a tuple. Now just the value of the index is written (:issue:`19589`).
1325
1384
- Bug in :meth:`HDFStore.append` when appending a :class:`DataFrame` with an empty string column and ``min_itemsize`` < 8 (:issue:`12242`)
1385
+
- Bug in :func:`read_csv()` in which incorrect error messages were being raised when ``skipfooter`` was passed in along with ``nrows``, ``iterator``, or ``chunksize`` (:issue:`23711`)
1326
1386
- Bug in :meth:`read_csv()` in which :class:`MultiIndex` index names were being improperly handled in the cases when they were not provided (:issue:`23484`)
1327
1387
- Bug in :meth:`read_html()` in which the error message was not displaying the valid flavors when an invalid one was provided (:issue:`23549`)
1388
+
- Bug in :meth:`read_excel()` in which extraneous header names were extracted, even though none were specified (:issue:`11733`)
1328
1389
- Bug in :meth:`read_excel()` in which ``index_col=None`` was not being respected and parsing index columns anyway (:issue:`20480`)
1329
1390
- Bug in :meth:`read_excel()` in which ``usecols`` was not being validated for proper column names when passed in as a string (:issue:`20480`)
1330
1391
@@ -1373,6 +1434,7 @@ Reshaping
1373
1434
- Bug in :func:`pandas.concat` when concatenating a multicolumn DataFrame with tz-aware data against a DataFrame with a different number of columns (:issue:`22796`)
1374
1435
- Bug in :func:`merge_asof` where confusing error message raised when attempting to merge with missing values (:issue:`23189`)
1375
1436
- Bug in :meth:`DataFrame.nsmallest` and :meth:`DataFrame.nlargest` for dataframes that have a :class:`MultiIndex` for columns (:issue:`23033`).
1437
+
- Bug in :meth:`DataFrame.append` with a :class:`Series` with a dateutil timezone would raise a ``TypeError`` (:issue:`23682`)
1376
1438
1377
1439
.. _whatsnew_0240.bug_fixes.sparse:
1378
1440
@@ -1387,6 +1449,7 @@ Sparse
1387
1449
- Bug in ``DataFrame.groupby`` not including ``fill_value`` in the groups for non-NA ``fill_value`` when grouping by a sparse column (:issue:`5078`)
1388
1450
- Bug in unary inversion operator (``~``) on a ``SparseSeries`` with boolean values. The performance of this has also been improved (:issue:`22835`)
1389
1451
- Bug in :meth:`SparseArary.unique` not returning the unique values (:issue:`19595`)
1452
+
- Bug in :meth:`SparseArray.nonzero` and :meth:`SparseDataFrame.dropna` returning shifted/incorrect results (:issue:`21172`)
0 commit comments