You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: doc/source/io.rst
+11-5Lines changed: 11 additions & 5 deletions
Original file line number
Diff line number
Diff line change
@@ -1211,7 +1211,7 @@ You can create/modify an index for a table with ``create_table_index`` after dat
1211
1211
1212
1212
Query via Data Columns
1213
1213
~~~~~~~~~~~~~~~~~~~~~~
1214
-
You can designate (and index) certain columns that you want to be able to perform queries (other than the `indexable` columns, which you can always query). For instance say you want to perform this common operation, on-disk, and return just the frame that matches this query.
1214
+
You can designate (and index) certain columns that you want to be able to perform queries (other than the `indexable` columns, which you can always query). For instance say you want to perform this common operation, on-disk, and return just the frame that matches this query. You can specify ``data_columns = True`` to force all columns to be data_columns
1215
1215
1216
1216
.. ipython:: python
1217
1217
@@ -1377,21 +1377,26 @@ External Compatibility
1377
1377
.. ipython:: python
1378
1378
:suppress:
1379
1379
1380
-
legacy_store.close()
1380
+
store_export.close()
1381
1381
import os
1382
-
os.remove('store_export.h5')
1382
+
os.remove('export.h5')
1383
1383
1384
1384
Backwards Compatibility
1385
1385
~~~~~~~~~~~~~~~~~~~~~~~
1386
1386
1387
1387
0.10.1 of ``HDFStore`` is backwards compatible for reading tables created in a prior version of pandas however, query terms using the prior (undocumented) methodology are unsupported. ``HDFStore`` will issue a warning if you try to use a prior-version format file. You must read in the entire file and write it out using the new format, using the method ``copy`` to take advantage of the updates. The group attribute ``pandas_version`` contains the version information. ``copy`` takes a number of options, please see the docstring.
- You can pass ``chunksize=an integer`` to ``append``, to change the writing chunksize (default is 50000). This will signficantly lower your memory usage on writing.
1416
1421
- You can pass ``expectedrows=an integer`` to the first ``append``, to set the TOTAL number of expectedrows that ``PyTables`` will expected. This will optimize read/write performance.
1417
1422
- Duplicate rows can be written to tables, but are filtered out in selection (with the last items being selected; thus a table is unique on major, minor pairs)
1423
+
- A ``PerformanceWarning`` will be raised if you are attempting to store types that will be pickled by PyTables (rather than stored as endemic types). See <http://stackoverflow.com/questions/14355151/how-to-make-pandas-hdfstore-put-operation-faster/14370190#14370190> for more information and some solutions.
0 commit comments