Incorrect column count estimation on some tables #1334

Open
@jsanchezf

Description

I am trying to parse a large set of documents and extract information from the tables they contain. However, python-docx does not navigate the table cells correctly. I think this is due to an inconsistency in the column count: in some cases, len(_Row.cells) is greater than Table._column_count.
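
The check described above can be sketched roughly as follows. This is a minimal illustration, not the attached script: the helper names `rows_with_mismatched_cell_count` and `report` are hypothetical, and `Table._column_count` is a private attribute of python-docx, so it may change between releases.

```python
def rows_with_mismatched_cell_count(table):
    """Return (row_index, cell_count, column_count) for each row whose
    number of cells differs from the table's column count."""
    # Private attribute named in the report; not part of the public API.
    expected = table._column_count
    mismatches = []
    for row_index, row in enumerate(table.rows):
        cell_count = len(row.cells)
        if cell_count != expected:
            mismatches.append((row_index, cell_count, expected))
    return mismatches


def report(path):
    """Print every mismatching row of every top-level table in a .docx file."""
    # Imported here so the helper above can be exercised without python-docx.
    from docx import Document

    document = Document(path)
    for table_index, table in enumerate(document.tables):
        for row_index, cell_count, expected in rows_with_mismatched_cell_count(table):
            print(
                f"table {table_index}, row {row_index}: "
                f"{cell_count} cells, column count {expected}"
            )
```

Calling `report("Sample.docx")` on the attached document should list the rows where the two counts disagree, assuming python-docx is installed.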

I attach a sample document, a simple parsing script and the resulting output to show the problem.

extract_FP.py.txt
Sample-log.txt
Sample.docx