Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

IndexError in lattice #493

Open
BoBoBrccc opened this issue Mar 26, 2024 · 1 comment
Open

IndexError in lattice #493

BoBoBrccc opened this issue Mar 26, 2024 · 1 comment
Labels
bug Something isn't working

Comments

@BoBoBrccc
Copy link

Describe the bug

An IndexError is raised in _reduce_index method in lattice.py
It happens when a text is starting within the table, but finishing outside.

Steps to reproduce the bug
read_pdf of attached file

Expected behavior

No error !

Code

import camelot

camelot.read_pdf(myfile, flavor='lattice', split_text=True)

PDF
camelot.pdf

Environment

  • OS: Windows
  • Python version: 3.9.18
  • Numpy version: 1.26.0
  • OpenCV version: 4.6.0
  • Ghostscript version: 10.02.1
  • Camelot version: 0.11.0

Additional context

N/A

@BoBoBrccc BoBoBrccc added the bug Something isn't working label Mar 26, 2024
@bosd
Copy link
Collaborator

bosd commented Apr 3, 2024

Hey!

As #343 this repo is no longer maintained, we try to build a maintained fork at pypdf_table_extraction.

Do you want to check out the cod ethere to see if the issue still persists. If so please open an issue there.

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants