This repository contains a C# port of the original PDFTableExtractor with the developer's permission. The application extracts tables from PDF files and converts them into Excel (XLSX) format.
- Parallel extraction: Process multiple PDFs at once.
- Customizable settings: Configure how the app works.
- Version checking: Keeps track of the latest version on GitHub.
- User interface: Easily configure settings through the app’s interface.
- Error handling: Logs any errors in an
error.txt
file for troubleshooting.
- Go to the Releases section and download the latest installer file.
- Run the installer (avoid installing to
ProgramFiles
). - A PDFTableExtractor shortcut will appear on the Desktop.
- Drag & drop one or more PDFs onto the Desktop shortcut.
- Alternatively, right-click on the PDF and select the extract option (must be enabled in settings).
- A command prompt will appear, printing information about the processing.
- XLSX files will be created in the same directory where the PDFs were located.
For customizing output, check out the Settings wiki page.
To bring up the settings menu, start the desktop icon normally.
- Keep pages with rows/columns: Skip exporting all pages/sheets that don't meet the criteria.
- Skip empty rows/columns: Different options for choosing row/column skipping methods.
- Page naming strategy: How to name pages/sheets in the Excel file.
- Autosize columns: Resizes created columns before saving.
- Parallel file processing: Enables processing multiple PDFs at the same time.
- Context menu: When turned on, an extraction option appears in the right-click menu of PDF files.
- When a new version is available, a message will appear in the console saying that the local version is out of date.
- Go to the Releases section.
- Download the new installer.
- Uninstall the old version.
- Install the new version.
- Create a new issue with a descriptive title.
- Try to include more information, e.g., the PDF you tried to extract (if you're allowed to), your settings,
error.txt
. - If the expected output is wrong, demonstrate what the expected output would be and what the output of the app was.
- When a program error occurs, a file named
error.txt
gets created in the directory of the application.
- Create an issue describing the feature/filter you need, giving it a descriptive name.
- Write a short description of what the feature/filter would do.
- Post screenshots of the input and the expected output.
This C# port of the original PDFTableExtractor was created with the developer's permission.