pdf-tables-parser

A library to extract text tables from PDF files.

Background (why)

Sometimes your server has to retrieve information from PDF files, e.g. financial reports, where the information is inside tables (rows and columns).

However, there is no easy way to extract this information from Node.js applications. All the alternatives I tried needed extra processing to get the tables I wanted, so I finally decided to create one of my own.

Demo

You can test the library online here

Installation

$ npm install @pomgui/pdf-tables-parser

Usage

const { PdfDocument } = require('@pomgui/pdf-tables-parser');
const fs = require('fs');

// Load the PDF, then write the parsed document to report.json as indented JSON.
const pdf = new PdfDocument();
pdf.load('report.pdf')
    .then(() => fs.writeFileSync('report.json', JSON.stringify(pdf, null, 2), 'utf8'))
    .catch(err => console.error(err));

Result Example

{
  "numPages": 1,
  "pages": [
    {
      "pageNumber": 1,
      "tables": [
        {
          "tableNumber": 1,
          "numrows": 65,
          "numcols": 3,
          "data": [
            ["name", "age", "amount"],
            ["John", "49", "150,000.00"],
            ["Mary", "25", "10,000.00"],
            ["..."]
          ]
        }
      ]
    }
  ]
}
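
Once you have a result shaped like the example above, you will often want each row as an object keyed by column name. The sketch below is one possible way to do that. `tableToRecords` is a hypothetical helper, not part of the library, and it assumes the first row of each table holds the column names, which may not be true for every PDF. The sample data is copied from the example above, without the elided rows.

```javascript
// Hypothetical helper (not part of the library): converts one extracted table
// into an array of objects. Assumes the first row contains the column names.
function tableToRecords(table) {
    const [header, ...rows] = table.data;
    return rows.map(row =>
        Object.fromEntries(header.map((name, i) => [name, row[i]]))
    );
}

// Sample shaped like the result example, limited to two data rows.
const result = {
    numPages: 1,
    pages: [{
        pageNumber: 1,
        tables: [{
            tableNumber: 1,
            numrows: 3,
            numcols: 3,
            data: [
                ['name', 'age', 'amount'],
                ['John', '49', '150,000.00'],
                ['Mary', '25', '10,000.00']
            ]
        }]
    }]
};

// Collect the tables of every page, then the records of every table.
const records = result.pages
    .flatMap(page => page.tables)
    .flatMap(table => tableToRecords(table));

console.log(records);
```

All cell values stay strings, as in the example result, so amounts such as "150,000.00" still need to be parsed if you want numbers.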
