document-understanding

Here are 18 public repositories matching this topic...

infiniflow / ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Updated Jul 2, 2025
Python

deepdoctection / deepdoctection

A Repo For Document AI

python nlp ocr tensorflow pytorch document-parser document-layout-analysis table-recognition table-detection document-understanding publaynet layoutlm document-ai document-image-analysis pubtabnet

Updated Jun 30, 2025
Python

X-PLUG / mPLUG-DocOwl

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

multimodal table-understanding document-understanding mllm multimodal-large-language-models chart-understanding

Updated May 30, 2025
Python

OpenBMB / VisRAG

Parsing-free RAG supported by VLMs

retrieval multi-modal document-retrieval rag multi-modality document-understanding vision-language-model retrieval-augmented-generation

Updated Feb 19, 2025
Python

wenwenyu / PICK-pytorch

Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICPR 2020)

document-analysis graph-convolutional-network graph-learning graph-neural-networks document-understanding key-information-extraction

Updated Jul 25, 2024
Python

jpWang / LiLT

Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)

nlp information-extraction document-analysis document-understanding multilingual-models document-ai multimodal-pre-trained-model

Updated Oct 31, 2022
Python

huggingface / chug

Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.

computer-vision pdf-document datasets distributed-training dataloading document-understanding multi-modal-learning webdataset

Updated Apr 3, 2024
Python

microsoft / CompHRDoc

Datasets and Evaluation Scripts for CompHRDoc

document-understanding document-structure-analysis rag-related

Updated Feb 25, 2025
Python

ZeningLin / PEneo

[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.

ocr document-understanding key-information-extraction document-ai visual-information-extraction

Updated Apr 7, 2025
Python

jacobmarks / pytesseract-ocr-plugin

Run optical character recognition with PyTesseract from the FiftyOne App!

python plugin nlp ocr computer-vision tesseract tesseract-ocr document-understanding fiftyone

Updated Apr 5, 2024
Python

irgroup / labelstudio-to-fonduer

This small module connects Label Studio with Fonduer by creating a Fonduer labeling function for gold labels from a Label Studio export. Documentation: https://irgroup.github.io/labelstudio-to-fonduer/

data-annotation knowledge-base-construction document-understanding label-studio fonduer

Updated Feb 14, 2023
Python

Haruhiyuki / yuque-rag

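Connects a Yuque knowledge base to large language models to build an intelligent question-answering system based on RAG (retrieval-augmented generation). Supports FastAPI and is compatible with the OpenAI API and local Ollama models.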

ai-search rag document-understanding

Updated Jun 12, 2025
Python

PAIR-Systems-Inc / little-dorrit-editor

Multimodal benchmark for evaluating handwritten editorial correction in printed text.

benchmark ocr multimodal-deep-learning document-understanding llm-evaluation

Updated Apr 17, 2025
Python

marcel-lamott / SlimDoc

Official implementation for "SlimDoc: Lightweight Distillation of Document Transformer Models," published in the International Journal on Document Analysis and Recognition (IJDAR), 2025

distillation document-understanding

Updated Jun 22, 2025
Python

Pu5hk4r / PROJECT-PDF-CHAT-BOT

PDF Chatbot is an AI-driven application that lets users chat with their PDF documents. It extracts text from uploaded PDFs and uses a powerful language model to answer user queries in a context-aware manner. The chatbot is built with Python, Gradio for the web interface, PyPDF2 for PDF parsing, and Hugging Face Transformers + LangChain for natural

pdf machine-learning chatbot python-3 language-model nlp-machine-learning context-awareness pdf-parser document-understanding vector-database huggingface-transformers gradio-interface langchain

Updated Apr 28, 2025
Python

mycielski / textract_study

Analysing expense reports/invoices with AWS Textract and boto3.

shell aws script invoices aws-cli expenses boto3 textract document-understanding

Updated Nov 27, 2023
Python

phong-lt / LiGT_VQA

This repository includes the ReceiptVQA dataset and the PyTorch implementation of the LiGT method and other evaluated baselines.

vietnamese-language visual-question-answering document-understanding

Updated Apr 5, 2025
Python

Lucky-akash321 / Document-Q-A-with-Google-Gemma

The Document Q&A with Google Gemma project involves building an intelligent system for extracting and answering questions from documents using the Google Gemma API. It integrates natural language processing (NLP) techniques to provide accurate, context-aware responses.

natural-language-processing python-programming data-preprocessing machine-learning-models document-understanding google-gemma-api text-extraction-techniques ai-driven-question-answering

Updated Feb 13, 2025
Python
