Knowledge Agents and Management in the Cloud
-
Updated
Apr 29, 2025 - Python
Knowledge Agents and Management in the Cloud
🔥 The Python library for PDF forms.
pdfCropMargins -- a program to crop the margins of PDF files
A small utility making use of the pypdf library to provide a (somewhat) lighter alternative to pdftk
A tool to sign PDF files. With Linux support.
CCKS2019评测任务五-公众公司公告信息抽取,第3名
Meet MultiPDF 📚 Chat AI App! 🚀 Chat seamlessly with Multiple PDFs using Langchain, Google Gemini Pro & FAISS Vector DB with Seamless Streamlit Deployment. Get instant, accurate responses from Awesome Google Gemini OpenSource language Model. 📚💬 Transform your PDF experience now! 🔥✨
PDFViewer is a GUI tool, written using python3 and tkinter, which lets you view PDF documents.
Python library to manipulate PDF page labels
The first developer-oriented document platform. Generate, host and track PDFs with a single API, beautifully.
This is a Python application that allows you to load a PDF and ask questions about it using natural language. The application uses a LLM to generate a response about your PDF. The LLM will not answer questions unrelated to the document.
Generate custom detailed survey paper with topic clustered sections and proper citations, from just a single query in just under 30 mins !!
A Streamlit-powered application that provides a user-friendly interface for editing PDF documents.
A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GROBID, LangChain, listen as podcast. Customize your own pipelines.
Search and replace text in PDF files with PyPDF.
✨ A batch of useful code/scripts: run commands automatically, finish repetitive stupid operations, perform format conversions, etc.
Prepare documents for distribution
Create a ChatGPT for uploaded pdf using Langchain
PDF2PPT Generator is a Python tool that creates Powerpoint presentations from PDF files by using smart summarization techniques assisted by GPT-3.5-Turbo
This repo contains script using Tesseract OCR to digitize pdf ebooks to text format.
Add a description, image, and links to the pdf-document-processor topic page so that developers can more easily learn about it.
To associate your repository with the pdf-document-processor topic, visit your repo's landing page and select "manage topics."