A fast and lightweight pure Python library for splitting text into semantically meaningful chunks.
-
Updated
Jul 12, 2024 - Python
A fast and lightweight pure Python library for splitting text into semantically meaningful chunks.
🍱 semantic-chunking ⇢ semantically create chunks from large document for passing to LLM workflows
A sentence splitting (sentence boundary disambiguation) library for Go. It is rule-based and works out-of-the-box.
JChunk is a lightweight and flexible library designed to provide multiple strategies for text chunking within Spring Boot applications
An exploration of text splitting and chunking in JavaScript
LangChain is a framework, which is very helpful and easy to build applications based on available Large Language Models.
This is an experiment in learning langchain, pinecone and stuff, don't mind
Matching strings between lists based on length
Add a description, image, and links to the text-splitting topic page so that developers can more easily learn about it.
To associate your repository with the text-splitting topic, visit your repo's landing page and select "manage topics."