RAG Chat App Basic Example

Basic RAG chat sample app with ChatGPT style.

Tech Stacks

All works locally
- Ollama llama3.1 for local LLM
- Azure AI Search Emulator for local search
LiteLLM
- as a adaptor for various LLM
minimal sample code
- backend is Python FastAPI
- frontend is plain html (instead of React stuff)
- two types: simple response and stream response like ChatGPT
Chainlit python low-code UI

Quick Start

start AzureSearchEmulator (see setup below)

cd AzureSearchEmulator

docker compose up -d
docker compose logs -f

start dev server

poetry install

./start_devserver.sh

Simple Chat: http://127.0.0.1:8000/static/index.html
Stream Chat: http://127.0.0.1:8000/static/chat-stream.html
Chainlit UI: http://127.0.0.1:8000/chainlit/
FastAPI Doc: http://127.0.0.1:8000/docs

# open with VSCode
poetry shell
export REQUESTS_CA_BUNDLE=~/.aspnet/https/certificate.pem
code .

Curl Test

curl -v -c cookies.txt -X POST "http://127.0.0.1:8000/chat-stream" \
  -H "Content-Type: application/json" -d '{"input":"hello"}'

curl -v -b cookies.txt -X GET \
  "http://127.0.0.1:8000/chat-history?session_id=521b158d-9daa-4a70-b419-1074cef0c768"

Example Messages Structure with Placeholder

system role message: setting context and guiding the model
adding a placeholder for the assistant can be a good practice
- it’s common practice to include a placeholder message with an empty content string.
- especially if you want to clearly indicate that the assistant's response is expected next.

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "What is the weather today?"},
    {"role": "assistant", "content": "It's sunny and warm."},
    {"role": "user", "content": "What about tomorrow?"},  # User's last input
    {"role": "assistant", "content": ""}  # Placeholder for the next assistant response
]

Setup Notes

FastAPI: https://github.com/fastapi/fastapi

pyenv local 3.12.5

poetry init -n

poetry add -G dev ipykernel
poetry add fastapi[standard] litellm azure-search-documents python-dotenv

# it's safe to install in case error: No module named 'itsdangerous'
poetry add itsdangerous

Ollama: https://github.com/ollama/ollama
- used with LiteLLM-Ollama: https://docs.litellm.ai/docs/providers/ollama

brew install ollama

ollama run llama3.1  # or phi3 or something you prefer
# >>> Hello!
# Hello there! How can I help you today?
# >>> /bye

drawdown.js: markdown to html converter
- https://github.com/adamvleggett/drawdown

curl -OL https://raw.githubusercontent.com/adamvleggett/drawdown/refs/heads/master/drawdown.js

Azure Search Emulator

Azure Search Emulator: https://github.com/feature23/AzureSearchEmulator
dotnet-sdk: https://formulae.brew.sh/cask/dotnet-sdk
setup local https: https://qiita.com/j_kitayama_hoge000/items/26cd7a5ef0b2fac53fce

dotnet dev-certs https --check

# maybe you will need this
dotnet dev-certs https --trust

# create new certs (path can be different)
dotnet dev-certs https -ep ~/.aspnet/https/aspnetapp.pfx -p password

clone AzureSearchEmulator and edit docker-compose.yml
- update ASPNETCORE_Kestrel__Certificates__Default__Path and volumes

services:
  web:
    build: .
    ports:
      - 5080:80
      - 5081:443
    environment:
      - ASPNETCORE_URLS=https://+;http://+
      - ASPNETCORE_HTTPS_PORT=5081
      - ASPNETCORE_Kestrel__Certificates__Default__Password=password
      - ASPNETCORE_Kestrel__Certificates__Default__Path=/https/aspnetapp.pfx
    volumes:
      - indexes:/app/indexes
      - ~/.aspnet/https:/https:ro
volumes:
  indexes:

start server and try from curl

docker compose up -d

# you should see some json from curl output
curl https://localhost:5081/

convert pfx to pem for Python (MacOS user)

cd ~/.aspnet/https

openssl pkcs12 -in aspnetapp.pfx -out certificate.pem -nodes

# Python requires this environment variable
export REQUESTS_CA_BUNDLE=~/.aspnet/https/certificate.pem

you may want to try from Postman or Insomnia for debug in case of trouble
(another option?): https://github.com/tomasloksa/azure-search-emulator

Add index and documents from Python notebook

Note: collection/ComplexField are not implemented in AzureSearchEmulator
make sure to install Jupyter extension in your VSCode
open and run with VSCode src/notes/azure-ai-search-notes.py

UI libs Note

popular Python UI libraries

setup Chainlit

# need to remove the current fastapi
# because chainlit (1.2.0) depends on fastapi (>=0.110.1,<0.113)
poetry remove fastapi
poetry add "fastapi[standard]"@^0.112.0 chainlit

# this ensures all dependencies are resolved properly
poetry update

# verify the versions of fastapi and its dependencies
poetry show fastapi

# verify the versions of installed packages
poetry show

Upgrade Packages

poetry show --outdated

poetry update

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
src		src
static		static
.env		.env
.flake8		.flake8
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
start_devserver.sh		start_devserver.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RAG Chat App Basic Example

Tech Stacks

Quick Start

Curl Test

Example Messages Structure with Placeholder

Setup Notes

Azure Search Emulator

Add index and documents from Python notebook

UI libs Note

setup Chainlit

Upgrade Packages

About

Releases

Packages

Languages

kenfj/rag-chat-app

Folders and files

Latest commit

History

Repository files navigation

RAG Chat App Basic Example

Tech Stacks

Quick Start

Curl Test

Example Messages Structure with Placeholder

Setup Notes

Azure Search Emulator

Add index and documents from Python notebook

UI libs Note

setup Chainlit

Upgrade Packages

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages