- llm base link:- https://huggingface.co/TheBloke/Llama-2-7B-Chat-GGML/resolve/main/llama-2-7b-chat.ggmlv3.q8_0.bin (download before running)
- Needs atleast 9gb of ram free to run
- Install Python version above 3.9 or above as required by th latest version of packages
- Install pip for python
- Insall required packages for the project
pip install -r requirements.txt
- Run the project
uvicorn app:app --host 0.0.0.0 --port 8000
You can run and test this project using chainLit as well