GCP-based Retrieval-Augmented Generation (RAG) System

Welcome to the GCP-based Retrieval-Augmented Generation (RAG) System repository. This project leverages Google Cloud Platform (GCP) to build a scalable RAG system for handling large amounts of data. The data, originating from various formats and conditions, undergoes preprocessing before being ingested into a GCP Datastore. The system uses the Gemini API for data search and summary, with a user interface built using Streamlit.

Overview

This project involves several key steps:

Local Database Creation: Build a local version of the company's database.
Data Preprocessing: Convert and format data files from various formats (doc, pdf) to a consistent format.
Cloud Storage: Store the processed data in GCP Cloud Buckets.
Data Store Creation: Use the GCP Console to create a scalable data store, serving as the vector database.
API Integration: Use the Vertex AI API for data search and summary generation.
User Interface: Implement a Streamlit-based UI for interaction.

Google Cloud Console

GCP account setup:
- Create a Google Cloud account
- Upload documents to a Cloud Storage bucket (only .pdf/.doc)
- Create a data store and import the storage bucket (use ** after the path to include sub-directories)
- Create a search app in Agent Builder and link it to the data store
- Test the search bot in the preview tab
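
The upload and import steps above can be sketched in a few lines of Python. This is only an illustration: the helper names `select_uploadable` and `import_pattern` are hypothetical and not part of this repository, and the exact form of the `**` import path is an assumption based on the note above.

```python
from pathlib import Path
from typing import List

# Extensions named in the upload step above; extend this set if needed.
SUPPORTED_EXTENSIONS = {".pdf", ".doc"}


def select_uploadable(root: Path) -> List[Path]:
    """Return files under root (searched recursively) that have a supported extension."""
    return sorted(
        p for p in root.rglob("*")
        if p.is_file() and p.suffix.lower() in SUPPORTED_EXTENSIONS
    )


def import_pattern(bucket: str, prefix: str = "") -> str:
    """Build a gs:// path ending in ** so that sub-directories are included.

    The bucket and prefix are placeholders supplied by the caller.
    """
    segments = [s for s in (bucket.strip("/"), prefix.strip("/")) if s]
    return "gs://" + "/".join(segments) + "/**"
```

For example, `import_pattern("my-bucket", "docs")` gives the string to paste into the import dialog for a hypothetical bucket named `my-bucket`.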

Installation for local deployment

Install the Google Cloud CLI:

I used the SDK installer (https://cloud.google.com/sdk/docs/install#installation_instructions). This is for authorization during development. Be sure to set up the CLI in the SDK manager with your Google account and set up Application Default Credentials (ADC).
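
To check whether ADC is already set up, a small script can look for the credentials file in the places the Google auth libraries search by default. This is a rough sketch, not part of the repository: the function names are hypothetical, and it only covers the `GOOGLE_APPLICATION_CREDENTIALS` variable and the default file written by `gcloud auth application-default login`.

```python
import os
import sys
from pathlib import Path
from typing import List, Mapping, Optional

ADC_FILENAME = "application_default_credentials.json"


def adc_candidates(env: Mapping[str, str], home: Path, platform: str) -> List[Path]:
    """List the paths where an ADC file is expected, in lookup order."""
    candidates = []
    explicit = env.get("GOOGLE_APPLICATION_CREDENTIALS")
    if explicit:
        candidates.append(Path(explicit))
    if platform.startswith("win"):
        # On Windows the gcloud config directory lives under %APPDATA%.
        appdata = env.get("APPDATA")
        if appdata:
            candidates.append(Path(appdata) / "gcloud" / ADC_FILENAME)
    else:
        candidates.append(home / ".config" / "gcloud" / ADC_FILENAME)
    return candidates


def find_adc_file(
    env: Optional[Mapping[str, str]] = None,
    home: Optional[Path] = None,
    platform: Optional[str] = None,
) -> Optional[Path]:
    """Return the first candidate that exists on disk, or None if ADC is not set up."""
    env = os.environ if env is None else env
    home = Path.home() if home is None else home
    platform = sys.platform if platform is None else platform
    for candidate in adc_candidates(env, home, platform):
        if candidate.is_file():
            return candidate
    return None
```

If `find_adc_file()` returns `None`, running `gcloud auth application-default login` should create the file.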

Clone the repository:

git clone https://github.com/jordanpym17/RAG-with-GCP.git
cd RAG-with-GCP

Create a virtual environment:

python -m venv search_venv
source search_venv/bin/activate  # On Windows use `search_venv\Scripts\activate`

Install the dependencies:
```
pip install -r requirements.txt
```
Fill in project details:

Fill in project_id, location, engine_id, and data_store_id in the following files:
- app.py
- api_call_test.py
- Gemini_api.py
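
To show how these four values fit together, here is a minimal sketch that groups them and derives the resource names a Discovery Engine search call typically needs. The `SearchConfig` class is hypothetical and not how the files above are organized. The `default_collection` and `default_config` segments, and the regional endpoint rule, follow the pattern used in Google's published samples and are assumptions about this project's setup.

```python
from dataclasses import asdict, dataclass
from typing import List


@dataclass(frozen=True)
class SearchConfig:
    project_id: str
    location: str       # for example "global", "us" or "eu"
    engine_id: str
    data_store_id: str

    def missing_fields(self) -> List[str]:
        """Names of fields that are still empty."""
        return [name for name, value in asdict(self).items() if not value.strip()]

    @property
    def api_endpoint(self) -> str:
        """Non-global locations use a location-prefixed endpoint (assumed convention)."""
        if self.location == "global":
            return "discoveryengine.googleapis.com"
        return f"{self.location}-discoveryengine.googleapis.com"

    @property
    def serving_config(self) -> str:
        """Resource name of the search app's default serving config (assumed layout)."""
        return (
            f"projects/{self.project_id}/locations/{self.location}"
            f"/collections/default_collection/engines/{self.engine_id}"
            "/servingConfigs/default_config"
        )

    @property
    def data_store(self) -> str:
        """Resource name of the data store (assumed layout)."""
        return (
            f"projects/{self.project_id}/locations/{self.location}"
            f"/collections/default_collection/dataStores/{self.data_store_id}"
        )
```

Keeping the values in one place like this would avoid editing three files, though the repository as written expects them in each file.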

Enable APIs and IAM permissions:
- Enable the Vertex AI API and the Discovery Engine API
- Verify the API is working by running api_call_test.py

Run Streamlit app:
```
streamlit run app.py
```

Installation for deployment on Cloud Run

The following article was followed for deployment: https://medium.com/@faizififita1/how-to-deploy-your-streamlit-web-app-to-google-cloud-run-ba776487c5fe

A summarized version follows:

Install Docker on your system
Create a service account for authentication
- Navigate to IAM & Admin -> Service Accounts
- Select 'Create Service Account'
- Grant the following roles
  - Service Account Admin
- Select the service account and open its Keys tab
- Create a JSON key and add it to the project directory
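
Before building the container, it can help to confirm that the downloaded file really is a service account key. The sketch below checks a few fields that such key files normally contain. The function name `key_file_problems` is hypothetical, and the list of required fields is a deliberately small subset, not an official schema.

```python
import json
from pathlib import Path
from typing import List

# A small subset of the fields found in a service account key file.
REQUIRED_KEY_FIELDS = ("type", "project_id", "private_key", "client_email")


def key_file_problems(path: Path) -> List[str]:
    """Return a list of problems found in the key file; an empty list means it looks fine."""
    try:
        data = json.loads(path.read_text(encoding="utf-8"))
    except (OSError, ValueError) as exc:
        return [f"cannot read {path}: {exc}"]
    if not isinstance(data, dict):
        return ["key file is not a JSON object"]
    problems = [f"missing field: {name}" for name in REQUIRED_KEY_FIELDS if not data.get(name)]
    if data.get("type") and data["type"] != "service_account":
        problems.append(f"unexpected type: {data['type']}")
    return problems
```

Calling `key_file_problems(Path("key.json"))` on a hypothetical `key.json` in the project directory returns an empty list when the basic fields are present.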

Containerize your app

FROM python:3.8
EXPOSE 8080
WORKDIR /app
COPY . ./
RUN pip install -r requirements.txt
ENTRYPOINT ["streamlit", "run", "app.py", "--server.port=8080", "--server.address=0.0.0.0"]

Verify the application is working locally

docker build -t <IMAGE_NAME> .
docker run -p 8080:8080 <IMAGE_NAME>

Build Docker container

gcloud builds submit --tag gcr.io/<PROJECT_ID>/<SOME_PROJECT_NAME> --timeout=2h
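
If the build is scripted rather than typed by hand, the command above can be assembled in Python and passed to `subprocess.run`. This is a sketch under assumptions: the helper names are hypothetical, and the lowercase check reflects the general rule that Docker image names must be lowercase.

```python
from typing import List


def image_tag(project_id: str, image_name: str) -> str:
    """Build the gcr.io tag used in the command above."""
    if image_name != image_name.lower():
        raise ValueError("Docker image names must be lowercase")
    return f"gcr.io/{project_id}/{image_name}"


def build_command(project_id: str, image_name: str, timeout: str = "2h") -> List[str]:
    """Return the gcloud invocation as an argument list suitable for subprocess.run."""
    return [
        "gcloud", "builds", "submit",
        "--tag", image_tag(project_id, image_name),
        f"--timeout={timeout}",
    ]
```

For example, `subprocess.run(build_command("my-project", "rag-app"), check=True)` would submit the build for a hypothetical project `my-project`, assuming the gcloud CLI is installed and authenticated.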

Verify the container image is in Artifact Registry

Deploy on Cloud Run using the console UI
- Navigate to the Cloud Run tab
- Select "Deploy container-service"
- Select your image
- Select "Allow unauthenticated invocations"
- Select "Create"
- A link to the app will be provided
