This project is developed for CS889: Advanced Topics in HCI, Interfaces for Human-AI Interaction at the University of Waterloo in Winter 2025.
Project Contributors:
- Mohammad Abolnejadian
- Shakiba Amirshahi
This Streamlit-based prototype helps doctors make more informed decisions by generating on-the-fly insights grounded in historical data during doctor-patient conversations. The system detects key information in the conversation, including:
- The patient's medical problems
- Additional contextual information
- Solutions proposed by the doctor
The sample data is collected from Open Government Canada. The system stores embeddings in a ChromaDB database. A sample database is provided, and it can be augmented by running the preprocessing script.
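At its core, retrieval from an embedding database is a nearest-neighbor search by cosine similarity. The sketch below illustrates that lookup with the standard library alone, using toy three-dimensional vectors in place of real model embeddings; ChromaDB performs this search internally, and the function and document names here are illustrative, not part of this codebase.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

def nearest_documents(query_embedding, store, top_k=2):
    """Rank stored (doc_id, embedding) pairs by similarity to the query."""
    ranked = sorted(store.items(),
                    key=lambda item: cosine_similarity(query_embedding, item[1]),
                    reverse=True)
    return [doc_id for doc_id, _ in ranked[:top_k]]

# Toy 3-dimensional "embeddings" standing in for real model output.
store = {
    "asthma_note": [0.9, 0.1, 0.0],
    "fracture_note": [0.0, 0.2, 0.9],
    "allergy_note": [0.8, 0.3, 0.1],
}
print(nearest_documents([1.0, 0.0, 0.0], store))
# → ['asthma_note', 'allergy_note']
```

In the actual prototype, the query vector would come from the Azure embedding model and the store from the ChromaDB database described above.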
The backend uses a pipeline built with LangChain that integrates several Azure AI services:
- Speech-to-Text for transcribing conversations
- Embedding models for semantic understanding
- Chat completion models for generating insights
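Conceptually, those three services compose into one transcript-to-insight function. The sketch below shows that composition with each Azure call replaced by a placeholder; all names are illustrative rather than taken from the actual agent code.

```python
def transcribe(audio_chunk: bytes) -> str:
    # Placeholder for the Azure Speech-to-Text call.
    return audio_chunk.decode("utf-8")

def retrieve_context(transcript: str) -> list[str]:
    # Placeholder for the embedding lookup against the ChromaDB database.
    return [f"historical case related to: {transcript}"]

def generate_insight(transcript: str, context: list[str]) -> str:
    # Placeholder for the Azure chat-completion call.
    return f"Insight for '{transcript}' using {len(context)} retrieved case(s)"

def pipeline(audio_chunk: bytes) -> str:
    """Run the three stages in sequence, as the LangChain pipeline does."""
    transcript = transcribe(audio_chunk)
    context = retrieve_context(transcript)
    return generate_insight(transcript, context)

print(pipeline(b"patient reports chest pain"))
# → Insight for 'patient reports chest pain' using 1 retrieved case(s)
```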
To run this code, you need an Azure account with the following models deployed:
- Speech-to-Text (STT)
- Embedding model
- Chat completion model
- Create a virtual environment:
python -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
- Install dependencies:
pip install -r requirements.txt
- Set up environment variables:
cp .env.example .env
# Edit .env with your Azure configuration
Required environment variables include:
- Azure OpenAI endpoints and API keys
- Azure Speech-to-Text configuration
- Azure Chat Completion model settings
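A filled-in .env might look like the fragment below. The variable names are illustrative, not necessarily the ones the code reads; consult .env.example for the authoritative list.

```
# Illustrative values only; see .env.example for the real variable names
AZURE_OPENAI_ENDPOINT=https://your-resource.openai.azure.com/
AZURE_OPENAI_API_KEY=<your-api-key>
AZURE_SPEECH_KEY=<your-speech-key>
AZURE_SPEECH_REGION=<your-region>
AZURE_CHAT_DEPLOYMENT=<your-chat-deployment-name>
AZURE_EMBEDDING_DEPLOYMENT=<your-embedding-deployment-name>
```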
To augment the sample embedding database with additional data:
python preprocess.py
This will process text and CSV files from the sample_data directory and add them to the embedding database.
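The gist of that preprocessing step, collecting plain-text passages from .txt and .csv files before embedding them, can be sketched with the standard library alone; the function name and row-flattening scheme are illustrative, not the actual preprocess.py API.

```python
import csv
import tempfile
from pathlib import Path

def load_documents(data_dir: str) -> list[str]:
    """Collect plain-text passages from .txt and .csv files in a directory."""
    docs = []
    for path in sorted(Path(data_dir).iterdir()):
        if path.suffix == ".txt":
            docs.append(path.read_text(encoding="utf-8").strip())
        elif path.suffix == ".csv":
            with path.open(newline="", encoding="utf-8") as f:
                for row in csv.DictReader(f):
                    # Flatten each row into one "column: value" passage.
                    docs.append("; ".join(f"{k}: {v}" for k, v in row.items()))
    return docs

# Demo on a throwaway directory standing in for sample_data/.
with tempfile.TemporaryDirectory() as d:
    Path(d, "note.txt").write_text("Patient presented with a persistent cough.")
    Path(d, "cases.csv").write_text("problem,solution\ncough,rest and fluids\n")
    print(load_documents(d))
# → ['problem: cough; solution: rest and fluids',
#    'Patient presented with a persistent cough.']
```

Each returned passage would then be embedded and written into the ChromaDB database.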
Start the Streamlit application:
streamlit run run.py
- Start Screen: The application begins with a welcome screen where users can start a new session.
- Interaction Screen: During the session, the application:
  - Listens to the doctor-patient conversation
  - Transcribes the audio in real time
  - Identifies medical problems and relevant context
  - Generates insights based on historical data
  - Captures solutions proposed by the doctor
- Complete Conversation: As the conversation evolves, different parts of the UI are populated with detected information and insights.
(Screenshot: the interaction screen after conversation analysis is complete.)
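One way to picture how the UI fills in is as a session-state object whose fields grow as each kind of information is detected. The dataclass below is a sketch of that idea, not the prototype's actual state model; the field and example values are illustrative.

```python
from dataclasses import dataclass, field

@dataclass
class SessionState:
    """UI state populated incrementally as the conversation is analyzed."""
    transcript: list[str] = field(default_factory=list)
    problems: list[str] = field(default_factory=list)
    context: list[str] = field(default_factory=list)
    insights: list[str] = field(default_factory=list)
    solutions: list[str] = field(default_factory=list)

state = SessionState()
state.transcript.append("Doctor: How long has the headache lasted?")
state.problems.append("recurring headache")
state.solutions.append("hydration and sleep-schedule adjustment")
print(state.problems)
# → ['recurring headache']
```

Each detected item appends to the matching field, and the Streamlit view re-renders from this state on every update.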
- agent/: Contains the LangChain pipeline and conversation processing logic
- view/: UI components and Streamlit interface code
- sample_data/: Example medical data for testing
- sample_embedding_db/: Pre-built database of embeddings
- preprocess.py: Script for processing additional data into embeddings
- run.py: Main application entry point
When running locally, the application is accessible at http://localhost:8501, Streamlit's default address.