VisionValidator

VisionValidator is a single-page web application that evaluates and compares how well different AI vision models perform object detection, based on the bounding box data they return.

Overview

This tool helps developers, researchers and AI enthusiasts:

Test various AI vision models with their own images
Validate the returned bounding box data against a schema
Visualize object detection results with bounding boxes
Compare consistency across multiple queries with heat maps
Analyze contextual attributes identified in the image

Features

Multi-Model Support: Works with OpenAI GPT-4o, Google Gemini, and Anthropic Claude 3
Bounding Box Visualization: Displays detected objects with colored bounding boxes
Heat Map Generation: Creates visual representations of AI consistency across multiple queries
Schema Validation: Ensures AI responses conform to a standardized format
Secure API Key Management: Stores API keys locally in your browser
Multiple Query Analysis: Makes 5 identical requests to analyze model consistency
Response Comparison: Toggle between different AI responses for the same image
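
To make the Schema Validation feature more concrete, here is a minimal, dependency-free sketch of the kind of check a schema expresses. The response shape (`objects`, `label`, `box` as four normalized numbers) and the function name `validateDetectionResponse` are assumptions made for illustration; the application itself validates with Ajv against its own schema, which may differ.

```javascript
// Hypothetical response shape (an assumption, not the app's real schema):
//   { objects: [{ label: string, box: [xMin, yMin, xMax, yMax] }] }
// with all coordinates normalized to the range 0..1.
function validateDetectionResponse(response) {
  if (typeof response !== "object" || response === null || !Array.isArray(response.objects)) {
    return { valid: false, errors: ["'objects' must be an array"] };
  }
  const errors = [];
  response.objects.forEach((obj, i) => {
    if (typeof obj !== "object" || obj === null) {
      errors.push(`objects[${i}] must be an object`);
      return;
    }
    if (typeof obj.label !== "string" || obj.label.length === 0) {
      errors.push(`objects[${i}].label must be a non-empty string`);
    }
    const box = obj.box;
    const isBox =
      Array.isArray(box) &&
      box.length === 4 &&
      box.every((v) => typeof v === "number" && v >= 0 && v <= 1);
    if (!isBox) {
      errors.push(`objects[${i}].box must be four numbers between 0 and 1`);
    } else if (box[0] >= box[2] || box[1] >= box[3]) {
      errors.push(`objects[${i}].box must satisfy xMin < xMax and yMin < yMax`);
    }
  });
  return { valid: errors.length === 0, errors };
}
```

Collecting every error instead of stopping at the first one makes it easier to see how far a model's output drifts from the expected format.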

Getting Started

Usage

Select an AI Model: Choose between OpenAI GPT-4o, Google Gemini, or Anthropic Claude 3
Enter Your API Key: Provide your API key for the selected service (stored locally in your browser)
Upload an Image: Select any image you wish to analyze
Process the Image: Click "Send to AI & Validate" to start analysis
View Results:
- See bounding boxes drawn on your image
- View the heat map showing detection consistency
- Toggle between different AI responses using the selector
- Review the full JSON response below
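
One way to think about the consistency heat map is as a grid of counts accumulated over the repeated responses. The sketch below assumes each response has an `objects` array whose entries carry a `box` of normalized `[xMin, yMin, xMax, yMax]` values; both that shape and the name `buildHeatGrid` are hypothetical, and the application's actual rendering may work differently.

```javascript
// Count, for every grid cell, how many boxes across all responses
// cover the center point of that cell. Higher counts mean the model
// reported an object in that region more consistently.
function buildHeatGrid(responses, cols, rows) {
  const grid = Array.from({ length: rows }, () => new Array(cols).fill(0));
  for (const response of responses) {
    for (const { box } of response.objects) {
      const [xMin, yMin, xMax, yMax] = box;
      for (let r = 0; r < rows; r++) {
        for (let c = 0; c < cols; c++) {
          const cx = (c + 0.5) / cols; // normalized cell center, x
          const cy = (r + 0.5) / rows; // normalized cell center, y
          if (cx >= xMin && cx <= xMax && cy >= yMin && cy <= yMax) {
            grid[r][c] += 1;
          }
        }
      }
    }
  }
  return grid;
}
```

Each count can then be mapped to a color intensity when the grid is painted over the image.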

Requirements

Modern web browser (Chrome, Firefox, Safari, Edge)
API key for at least one of the supported AI services (OpenAI, Google Gemini, or Anthropic)

Technical Details

VisionValidator is a frontend-only application built with:

HTML, JavaScript, and Tailwind CSS
LangChain.js for AI model integration
Ajv for JSON schema validation
Canvas API for visualization
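
As an illustration of the Canvas API part, the following sketch converts boxes to pixel coordinates and outlines them on a 2D context. It assumes boxes arrive as normalized `[xMin, yMin, xMax, yMax]` arrays with a `label`; the helper names `toPixelRect` and `drawDetections` and the color palette are hypothetical and not taken from the application's source.

```javascript
// Convert a normalized box (values in 0..1) to a pixel rectangle.
function toPixelRect(box, width, height) {
  const [xMin, yMin, xMax, yMax] = box;
  return {
    x: xMin * width,
    y: yMin * height,
    w: (xMax - xMin) * width,
    h: (yMax - yMin) * height,
  };
}

// Outline each detected object and print its label inside the box.
// `ctx` is a CanvasRenderingContext2D, e.g. canvas.getContext("2d").
function drawDetections(ctx, objects, width, height) {
  const colors = ["#e6194b", "#3cb44b", "#4363d8", "#f58231"];
  objects.forEach((obj, i) => {
    const { x, y, w, h } = toPixelRect(obj.box, width, height);
    const color = colors[i % colors.length]; // cycle through the palette
    ctx.strokeStyle = color;
    ctx.lineWidth = 2;
    ctx.strokeRect(x, y, w, h);
    ctx.fillStyle = color;
    ctx.fillText(obj.label, x + 4, y + 12);
  });
}
```

In a page, this would typically be called after drawing the uploaded image onto the same canvas, passing `canvas.width` and `canvas.height`.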

Privacy Considerations

All processing happens in your browser
Images never touch our servers - they're sent directly from your browser to the AI provider
API keys are stored in your browser's localStorage and only sent to their respective providers
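
A minimal sketch of per-provider key handling with the Web Storage API is shown below. The key prefix and the function names are assumptions chosen for this example, not the application's actual storage keys; the `storage` parameter defaults to the browser's `localStorage` and exists only so the helpers can be exercised outside a browser.

```javascript
// Hypothetical prefix; the real app may use different storage keys.
const KEY_PREFIX = "visionvalidator.apiKey.";

// Store the key for one provider (e.g. "openai", "gemini", "anthropic").
function saveApiKey(provider, apiKey, storage = globalThis.localStorage) {
  storage.setItem(KEY_PREFIX + provider, apiKey);
}

// Return the stored key, or null if none has been saved.
function loadApiKey(provider, storage = globalThis.localStorage) {
  return storage.getItem(KEY_PREFIX + provider);
}

// Remove the stored key for one provider.
function clearApiKey(provider, storage = globalThis.localStorage) {
  storage.removeItem(KEY_PREFIX + provider);
}
```

Keeping one entry per provider means switching models does not overwrite a previously entered key. Note that anything in `localStorage` is readable by scripts running on the same origin, so keys should be cleared on shared machines.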

Examples

Use Cases

AI Research: Evaluate and compare different vision models
Quality Assurance: Test consistency of object detection across multiple queries
Model Evaluation: Determine which AI performs best for specific image types
Educational Tool: Learn about how AI systems interpret visual information

Contributing

Contributions are welcome! Feel free to submit issues or pull requests.
