Skip to content

Are you smarter than LLM, a UI interface to test against question bank LLM use the benchmark

Notifications You must be signed in to change notification settings

brianhuang822/MMLUHumanQuiz

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

25 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Loaded dataset of MMLU and put it into a page to let the laypeople to test their skill against the dataset

Code was mostly generated using GPT-4, was an experience to see how it did.

GPT-4 did a pretty good job minus a few bugs here and there which needed human intervention, but otherwise it let me make this site in 1 hour instead of 5 hours

Go to https://brianhuang822.github.io/MMLUHumanQuiz/ for static website

About

Are you smarter than LLM, a UI interface to test against question bank LLM use the benchmark

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published