danesherbs

🍹

Poolside

Dane danesherbs

30 followers · 21 following

Melbourne, Australia
UTC +11:00
danesherbs.com
@danesherbs

Achievements

Pinned

openai/evals Public

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 15.2k 2.6k
openai/mle-bench Public

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering

Python 560 62
summarizing-from-human-feedback Public

Implementation of OpenAI's "Learning to Summarize with Human Feedback"

Jupyter Notebook 7 1
bitblaster-16 Public

BitBlaster-16 is a 16-bit computer built from scratch using only NAND gates and data flip-flops as primitives! :)

Python 2
fermi-poker Public

Want to get better at making estimates under uncertainty? No? Well, now you can!

Python 3
self-taught-critiquer Public

Reducing the time to create critique-writing models by 100-1000x on n-digit arithmetic problems by getting the model to learn from its own generated outputs.

Python 1