Skip to content
Change the repository type filter

All

    Repositories list

    • Code for "Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs"
      Jupyter Notebook
      MIT License
      44200Updated Feb 27, 2025Feb 27, 2025
    • ccc-docs

      Public
      CAIS Compute Cluster (CCC) documentation
      MIT License
      0130Updated Feb 27, 2025Feb 27, 2025
    • hle

      Public
      Humanity's Last Exam
      Python
      MIT License
      2650800Updated Feb 26, 2025Feb 26, 2025
    • HPC cluster code and configurations for running on OCI
      Python
      Universal Permissive License v1.0
      14700Updated Feb 15, 2025Feb 15, 2025
    • AISES

      Public
      CSS
      2001Updated Feb 13, 2025Feb 13, 2025
    • CSS
      MIT License
      2040Updated Jan 27, 2025Jan 27, 2025
    • Measuring correlations between safety benchmarks and general AI capabilities benchmarks.
      Python
      MIT License
      1600Updated Oct 2, 2024Oct 2, 2024
    • HTML
      MIT License
      0300Updated Sep 20, 2024Sep 20, 2024
    • Forecasting.
      TypeScript
      113210Updated Sep 11, 2024Sep 11, 2024
    • HarmBench

      Public
      HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
      Jupyter Notebook
      MIT License
      79560225Updated Aug 16, 2024Aug 16, 2024
    • This is the starter kit for the Trojan Detection Challenge 2023 (LLM Edition), a NeurIPS 2023 competition.
      Python
      MIT License
      288400Updated May 19, 2024May 19, 2024
    • wmdp

      Public
      WMDP is a LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning method which reduces LLM performance on WMDP while retaining general capabilities.
      Jupyter Notebook
      MIT License
      3010281Updated Apr 27, 2024Apr 27, 2024
    • HTML
      MIT License
      0000Updated Mar 28, 2024Mar 28, 2024
    • JavaScript
      MIT License
      0100Updated Mar 6, 2024Mar 6, 2024
    • Prometheus exporter for performance metrics from Slurm.
      Go
      GNU General Public License v3.0
      156251Updated Nov 1, 2023Nov 1, 2023
    • Jupyter Notebook
      0400Updated Oct 30, 2023Oct 30, 2023
    • reading

      Public
      1100Updated Oct 26, 2023Oct 26, 2023
    • Cost-effectiveness models, tools, and results for various AI safety field-building programs.
      Python
      MIT License
      4502Updated Aug 15, 2023Aug 15, 2023
    • Website for the Trojan Detection Challenge NeurIPS 2022 competition
      JavaScript
      MIT License
      0000Updated Jul 28, 2023Jul 28, 2023
    • GoSlurmMailer - drop in replacement for default slurm MailProg. Delivers slurm job messages to various destinations.
      Go
      7000Updated Jun 21, 2023Jun 21, 2023
    • 196700Updated May 31, 2023May 31, 2023