CSR-Bench Repository

Overview

The csr-bench repository provides a benchmarking and retrieval system with utilities for executing bash commands, interacting with models, and web searching. This repository is organized into several Python scripts that serve various functions, such as data retrieval, bash utilities, and model execution.

For more details, please check our NAACL 2025 paper CSR-Bench: Benchmarking LLM Agents in Deployment of Computer Science Research Repositories

Repository Structure

main.py: The entry point of the repository where the core functionality is implemented.
docker_setup.sh: A shell script to set up the environment and conduct experiments using Docker.
requirements.txt: A list of Python dependencies required to run the project.
csr/: The main module directory containing various Python utilities:
- bash_utils.py: Functions related to executing and handling bash commands.
- retriever.py: Handles data and information retrieval from Github Issues.
- model.py: Manages machine learning model-related operations.
- dataloader.py: Contains utilities for loading and managing datasets.
- editor.py: Utility for editing data or files.
- web_search.py: Implements web searching capabilities.
- utils.py: Helper functions used throughout the project.
- const.py: Constants and configuration used across modules.

Prerequisites

To set up and run the repository, ensure the following prerequisites are met:

Python 3.x: Ensure you have Python 3 installed. You can download it from here.
Docker (Optional): If you wish to use Docker for a containerized environment, ensure Docker is installed.
- Follow instructions here to install Docker.

Installation

Clone the repository to your local machine:

git clone <repository_url>
cd git-bench-main

Install the required Python dependencies using pip:

pip install -r requirements.txt

(Optional) Set up Docker if you'd prefer to run the environment in a containerized setup:

bash docker_setup.sh

Usage

The primary script to run the system is main.py. You can execute it directly via the command line:

python main.py

Bash Utilities

The csr/bash_utils.py module provides a set of utilities to execute and interact with bash commands programmatically. To use this, import the necessary functions into your script:

from csr.bash_utils import your_function_here

Model Operations

The csr/model.py provides functions for loading, evaluating, or interacting with machine learning models. Example:

from csr.model import run_model

model_results = run_model(model_input)

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
assets		assets
csr-agent/utils		csr-agent/utils
csr		csr
preparation		preparation
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
README.md		README.md
SECURITY.md		SECURITY.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CSR-Bench Repository

Overview

Repository Structure

Prerequisites

Installation

Usage

Bash Utilities

Model Operations

About

Releases

Packages

Contributors 2

Languages

amazon-science/CSR-Bench

Folders and files

Latest commit

History

Repository files navigation

CSR-Bench Repository

Overview

Repository Structure

Prerequisites

Installation

Usage

Bash Utilities

Model Operations

About

Topics

Resources

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages