MobileAgentBench

An automated benchmark for mobile LLM agents.

Usage

Install AndroidStudio

Install AndroidStudio. AndroidStudio installs other debugging tools for you, such as ADB and Android emulators.

You may need to setup your envrionmnet variables.

export ANDROID_HOME=~/Library/Android/sdk
export PATH="$PATH:$ANDROID_HOME/tools:$ANDROID_HOME/tools/bin:$ANDROID_HOME/platform-tools"

Download Benchmarking Apps

The default benchmarking tasks use apps from SimpleMobileTools. Please download and install the following apps to your testing device (Android emulator is preferred). If you're using an Android emulator, you can simply drag and drop the APK files to install.

Calculator
Calendar
Contacts
FileManager
Gallery
AppLauncher
Messager
MusicPlayer
Notes
Recorder

Build MobileBenchMark as a Python Library

Clone this repo. Run the following commands to install it as a Python library. So you can use import it in other repos.

python3 -m pip install --upgrade build
python3 -m build

You'll find the mobile_agent_benchmark-0.0.1-py3-none-any.whl file under the dist folder. Activate your agent's virtual environment, then you can run pip install mobile_agent_benchmark-0.0.1-py3-none-any.whl to install the library.

Dummy Agent

For testing purpose, you can run the dummy_agent.py file. It acts as the simplest agent. In a for loop, it does nothing but just sleep for a few seconds. You can simulate what a real agent would do to test if the benchmark can successfully detect tash completion.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
src/mobile_agent_benchmark		src/mobile_agent_benchmark
.gitignore		.gitignore
README.md		README.md
environment.yml		environment.yml
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MobileAgentBench

Usage

Install AndroidStudio

Download Benchmarking Apps

Build MobileBenchMark as a Python Library

Dummy Agent

About

Releases

Packages

Languages

MobileAgentBench/mobile-agent-bench

Folders and files

Latest commit

History

Repository files navigation

MobileAgentBench

Usage

Install AndroidStudio

Download Benchmarking Apps

Build MobileBenchMark as a Python Library

Dummy Agent

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages