This tool helps you execute different AutoML frameworks on the problem data you want. The repository already includes several cases (e.g. credit_scoring), supports PMLB datasets, and is open to new experiments.
All existing cases are located in the `test_cases` directory. To execute an experiment, open the case's directory and run the `case_name.py` script inside.

The main part of the script creates a `CaseExecutor` with the execution params, the models, and the metrics to run:
```python
result_metrics = CaseExecutor(params=ExecutionParams(train_file=train_file,
                                                     test_file=test_file,
                                                     task=TaskTypesEnum.classification,
                                                     target_name='default',
                                                     case_label='scoring'),
                              models=[BenchmarkModelTypesEnum.baseline,
                                      BenchmarkModelTypesEnum.tpot,
                                      BenchmarkModelTypesEnum.fedot],
                              metric_list=['roc_auc', 'f1']).execute()
```
To see which hyperparameters were used for the AutoML models, have a look at the implementation of the `get_models_hyperparameters` function, where you can also tailor the required parameters.

```python
result_metrics['hyperparameters'] = get_models_hyperparameters()
```
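For illustration only, the returned dictionary might be shaped roughly like the sketch below; the actual keys and values are defined in `benchmark_utils` and may differ:

```python
# Hypothetical structure; check get_models_hyperparameters for the real one
{
    'TPOT': {'MAX_RUNTIME_MINS': 2, 'GENERATIONS': 50, 'POPULATION_SIZE': 10},
    'FEDOT': {'MAX_RUNTIME_MINS': 2, 'GENERATIONS': 50, 'POPULATION_SIZE': 10},
}
```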
The following function saves the execution results to a JSON file next to the case script:

```python
save_metrics_result_file(result_metrics, file_name='scoring_metrics')
```
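If you later need to inspect the saved metrics, the file can be read back with the standard `json` module (the exact file name and location are assumptions based on the call above):

```python
import json

# 'scoring_metrics.json' is assumed from file_name='scoring_metrics'
with open('scoring_metrics.json') as f:
    saved_metrics = json.load(f)
print(saved_metrics)
```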
To build your own experiment, create a directory with the name of your case in the `test_cases` directory. Inside it, create a directory named `data` for your data files and a script named after your case. Note: do not forget to replace every `your_case` phrase in the names below with the name of your case.
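Assuming a hypothetical case called `your_case` with train and test CSV files, the resulting layout would look like this:

```
test_cases/
└── your_case/
    ├── data/
    │   ├── your_case_train.csv
    │   └── your_case_test.csv
    └── your_case.py
```

Then fill the script in as follows: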
```python
from benchmark_model_types import BenchmarkModelTypesEnum
from executor import CaseExecutor, ExecutionParams
from core.repository.tasks import TaskTypesEnum
from benchmark_utils import (get_models_hyperparameters,
                             save_metrics_result_file,
                             get_your_case_data_paths,
                             )

if __name__ == '__main__':
    train_file, test_file = get_your_case_data_paths()

    result_metrics = CaseExecutor(params=ExecutionParams(train_file=train_file,
                                                         test_file=test_file,
                                                         task=TaskTypesEnum.classification,
                                                         target_name='default',
                                                         case_label='your_case'),
                                  models=[BenchmarkModelTypesEnum.baseline,
                                          BenchmarkModelTypesEnum.tpot,
                                          BenchmarkModelTypesEnum.fedot],
                                  metric_list=['roc_auc', 'f1']).execute()

    result_metrics['hyperparameters'] = get_models_hyperparameters()

    save_metrics_result_file(result_metrics, file_name='your_case_metrics')
```
To import your data properly, add a corresponding function for your case to the `benchmark_utils` script:
```python
import os
from typing import Tuple


def get_your_case_data_paths() -> Tuple[str, str]:
    # Paths are relative to the project root; project_root() is defined in benchmark_utils
    train_file_path = os.path.join('test_cases', 'your_directory', 'data', 'your_case_name_train.csv')
    test_file_path = os.path.join('test_cases', 'your_directory', 'data', 'your_case_name_test.csv')
    full_train_file_path = os.path.join(str(project_root()), train_file_path)
    full_test_file_path = os.path.join(str(project_root()), test_file_path)
    return full_train_file_path, full_test_file_path
```
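As an optional sanity check (not part of the original workflow), you can verify that the returned paths point to existing files before running the case:

```python
import os

train_path, test_path = get_your_case_data_paths()
# Both CSV files must exist before the case script can run
assert os.path.isfile(train_path), f'missing {train_path}'
assert os.path.isfile(test_path), f'missing {test_path}'
```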
Pay attention to the task type, the model types, and `target_name` (the target column name). All supported task and model types are listed in the `TaskTypesEnum` and `BenchmarkModelTypesEnum` enums respectively.
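Assuming both are standard Python `Enum` classes, the available members can be listed directly:

```python
from benchmark_model_types import BenchmarkModelTypesEnum
from core.repository.tasks import TaskTypesEnum

# Print every supported task and model type (assumes standard Enum semantics)
print([task.name for task in TaskTypesEnum])
print([model.name for model in BenchmarkModelTypesEnum])
```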