Add support for custom intervals #814
base: main
Conversation
model_analyzer/config/generate/brute_plus_binary_search_run_config_generator.py
model_analyzer/config/generate/quick_plus_concurrency_sweep_run_config_generator.py
if self._cli_config.is_llm_model():
    # The possible inference loads are concurrency, request rate, periodic concurrency, or custom (request-intervals)
    # - If custom is specified, it is used
    # - For LLM models, periodic concurrency is used
Rework this comment, as the LLM path has gone away.
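A minimal sketch of the load-selection precedence the quoted comment describes, with the LLM-specific branch dropped per the review note. The function name and flag-dictionary shape are assumptions for illustration; the flag names follow Perf Analyzer's CLI options.

```python
def select_inference_load(flags: dict) -> str:
    """Pick which inference-load option to use, by precedence (sketch)."""
    # Custom request intervals take priority when specified.
    if "request-intervals" in flags:
        return "request-intervals"
    # Otherwise fall back to an explicit request rate, if given.
    if "request-rate-range" in flags:
        return "request-rate-range"
    # Default to a concurrency sweep.
    return "concurrency-range"
```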
model_analyzer/config/generate/perf_analyzer_config_generator.py
@@ -153,3 +157,9 @@ def _set_concurrency(self, run_config: RunConfig, concurrency: int) -> RunConfig
        perf_config.update_config({"concurrency-range": concurrency})

        return run_config
This method is duplicated (also in brute search). Maybe this should be a static method in ModelProfileSpec?
Yes, this is still duplicated; I haven't cleaned it up yet. Both classes implement ConfigGeneratorInterface, so you could create a base class with the common code if you want.
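A sketch of the base-class idea discussed above: hoist the duplicated _set_concurrency helper into a shared parent of both generators. PerfConfig, RunConfig, and the generator class names here are simplified stand-ins for the Model Analyzer types, not the library's actual API.

```python
class PerfConfig:
    """Stand-in for the per-run perf analyzer config (assumption)."""

    def __init__(self):
        self.params = {}

    def update_config(self, params):
        self.params.update(params)


class RunConfig:
    """Stand-in holding one or more perf configs (assumption)."""

    def __init__(self, perf_configs):
        self._perf_configs = perf_configs

    def perf_configs(self):
        return self._perf_configs


class BaseRunConfigGenerator:
    """Shared home for the helper currently duplicated in both generators."""

    def _set_concurrency(self, run_config, concurrency):
        for perf_config in run_config.perf_configs():
            perf_config.update_config({"concurrency-range": concurrency})
        return run_config


class BruteRunConfigGenerator(BaseRunConfigGenerator):
    pass


class QuickRunConfigGenerator(BaseRunConfigGenerator):
    pass
```

Both subclasses then inherit a single implementation, so a fix in one place applies to both search strategies.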
@@ -511,9 +511,12 @@ def _get_next_perf_analyzer_config(

        perf_analyzer_config.update_config_from_profile_config(model_name, self._config)

        concurrency = self._calculate_concurrency(dimension_values)
        perf_config_params = {"batch-size": 1}
I feel like it'd be cleaner if the PerfAnalyzerConfig() constructor initialized batch-size: 1. It seems like we always need to add this in the code.
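A sketch of that suggestion: seed the default in the constructor so call sites no longer repeat {"batch-size": 1}. This is a simplified stand-in for PerfAnalyzerConfig, not the class's real internals.

```python
class PerfAnalyzerConfig:
    """Simplified stand-in: defaults batch-size to 1 at construction."""

    def __init__(self):
        # Default applied once here instead of at every call site.
        self._args = {"batch-size": 1}

    def update_config(self, params):
        self._args.update(params)

    def __getitem__(self, key):
        return self._args[key]
```

Callers can still override batch-size via update_config when a sweep needs a different value; the constructor only sets the common default.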
This is done in 1b24043.
            "concurrency-range": default_concurrency,
        }
        default_perf_analyzer_config.update_config(perf_config_params)
        if not "request-intervals" in model.perf_analyzer_flags():
Doing a self-review: I'm wondering if this should be if not model.is_load_specified(), just like line 515.
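A sketch of what that self-review suggestion would look like: a single is_load_specified() predicate that covers every load flag, instead of checking only request-intervals. ModelProfileSpec here is a simplified stand-in, and the exact set of load flags is an assumption for illustration.

```python
class ModelProfileSpec:
    """Simplified stand-in for the model spec (assumption)."""

    def __init__(self, perf_analyzer_flags):
        self._flags = perf_analyzer_flags

    def perf_analyzer_flags(self):
        return self._flags

    def is_load_specified(self):
        # True when any inference-load option was already set by the user
        # (assumed flag set; the real predicate may differ).
        load_flags = ("request-intervals", "concurrency-range", "request-rate-range")
        return any(f in self._flags for f in load_flags)


def needs_default_concurrency(model):
    # Broader than `if not "request-intervals" in model.perf_analyzer_flags()`:
    # skips the default sweep whenever any load flag is present.
    return not model.is_load_specified()
```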
Fix support for Perf Analyzer's request-intervals
Fixes #808