You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
And then I receive following error from game_buffer_muzero script:
value_list = value_list.reshape(-1) * (
ValueError: operands could not be broadcast together with shapes (6144,) (1536,)
Here is the Traceback:
Traceback (most recent call last):
File "/home/arsham/miniconda3/envs/mlagents/lib/python3.10/runpy.py", line 196, in _run_module_as_main
return _run_code(code, main_globals, None,
File "/home/arsham/miniconda3/envs/mlagents/lib/python3.10/runpy.py", line 86, in _run_code
exec(code, run_globals)
File "/home/arsham/.vscode/extensions/ms-python.debugpy-2025.0.1-linux-x64/bundled/libs/debugpy/adapter/../../debugpy/launcher/../../debugpy/__main__.py", line 71, in <module>
cli.main()
File "/home/arsham/.vscode/extensions/ms-python.debugpy-2025.0.1-linux-x64/bundled/libs/debugpy/adapter/../../debugpy/launcher/../../debugpy/../debugpy/server/cli.py", line 501, in main
run()
File "/home/arsham/.vscode/extensions/ms-python.debugpy-2025.0.1-linux-x64/bundled/libs/debugpy/adapter/../../debugpy/launcher/../../debugpy/../debugpy/server/cli.py", line 351, in run_file
runpy.run_path(target, run_name="__main__")
File "/home/arsham/.vscode/extensions/ms-python.debugpy-2025.0.1-linux-x64/bundled/libs/debugpy/_vendored/pydevd/_pydevd_bundle/pydevd_runpy.py", line 310, in run_path
return _run_module_code(code, init_globals, run_name, pkg_name=pkg_name, script_name=fname)
File "/home/arsham/.vscode/extensions/ms-python.debugpy-2025.0.1-linux-x64/bundled/libs/debugpy/_vendored/pydevd/_pydevd_bundle/pydevd_runpy.py", line 127, in _run_module_code
_run_code(code, mod_globals, init_globals, mod_name, mod_spec, pkg_name, script_name)
File "/home/arsham/.vscode/extensions/ms-python.debugpy-2025.0.1-linux-x64/bundled/libs/debugpy/_vendored/pydevd/_pydevd_bundle/pydevd_runpy.py", line 118, in _run_code
exec(code, run_globals)
File "/home/arsham/LightZero/zoo/matchthree/config/matchthree_stochastic_muzero_config.py", line 104, in <module>
train_muzero([main_config, create_config], seed=0, model_path=main_config.policy.model_path, max_env_step=max_env_step)
File "/home/arsham/LightZero/lzero/entry/train_muzero.py", line 202, in train_muzero
train_data = replay_buffer.sample(batch_size, policy)
File "/home/arsham/LightZero/lzero/mcts/buffer/game_buffer_muzero.py", line 141, in sample
batch_rewards, batch_target_values = self._compute_target_reward_value(
File "/home/arsham/LightZero/lzero/mcts/buffer/game_buffer_muzero.py", line 485, in _compute_target_reward_value
value_list = value_list.reshape(-1) * (
ValueError: operands could not be broadcast together with shapes (6144,) (1536,)
The problem is value_list have shape 4x transition_batch_size.
I would be grateful for any guidance on how to resolve the issue.
The text was updated successfully, but these errors were encountered:
Could you confirm whether any modifications have been made to components other than config and the environment?
Additionally, could you provide detailed debugging information within _compute_target_reward_value()? Specifically, details such as m_output.latent_state.shape, transition_batch_size, and value_list.shape would be helpful.
By default, transition_batch_size is calculated as game_segment_batch_size * (num_unroll_steps + 1), where num_unroll_steps defaults to 5, and game_segment_batch_size corresponds to batch_size in config.
Thank you for your attention! If you have any further questions, please feel free to ask.
Hi, I am trying to use stochastic muZero to play a game. Here is my config file:
And then I receive following error from game_buffer_muzero script:
value_list = value_list.reshape(-1) * (
ValueError: operands could not be broadcast together with shapes (6144,) (1536,)
Here is the Traceback:
The problem is value_list have shape 4x transition_batch_size.
I would be grateful for any guidance on how to resolve the issue.
The text was updated successfully, but these errors were encountered: