benchmarl.algorithms.MappoConfig

class MappoConfig(share_param_critic: bool, clip_epsilon: float, entropy_coef: float, critic_coef: float, loss_critic_type: str, lmbda: float, scale_mapping: str, use_tanh_normal: bool, minibatch_advantage: bool)[source]

Bases: AlgorithmConfig

Configuration dataclass for Mappo.

share_param_critic: bool = <dataclasses._MISSING_TYPE object>

clip_epsilon: float = <dataclasses._MISSING_TYPE object>

entropy_coef: float = <dataclasses._MISSING_TYPE object>

critic_coef: float = <dataclasses._MISSING_TYPE object>

loss_critic_type: str = <dataclasses._MISSING_TYPE object>

lmbda: float = <dataclasses._MISSING_TYPE object>

scale_mapping: str = <dataclasses._MISSING_TYPE object>

use_tanh_normal: bool = <dataclasses._MISSING_TYPE object>

minibatch_advantage: bool = <dataclasses._MISSING_TYPE object>

static associated_class() → Type[Algorithm][source]: The algorithm class associated to the config

static supports_continuous_actions() → bool[source]: If the algorithm supports continuous actions

static supports_discrete_actions() → bool[source]: If the algorithm supports discrete actions

static on_policy() → bool[source]: If the algorithm has to be run on policy or off policy

static has_centralized_critic() → bool[source]: If the algorithm uses a centralized critic