LAV2_base

SKRL configuration for the LAV2 base task.

Classes:

Name	Description
`Shared`	Shared actor-critic model for PPO.

Functions:

Name	Description
`get_agent_cfg`	Return the PPO agent configuration.
`get_model_class`	Return the model class for the agent.
`get_agent_class`	Return the agent class.
`get_memory_class`	Return the memory class.
`get_memory_cfg`	Return the memory configuration.
`get_trainer_class`	Return the trainer class.
`get_trainer_cfg`	Return the trainer configuration.

Shared

Shared(observation_space, state_space, action_space, device, clip_actions=False, clip_log_std=True, min_log_std=-20, max_log_std=2, reduction='sum')

Bases: GaussianMixin, DeterministicMixin, Model

Shared actor-critic model for PPO.

Initialize shared model layers and mixins.

Methods:

Name	Description
`act`	Compute actions or values depending on role.
`compute`	Compute policy/value outputs depending on role.

act(inputs: Any, role: str) -> Any

Compute actions or values depending on role.

Returns:

Type	Description
`Any`	`tuple | None`: Action outputs for the given role, or `None` if role is unknown.

compute(inputs: Any, role: str) -> Any

Compute policy/value outputs depending on role.

Returns:

Type	Description
`Any`	`tuple | None`: `(output_tensor, extras_dict)` for policy/value roles, or `None` otherwise.
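
The role-based dispatch described for `act` and `compute` can be illustrated with a small pure-Python stand-in. This is only a sketch under stated assumptions, not the `Shared` implementation: the class name `SharedSketch`, the `"states"` input key, the role strings `"policy"` and `"value"`, and the averaging "trunk" are all hypothetical, and plain lists replace tensors so the example runs without SKRL or PyTorch.

```python
from typing import Any, Dict, List, Optional, Tuple


class SharedSketch:
    """Hypothetical stand-in for a shared actor-critic model (not the SKRL class)."""

    def __init__(self, num_actions: int) -> None:
        self.num_actions = num_actions

    def _trunk(self, states: List[float]) -> float:
        # Shared feature extractor used by both heads; here just the input mean.
        return sum(states) / len(states)

    def compute(self, inputs: Dict[str, Any], role: str) -> Optional[Tuple[List[float], Dict[str, Any]]]:
        # Assumed input layout: a dict with a "states" entry.
        features = self._trunk(inputs["states"])
        if role == "policy":
            # Policy head: one output per action dimension.
            return [features] * self.num_actions, {}
        if role == "value":
            # Value head: a single scalar estimate.
            return [features], {}
        # Unknown role: signal it with None, as the reference above describes.
        return None

    def act(self, inputs: Dict[str, Any], role: str) -> Optional[Tuple[List[float], Dict[str, Any]]]:
        # In this sketch act() simply forwards to compute(); a real model would
        # route each role through the matching mixin (Gaussian or deterministic).
        return self.compute(inputs, role)
```

Because both heads read the same trunk output, a caller can query the policy and the value from one model object by changing only the `role` argument.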

get_agent_cfg() -> Any

Return the PPO agent configuration.

get_model_class() -> Any

Return the model class for the agent.

get_agent_class() -> Any

Return the agent class.

get_memory_class() -> Any

Return the memory class.

get_memory_cfg() -> Any

Return the memory configuration.

get_trainer_class() -> Any

Return the trainer class.

get_trainer_cfg() -> Any

Return the trainer configuration.
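
The functions above come in class/configuration pairs (`get_memory_class` with `get_memory_cfg`, `get_trainer_class` with `get_trainer_cfg`, and so on). One way a training script might consume such pairs is sketched below. Everything here is an assumption made for illustration: the dummy classes, the configuration keys and values, the `build` helper, and the choice to pass the configuration as keyword arguments are hypothetical, and the real return values and constructor signatures are not documented on this page.

```python
from typing import Any, Callable, Dict


class DummyMemory:
    """Hypothetical stand-in for the class returned by get_memory_class()."""

    def __init__(self, memory_size: int) -> None:
        self.memory_size = memory_size


class DummyTrainer:
    """Hypothetical stand-in for the class returned by get_trainer_class()."""

    def __init__(self, timesteps: int) -> None:
        self.timesteps = timesteps


# Stand-in getters that mirror the names above; the values are placeholders.
def get_memory_class() -> Any:
    return DummyMemory


def get_memory_cfg() -> Any:
    return {"memory_size": 16}


def get_trainer_class() -> Any:
    return DummyTrainer


def get_trainer_cfg() -> Any:
    return {"timesteps": 100}


def build(get_class: Callable[[], Any], get_cfg: Callable[[], Dict[str, Any]]) -> Any:
    # Look up the class and its configuration, then instantiate.
    # Passing the config as keyword arguments is an assumption of this sketch.
    cls = get_class()
    cfg = get_cfg()
    return cls(**cfg)


memory = build(get_memory_class, get_memory_cfg)
trainer = build(get_trainer_class, get_trainer_cfg)
```

Keeping the class and its configuration behind separate getters lets a caller swap either one without touching the construction code.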