base
Shared MLP model for basic LAV2 tasks.
Classes:
| Name | Description |
|---|---|
| SharedMLP | Shared actor-critic with a configurable MLP trunk. |
SharedMLP
SharedMLP(observation_space, state_space, action_space, device, clip_actions=False, clip_log_std=True, min_log_std=-20, max_log_std=2, reduction='sum', hidden_dims=(64, 64), activation=nn.ELU)
Bases: GaussianMixin, DeterministicMixin, Model
Shared actor-critic with a configurable MLP trunk.
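The `clip_log_std`, `min_log_std`, and `max_log_std` arguments in the signature suggest that the policy's log standard deviation is clamped to a fixed range before it is used. A minimal sketch of that clamping, assuming it behaves like a plain element-wise clamp; the function name `clamp_log_std` is hypothetical and not part of the library:

```python
import math


def clamp_log_std(log_std: float, clip: bool = True,
                  min_log_std: float = -20.0, max_log_std: float = 2.0) -> float:
    """Clamp a log standard deviation to [min_log_std, max_log_std] when clip is True.

    Hypothetical helper; the defaults mirror the SharedMLP signature.
    """
    if not clip:
        return log_std
    return max(min_log_std, min(max_log_std, log_std))


# A very large log-std is capped at max_log_std, which bounds the resulting std.
std = math.exp(clamp_log_std(5.0))
print(std <= math.exp(2.0))  # True
```

With the defaults, the standard deviation stays between `exp(-20)` and `exp(2)`, which keeps sampling numerically stable under this assumption.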
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
| hidden_dims | | Hidden dimensions of the MLP. | (64, 64) |
| activation | | Activation module class to instantiate. | ELU |
Initialize shared model layers and mixins.
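As a rough illustration of what `hidden_dims` controls, the sketch below derives the (input, output) size of each linear layer in a trunk. The helper name `trunk_layer_shapes` and the observation size of 8 are hypothetical; the actual layer construction inside `SharedMLP` is not shown in this reference and may differ.

```python
from typing import List, Sequence, Tuple


def trunk_layer_shapes(input_dim: int, hidden_dims: Sequence[int]) -> List[Tuple[int, int]]:
    """Return (in_features, out_features) for each linear layer of an MLP trunk.

    Hypothetical helper: assumes one linear layer per entry in hidden_dims,
    each followed by the chosen activation.
    """
    shapes = []
    prev = input_dim
    for width in hidden_dims:
        shapes.append((prev, width))
        prev = width
    return shapes


# Example: an 8-dimensional observation with the default hidden_dims=(64, 64).
print(trunk_layer_shapes(8, (64, 64)))  # [(8, 64), (64, 64)]
```

Under this assumption, the policy and value heads would both read the final 64-dimensional feature vector produced by the trunk.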
Methods:
| Name | Description |
|---|---|
| act | Compute actions or values depending on role. |
| compute | Compute policy/value outputs depending on role. |
act
act(inputs: Any, role: str) -> Any
Compute actions or values depending on role.
Returns:
| Type | Description |
|---|---|
| Any | tuple \| None: Action outputs for the given role, or |
compute
compute(inputs: Any, role: str) -> Any
Compute policy/value outputs depending on role.
Returns:
| Type | Description |
|---|---|
| Any | tuple \| None: |
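Both `act` and `compute` branch on `role`. The sketch below shows the general shape of such a dispatch for a shared actor-critic, with a single trunk feeding two heads. It is plain Python with stand-in callables rather than the actual `SharedMLP` code; the class name, the role names `"policy"` and `"value"`, the `"states"` input key, and the returned tuples are all assumptions for illustration.

```python
from typing import Any, Callable, Dict, List, Optional, Tuple

Vector = List[float]


class RoleDispatchSketch:
    """Hypothetical stand-in showing role-based dispatch over a shared trunk."""

    def __init__(self, trunk: Callable[[Vector], Vector],
                 policy_head: Callable[[Vector], Vector],
                 value_head: Callable[[Vector], float]) -> None:
        self.trunk = trunk
        self.policy_head = policy_head
        self.value_head = value_head

    def compute(self, inputs: Dict[str, Any], role: str) -> Optional[Tuple[Any, dict]]:
        # Both roles reuse the same trunk features.
        features = self.trunk(inputs["states"])
        if role == "policy":
            return self.policy_head(features), {}
        if role == "value":
            return self.value_head(features), {}
        # Unknown roles yield None in this sketch.
        return None


model = RoleDispatchSketch(
    trunk=lambda obs: [2.0 * x for x in obs],
    policy_head=lambda feats: [f + 1.0 for f in feats],
    value_head=lambda feats: sum(feats),
)
print(model.compute({"states": [1.0, 2.0]}, role="policy"))  # ([3.0, 5.0], {})
print(model.compute({"states": [1.0, 2.0]}, role="value"))   # (6.0, {})
```

Sharing the trunk means the observation is encoded by one set of weights for both roles, and only the small output heads differ.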