moe
PPO variant with MoE bias updates.
Classes:
| Name | Description |
|---|---|
PPO_MoE |
PPO variant with MoE bias updates. |
PPO_MoE
Bases: PPO
PPO variant with MoE bias updates.
Methods:
| Name | Description |
|---|---|
update |
Run PPO update and then update MoE biases. |
update
update(*args: Any, **kwargs: Any) -> None
Run PPO update and then update MoE biases.