Skip to content

moe

PPO variant with MoE bias updates.

Classes:

Name Description
PPO_MoE

PPO variant with MoE bias updates.

PPO_MoE

Bases: PPO

PPO variant with MoE bias updates.

Methods:

Name Description
update

Run PPO update and then update MoE biases.

update

update(*args: Any, **kwargs: Any) -> None

Run PPO update and then update MoE biases.