跳转至

moe

PPO variant with MoE bias updates.

类:

名称 描述
PPO_MoE

PPO variant with MoE bias updates.

PPO_MoE

Bases: PPO

PPO variant with MoE bias updates.

方法:

名称 描述
update

Run PPO update and then update MoE biases.

update

update(*args: Any, **kwargs: Any) -> None

Run PPO update and then update MoE biases.