loader

Policy loading and inference runners for SKRL evaluation.

Classes:

Name	Description
`BasePolicyRunner`	Base policy runner interface.
`JITPolicyRunner`	TorchScript policy runner.
`ONNXPolicyRunner`	ONNX policy runner.
`TensorRTPolicyRunner`	TensorRT policy runner for serialized engine artifacts.
`SKRLPolicyRunner`	Adapter exposing a unified `act(observations)` interface for SKRL.

Functions:

Name	Description
`load_agent`	Load a trained PyTorch agent for evaluation/inference.
`load_policy`	Load a policy as a unified runner supporting JIT, ONNX, or TensorRT.

BasePolicyRunner

BasePolicyRunner(action_dim: int | None = None)

Bases: ABC

Base policy runner interface.

Initialize the base runner.

Parameters:

Name	Type	Description	Default
`action_dim`	`int \| None`	Optional action dimension hint.	`None`

Methods:

Name	Description
`act`	Return action predictions for the given observations.

act `abstractmethod`

act(observations: ndarray | Tensor) -> np.ndarray

Return action predictions for the given observations.

JITPolicyRunner

JITPolicyRunner(model_path: str, device: str | device)

Bases: BasePolicyRunner

TorchScript policy runner.

Initialize a TorchScript policy runner.

Parameters:

Name	Type	Description	Default
`model_path`	`str`	Path to the JIT model.	required
`device`	`str \| device`	Target device.	required

Methods:

Name	Description
`act`	Compute actions from observations.

act

act(observations: ndarray | Tensor) -> np.ndarray

Compute actions from observations.

Returns:

Type	Description
`ndarray`	np.ndarray: Action array with shape matching the model output.

ONNXPolicyRunner

ONNXPolicyRunner(model_path: str, providers: list | None = None, session_options: Any | None = None)

Bases: BasePolicyRunner

ONNX policy runner.

Initialize an ONNX policy runner.

Parameters:

Name	Type	Description	Default
`model_path`	`str`	Path to the ONNX model.	required
`providers`	`list \| None`	Optional ONNX providers list.	`None`
`session_options`	`Any \| None`	Optional ONNX Runtime session options.	`None`

Raises:

Type	Description
`ImportError`	If onnxruntime is not installed.
`RuntimeError`	If the ONNX model has no inputs.

Methods:

Name	Description
`act`	Compute actions from observations.

act

act(observations: ndarray | Tensor) -> np.ndarray

Compute actions from observations.

Returns:

Type	Description
`ndarray`	np.ndarray: Action array with shape matching the model output.

TensorRTPolicyRunner

TensorRTPolicyRunner(model_path: str)

Bases: BasePolicyRunner

TensorRT policy runner for serialized engine artifacts.

Initialize a TensorRT policy runner.

Parameters:

Name	Type	Description	Default
`model_path`	`str`	Path to a serialized TensorRT engine.	required

Raises:

Type	Description
`FileNotFoundError`	If the TensorRT engine file is missing.
`RuntimeError`	If CUDA is unavailable.
`RuntimeError`	If the engine cannot be deserialized or has no I/O.

Methods:

Name	Description
`convert_onnx_to_engine`	Build a serialized TensorRT engine from an ONNX policy.
`act`	Compute actions from observations with a TensorRT engine.

convert_onnx_to_engine `classmethod`

convert_onnx_to_engine(onnx_path: str, engine_path: str | None = None, *, workspace_size_bytes: int = 1 << 30, fp16: bool = True, input_shapes: dict[str, tuple[tuple[int, ...], tuple[int, ...], tuple[int, ...]]] | None = None) -> str

Build a serialized TensorRT engine from an ONNX policy.

Parameters:

Name	Type	Description	Default
`onnx_path`	`str`	Path to the ONNX model.	required
`engine_path`	`str \| None`	Output path for the serialized engine. Defaults to replacing the suffix with `.engine`.	`None`
`workspace_size_bytes`	`int`	TensorRT workspace memory pool size.	`1 << 30`
`fp16`	`bool`	Whether to enable FP16 builder mode when supported.	`True`
`input_shapes`	`dict[str, tuple[tuple[int, ...], tuple[int, ...], tuple[int, ...]]] \| None`	Optional dynamic-shape profiles keyed by input tensor name as `(min_shape, opt_shape, max_shape)`.	`None`

Returns:

Name	Type	Description
`str`	`str`	Path to the serialized TensorRT engine.

Raises:

Type	Description
`FileNotFoundError`	If the ONNX model does not exist.
`ImportError`	If TensorRT is unavailable.
`RuntimeError`	If parsing or engine building fails.

act

act(observations: ndarray | Tensor) -> np.ndarray

Compute actions from observations with a TensorRT engine.

SKRLPolicyRunner

SKRLPolicyRunner(policy: Any, observation_preprocessor: Any, device: str | device)

Bases: BasePolicyRunner

Adapter exposing a unified act(observations) interface for SKRL.

Initialize an SKRL policy runner.

Methods:

Name	Description
`act`	Run policy inference and return numpy actions.

act

act(observations: ndarray | Tensor) -> np.ndarray

Run policy inference and return numpy actions.

load_agent

load_agent(model_cls: type, agent_cfg: Any, checkpoint_path: str, observation_shape: tuple, action_shape: tuple, state_shape: tuple | None = None, device: str | device | None = None) -> tuple

Load a trained PyTorch agent for evaluation/inference.

Parameters:

Name	Type	Description	Default
`model_cls`	`type`	Class of the model (policy).	required
`agent_cfg`	`Any`	Agent configuration object.	required
`checkpoint_path`	`str`	Path to the checkpoint file.	required
`observation_shape`	`tuple`	Tuple defining observation dimensions.	required
`action_shape`	`tuple`	Tuple defining action dimensions.	required
`state_shape`	`tuple \| None`	Tuple defining state dimensions.	`None`
`device`	`str \| device \| None`	Torch device (auto-detected if None).	`None`

Returns:

Name	Type	Description
`tuple`	`tuple`	(policy, observation_preprocessor, device).

Raises:

Type	Description
`FileNotFoundError`	If the checkpoint file is not found.
`ImportError`	If gymnasium is not installed.

load_policy

load_policy(model_path: str, device: str | device = 'cpu', providers: list | None = None, session_options: Any | None = None) -> BasePolicyRunner

Load a policy as a unified runner supporting JIT, ONNX, or TensorRT.

The returned object exposes act(observations) and returns numpy actions. Observations may be numpy arrays or torch tensors. A 1-D observation is treated as a single batch and the returned actions are 1-D accordingly.

Returns:

Name	Type	Description
`BasePolicyRunner`	`BasePolicyRunner`	Loaded policy runner.

Raises:

Type	Description
`ValueError`	If `model_path` is `None`.

loader

BasePolicyRunner

`action_dim`

act `abstractmethod`

JITPolicyRunner

`model_path`

`device`

act

ONNXPolicyRunner

`model_path`

`providers`

`session_options`

act

TensorRTPolicyRunner

`model_path`

convert_onnx_to_engine `classmethod`

`onnx_path`

`engine_path`

`workspace_size_bytes`

`fp16`

`input_shapes`

act

SKRLPolicyRunner

act

load_agent

`model_cls`

`agent_cfg`

`checkpoint_path`

`observation_shape`

`action_shape`

`state_shape`

`device`

load_policy

loader

BasePolicyRunner

action_dim

act abstractmethod

JITPolicyRunner

model_path

device

act

ONNXPolicyRunner

model_path

providers

session_options

act

TensorRTPolicyRunner

model_path

convert_onnx_to_engine classmethod

onnx_path

engine_path

workspace_size_bytes

fp16

input_shapes

act

SKRLPolicyRunner

act

load_agent

model_cls

agent_cfg

checkpoint_path

observation_shape

action_shape

state_shape

device

load_policy

`action_dim`

act `abstractmethod`

`model_path`

`device`

`model_path`

`providers`

`session_options`

`model_path`

convert_onnx_to_engine `classmethod`

`onnx_path`

`engine_path`

`workspace_size_bytes`

`fp16`

`input_shapes`

`model_cls`

`agent_cfg`

`checkpoint_path`

`observation_shape`

`action_shape`

`state_shape`

`device`