Skip to content

LAV2 Documentation

Overview

BIT-PRIMAL-Lab/LAV2

LAV2 Documentation

BIT-PRIMAL-Lab/LAV2

Home
User Guide
User Guide
- Installation
- Simulation Components
  Simulation Components
- RL Tasks
  RL Tasks
- Transfer and Deployment
  Transfer and Deployment
  - Overview Overview
    Table of contents
    
    Reading Order
    
    Scope
  - Real2Sim
  - Sim2Sim
  - Sim2Real
Development
Development
API Reference
API Reference
- lav2
  lav2
  - assets
  - controller
    
    controller
    
    base
    
    geo
    
    mapping
    
    mixer
    
    pid
    
    run
    
    torch
    
    torch
    
    mapping
    
    mixer
    
    pid
    
    utils
    
    xadap
  - dynamics
    
    dynamics
    
    base
    
    params
    
    rotor
    
    torch
    
    torch
    
    rotor
    
    track
    
    track
  - gamepads
    
    gamepads
    
    common
    
    sdl2
  - runner
    
    runner
    
    common
    
    common
    
    mujoco
    
    skrl
    
    skrl
    
    cfg
    
    cfg
    
    LAV2_base
    
    LAV2_base_moe
    
    LAV2_base_vel
    
    LAV2_navrl
    
    eval
    
    isaaclab
  - tasks
    
    tasks
    
    genesis_forge
    
    genesis_forge
    
    LAV2_base
    
    LAV2_base
    
    environment
    
    eval
    
    mdp
    
    mdp
    
    actions
    
    commands
    
    rewards
    
    train
    
    LAV2_base_velocity
    
    LAV2_base_velocity
    
    environment
    
    eval
    
    mdp
    
    mdp
    
    actions
    
    rewards
    
    train
    
    isaaclab
    
    isaaclab
    
    LAV2_base
    
    LAV2_base
    
    LAV2_bimodal_vel_env_cfg
    
    LAV2_env_cfg
    
    LAV2_track_env_cfg
    
    LAV2_vel_env_cfg
    
    agents
    
    agents
    
    rsl_rl_ppo_cfg
    
    mdp
    
    mdp
    
    actions
    
    commands
    
    events
    
    observations
    
    rewards
    
    terminations
    
    LAV2_base_direct
    
    LAV2_base_direct
    
    agents
    
    agents
    
    rsl_rl_ppo_cfg
    
    quadcopter_env
    
    LAV2_navrl
    
    LAV2_navrl
    
    LAV2_navrl_env
    
    agents
    
    LAV2_trajectory
    
    LAV2_trajectory
    
    LAV2_traj_env_cfg
    
    agents
    
    agents
    
    rsl_rl_ppo_cfg
    
    mdp
    
    mdp
    
    commands
    
    mjlab
    
    mjlab
    
    LAV2_base
    
    LAV2_base
    
    LAV2_env_cfg
    
    LAV2_rl_cfg
    
    mdp
    
    mdp
    
    actions
    
    commands
    
    rewards
  - trajectories
    
    trajectories
    
    base
    
    helix
    
    hover_step
    
    lemniscate
    
    lissajous
    
    rectangle
    
    torch
    
    torch
    
    base
    
    helix
    
    hover_step
    
    lemniscate
    
    lissajous
    
    rectangle

Transfer and Deployment

This section covers what happens after the core simulator and RL task definitions are already working: moving trained policies into other runtimes, checking that they still behave correctly, and wiring them into deployment-side software stacks.

flowchart TD
  A[Real platform and logs] --> B[Real2Sim]
  B --> C[Aligned parameters and task assumptions]
  C --> D[RL training]
  D --> E[Exported policy]
  E --> F[Sim2Sim]
  E --> G[Sim2Real]
  F --> H[Replay and alignment validation]
  G --> I[ROS middleware and PX4 integration]

Reading Order

Real2Sim

Bring simulation parameters closer to the real platform through parameter alignment and system identification.

Sim2Sim

Export trained policies and replay them in MuJoCo under controlled conditions.

Sim2Real

Assemble ROS middleware, software-in-the-loop integration, and hardware-facing workflows.

Scope

Use this section when the main question is how to move a trained or validated stack into another runtime and keep it usable there, from parameter alignment through replay and deployment-side integration.