Lerobot dataset filehandler #2802

ashwinvkNV · 2025-06-27T23:21:36Z

LeRobot Dataset File Handler

Overview

The LeRobot Dataset File Handler (LeRobotDatasetFileHandler) is a configuration-driven system for automatically extracting and recording episode data from Isaac Lab environments to the LeRobot dataset format. It provides a seamless bridge between Isaac Lab's manager-based environments and the HuggingFace LeRobot ecosystem, enabling efficient dataset creation for Vision-Language-Action (VLA) model training.

Architecture

Core Components

Configuration System (LeRobotDatasetCfg)
- Defines which observations to record and how to organize them
- Supports both regular observations and state observations
- Configurable task descriptions
Feature Extraction Engine
- Automatically analyzes environment observation and action managers
- Handles nested observation structures with group-based access
- Detects and processes video/image features automatically
Data Conversion Pipeline
- Converts Isaac Lab episode data to LeRobot format
- Handles tensor shape transformations and data type conversions
- Manages video/image format conversions ([B, H, W, C] → [C, H, W])
Dataset Management
- Creates and manages LeRobot dataset files
- Handles episode writing and metadata management
- Provides efficient storage with MP4 videos and Parquet files

Configuration System

LeRobotDatasetCfg Structure

@configclass
class LeRobotDatasetCfg:
    # Regular observations saved as "observation.{key}"
    observation_keys_to_record: List[tuple[str, str]] = MISSING
    
    # State observations combined into "observation.state"
    state_observation_keys: List[tuple[str, str]] = MISSING
    
    # Task description for all episodes
    task_description: str = MISSING

Configuration Patterns

Basic Configuration

env.cfg.lerobot_dataset.observation_keys_to_record = [
    ("policy", "joint_pos"), 
    ("policy", "camera_rgb")
]
env.cfg.lerobot_dataset.state_observation_keys = [
    ("policy", "joint_vel")
]
env.cfg.lerobot_dataset.task_description = "Stack the red cube on top of the blue cube"

Multi-Group Observations

env.cfg.lerobot_dataset.observation_keys_to_record = [
    ("policy", "joint_pos"),      # From policy group
    ("policy", "camera_rgb"),     # From policy group
    ("critic", "joint_vel")       # From critic group
]

Video/Image Support

env.cfg.lerobot_dataset.observation_keys_to_record = [
    ("policy", "camera_rgb")      # Automatically detected as video
]
# Handles [B, H, W, C] format and converts to [C, H, W] for LeRobot

Feature Extraction Process

1. Environment Analysis

The handler automatically analyzes the environment's structure:

def _extract_features_from_env(self, env) -> Dict[str, Dict]:
    features = {}
    
    # Extract action features
    features.update(self._extract_action_features(env))
    
    # Extract observation features
    features.update(self._extract_observation_features(env))
    
    # Add annotation features
    features.update(self._extract_annotation_features(env))
    
    return features

2. Action Feature Extraction

def _extract_action_features(self, env) -> Dict[str, Dict]:
    return {
        "action": {
            "dtype": "float32",
            "shape": (env.action_manager.total_action_dim,),
            "names": None
        }
    }

3. Observation Feature Extraction

The system processes observations based on configuration:

Regular Observations: Saved as observation.{key}
State Observations: Combined into observation.state
Video Detection: Automatically detects 4D tensors in [B, H, W, C] format

4. Video/Image Processing

def _is_video_feature(self, tensor: torch.Tensor) -> bool:
    # Check if tensor has exactly 4 dimensions
    if tensor.ndim == 4:
        # Validate [B, H, W, C] format
        if (tensor.shape[1] > 1 and tensor.shape[2] > 1 and tensor.shape[3] <= 4):
            return True
    return False

Data Flow

Episode Recording Process

Episode Creation

handler = LeRobotDatasetFileHandler()
handler.create("dataset.lerobot", env_name="my_env", env=env)

Feature Schema Generation
- Analyzes environment observation and action managers
- Creates LeRobot-compatible feature specifications
- Validates configuration against available observations
Episode Writing
```
handler.write_episode(episode_data)
```

Frame-by-Frame Processing

def _convert_and_save_episode(self, episode: EpisodeData):
    for frame_idx in range(num_frames):
        frame_data = {}
        
        # Process actions
        frame_data.update(self._process_actions(frame_action))
        
        # Process observations
        frame_obs = self._extract_frame_observations(obs_dict, frame_idx)
        frame_data.update(self._process_observations(frame_obs))
        
        # Add to dataset
        self._dataset.add_frame(frame_data, task)

Integration with Isaac Lab

Environment Configuration

The handler integrates seamlessly with Isaac Lab's manager-based environments:

# In environment configuration
env_cfg.lerobot_dataset = LeRobotDatasetCfg()
env_cfg.lerobot_dataset.observation_keys_to_record = [("policy", "camera_rgb")]
env_cfg.lerobot_dataset.state_observation_keys = [("policy", "joint_pos")]
env_cfg.lerobot_dataset.task_description = "Custom task"

# In recorder configuration
env_cfg.recorders.dataset_file_handler_class_type = LeRobotDatasetFileHandler

Recording Script Integration

The record_demos.py script automatically detects LeRobot format:

# Configure dataset format based on file extension
use_lerobot_format = args_cli.dataset_file.endswith('.lerobot')

if use_lerobot_format:
    env_cfg.recorders.dataset_file_handler_class_type = LeRobotDatasetFileHandler
else:
    env_cfg.recorders.dataset_file_handler_class_type = HDF5DatasetFileHandler

Dataset Structure

LeRobot Format Organization

dataset.lerobot/
├── dataset_info.json          # HuggingFace dataset metadata
├── state.json                 # Dataset state information
├── data/                      # Parquet files with episode data
│   ├── train-00000-of-00001.parquet
│   └── ...
├── videos/                    # Video files for camera observations
│   ├── episode_000000/
│   │   ├── front.mp4
│   │   ├── wrist.mp4
│   │   └── ...
│   └── episode_000001/
│       └── ...
└── meta/                      # Additional metadata
    └── info.json              # Isaac Lab specific metadata

Feature Naming Conventions

Camera observations: observation.images.{camera_position}
Robot state: observation.state
Regular observations: observation.{obs_key}
Actions: action
Episode metadata: episode_index, frame_index, timestamp, task

Error Handling and Validation

Configuration Validation

# Validate that required configuration exists
if not observation_keys_to_record or not state_observation_keys:
    raise ValueError(
        "LeRobotDatasetCfg must have at least one observation configured. "
        "Please set either observation_keys_to_record or state_observation_keys (or both)."
    )

# Validate observation groups and keys
if group_name not in obs_sample:
    available_groups = list(obs_sample.keys())
    raise ValueError(f"Observation group '{group_name}' not found. Available groups: {available_groups}")

Video Format Validation

def _is_video_feature(self, tensor: torch.Tensor) -> bool:
    if tensor.ndim == 4:
        if not (tensor.shape[1] > 1 and tensor.shape[2] > 1 and tensor.shape[3] <= 4):
            raise ValueError(
                f"Image data must be in [B, H, W, C] format, but got shape {tensor.shape}. "
                f"Expected format: batch_size > 0, height > 1, width > 1, channels <= 4"
            )
        return True
    return False

Performance Considerations

Memory Management

Efficient tensor processing with minimal memory overhead
Automatic cleanup of temporary data structures
Background video writing for large datasets

Storage Optimization

MP4 compression for video observations
Parquet format for efficient tabular data storage
Incremental episode writing to avoid memory accumulation

Usage Examples

Basic Recording

# Configure environment
env_cfg.lerobot_dataset.observation_keys_to_record = [("policy", "camera_rgb")]
env_cfg.lerobot_dataset.state_observation_keys = [("policy", "joint_pos")]
env_cfg.lerobot_dataset.task_description = "Stack cubes"

# Record dataset
./isaaclab.sh -p scripts/tools/record_demos.py \
    --task Isaac-Stack-Cube-Franka-IK-Rel-v0 \
    --teleop_device spacemouse \
    --dataset_file ./datasets/demo.lerobot

Advanced Configuration

# Multi-camera setup
env_cfg.lerobot_dataset.observation_keys_to_record = [
    ("policy", "camera_rgb"),      # Main camera
    ("policy", "camera_depth"),    # Depth camera
    ("policy", "wrist_camera"),    # Wrist camera
    ("policy", "end_effector_pos") # End effector position
]

# Comprehensive state representation
env_cfg.lerobot_dataset.state_observation_keys = [
    ("policy", "joint_pos"),       # Joint positions
    ("policy", "joint_vel"),       # Joint velocities
    ("policy", "gripper_state")    # Gripper state
]

Dependencies and Installation

Required Dependencies

pip install datasets opencv-python imageio[ffmpeg]

Optional Dependencies

huggingface_hub for dataset sharing
lerobot for training pipeline integration

Future Enhancements

Planned Features

Dynamic Task Descriptions: Support for episode-specific task descriptions
Multi-Modal Support: Enhanced support for audio and other sensor modalities
Compression Options: Configurable video compression settings
Streaming Support: Real-time dataset writing for long recording sessions
Validation Tools: Enhanced dataset validation and quality checks

Integration Improvements

Direct LeRobot Training: Seamless integration with LeRobot training pipelines
HuggingFace Hub Integration: Automated dataset upload and versioning
Dataset Versioning: Support for dataset versioning and incremental updates

Conclusion

The LeRobot Dataset File Handler provides a robust, configuration-driven solution for creating LeRobot-compatible datasets from Isaac Lab environments. Its automatic feature extraction, flexible configuration system, and seamless integration with the Isaac Lab ecosystem make it an essential tool for VLA model training and dataset sharing within the robotics community.

The handler's design emphasizes:

Ease of Use: Minimal configuration required for basic usage
Flexibility: Support for complex observation structures and multi-modal data
Performance: Efficient storage and processing of large datasets
Compatibility: Full compatibility with the LeRobot ecosystem
Extensibility: Easy to extend for new data types and use cases

ashwinvkNV added 10 commits June 24, 2025 08:51

so-100 prototype

451c11e

update usd path

6336c85

remove ik relative env

ed669e1

add udpates for new usd model

500c1d9

remove ignoring all datasets folders and add lerobot file handler

1368e66

remove print statements and viz debug

d2955b6

retune the action_gain

6db76cf

restructure lerobot dataset converter

e8ce16d

add temporary design document

c8fb015

add dependency

3c59b32

ashwinvkNV requested review from jsmith-bdai, kellyguo11, Mayankm96, pascal-roth and jtigue-bdai as code owners June 27, 2025 23:21

ashwinvkNV marked this pull request as draft June 27, 2025 23:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Lerobot dataset filehandler #2802

Lerobot dataset filehandler #2802

Uh oh!

ashwinvkNV commented Jun 27, 2025 •

edited

Loading

Uh oh!

Uh oh!

TMZ Celebrity News – Breaking Stories, Videos & Gossip

🎥 Watch TMZ Live

Lerobot dataset filehandler #2802

Are you sure you want to change the base?

Lerobot dataset filehandler #2802

Uh oh!

Conversation

ashwinvkNV commented Jun 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

LeRobot Dataset File Handler

Overview

Architecture

Core Components

Configuration System

LeRobotDatasetCfg Structure

Configuration Patterns

Basic Configuration

Multi-Group Observations

Video/Image Support

Feature Extraction Process

1. Environment Analysis

2. Action Feature Extraction

3. Observation Feature Extraction

4. Video/Image Processing

Data Flow

Episode Recording Process

Integration with Isaac Lab

Environment Configuration

Recording Script Integration

Dataset Structure

LeRobot Format Organization

Feature Naming Conventions

Error Handling and Validation

Configuration Validation

Video Format Validation

Performance Considerations

Memory Management

Storage Optimization

Usage Examples

Basic Recording

Advanced Configuration

Dependencies and Installation

Required Dependencies

Optional Dependencies

Future Enhancements

Planned Features

Integration Improvements

Conclusion

Uh oh!

Uh oh!

TMZ Celebrity News – Breaking Stories, Videos & Gossip

🎥 Watch TMZ Live

ashwinvkNV commented Jun 27, 2025 •

edited

Loading