[feat] Adding configuration to the torch.utils.data.DataLoader #1409

Open · wants to merge 2 commits into main

Conversation

YushunXiang (Contributor)

This pull request enhances the training pipeline configuration and data loader setup to improve performance and flexibility. The key changes are two new configuration options on the TrainPipelineConfig class and their use in the data loader initialization.

Enhancements to training pipeline configuration:

  • lerobot/configs/train.py: Added two new configuration options, persistent_workers (defaulting to False) and prefetch_factor (defaulting to None), to the TrainPipelineConfig class. These options provide finer control over data loading behavior.
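For illustration, the new fields might look like the following minimal sketch; the num_workers field is an assumption shown for context, while the two new options and their defaults come from this PR:

```python
from dataclasses import dataclass


@dataclass
class TrainPipelineConfig:
    # Assumed existing field, shown for context; the real class has many more options.
    num_workers: int = 4
    # New: forwarded to torch.utils.data.DataLoader to keep worker processes
    # alive between epochs instead of respawning them.
    persistent_workers: bool = False
    # New: forwarded to torch.utils.data.DataLoader as the number of batches
    # each worker loads in advance; None defers to PyTorch's default.
    prefetch_factor: int | None = None
```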

Updates to data loader setup:

  • lerobot/scripts/train.py: Updated the data loader initialization in the train function to use the new persistent_workers and prefetch_factor options from the TrainPipelineConfig. The prefetch_factor is conditionally applied only when num_workers > 0.
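A minimal runnable sketch of the resulting DataLoader call; the dataset and cfg stand-ins below are assumptions, not lerobot's actual objects:

```python
import torch
from torch.utils.data import DataLoader, TensorDataset

# Stand-ins for lerobot's real dataset and TrainPipelineConfig (assumed here).
dataset = TensorDataset(torch.randn(64, 3), torch.zeros(64))
cfg = type("Cfg", (), {
    "batch_size": 8, "num_workers": 2,
    "persistent_workers": True, "prefetch_factor": None,
})()

dataloader = DataLoader(
    dataset,
    batch_size=cfg.batch_size,
    num_workers=cfg.num_workers,
    shuffle=True,
    persistent_workers=cfg.persistent_workers,
    # prefetch_factor is only valid with worker processes; pass None (the
    # PyTorch >= 2.0 default) when num_workers == 0 to avoid a ValueError.
    prefetch_factor=cfg.prefetch_factor if cfg.num_workers > 0 else None,
)
```

With this guard, running with num_workers=0 (e.g. for debugging) keeps working, since PyTorch rejects a non-None prefetch_factor without worker processes.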

Copilot AI review requested due to automatic review settings · June 30, 2025 12:35

Copilot AI left a comment

Pull Request Overview

This PR enhances the training pipeline by adding configuration options to fine-tune the DataLoader behavior, improving both performance and flexibility.

  • Added two new configuration options—persistent_workers and prefetch_factor—to TrainPipelineConfig.
  • Updated the DataLoader initialization in the training script (train.py) to conditionally apply prefetch_factor when num_workers > 0.

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated no comments.

Reviewed files:

  • lerobot/scripts/train.py: Updated DataLoader setup to include persistent_workers and prefetch_factor.
  • lerobot/configs/train.py: Added new configuration options persistent_workers and prefetch_factor to TrainPipelineConfig.

Comments suppressed due to low confidence (2)

lerobot/scripts/train.py:186

  • Consider adding an inline comment explaining that prefetch_factor is applied only when num_workers > 0, which can help clarify the conditional usage for future maintainers.
        prefetch_factor=cfg.prefetch_factor if cfg.num_workers > 0 else None,

lerobot/configs/train.py:66

  • Consider enhancing the class-level documentation or adding inline comments that explain the purpose and expected behavior of the persistent_workers and prefetch_factor configuration options.
    persistent_workers: bool = False
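
Applied, that suggestion might look something like this (docstring wording is hypothetical, not from the PR):

```python
from dataclasses import dataclass


@dataclass
class TrainPipelineConfig:
    """Configuration for the training pipeline.

    persistent_workers: forwarded to torch.utils.data.DataLoader; keeps worker
        processes alive between epochs instead of respawning them. Only takes
        effect when num_workers > 0.
    prefetch_factor: forwarded to torch.utils.data.DataLoader; number of batches
        each worker loads in advance. Must remain None when num_workers == 0.
    """

    persistent_workers: bool = False
    prefetch_factor: int | None = None
```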
