You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
# Normalize from range [0,1] to [-1,1] as expacted by siglip
This looks like an important detail which was overlooked. I believe that the default visual transform should be MIN_MAX to avoid incorrect scaling of images to SigLIP or maybe another way is to add an assertion before we prepare images for the backbone.
The text was updated successfully, but these errors were encountered:
System Info
Information
Reproduction
Expected behavior
Thanks for implementing Pi0! I gave it a shot and observed good results using my robot arm.
Upon further inspection of your code, I found out that the default normalization mode for input images is identity:
lerobot/lerobot/common/policies/pi0/configuration_pi0.py
Line 35 in ee5525f
And the policy wrapper makes an assumption that given an identity transform, the input images are already in the range [0, 1], see:
lerobot/lerobot/common/policies/pi0/modeling_pi0.py
Line 360 in ee5525f
This looks like an important detail which was overlooked. I believe that the default visual transform should be
MIN_MAX
to avoid incorrect scaling of images to SigLIP or maybe another way is to add an assertion before we prepare images for the backbone.The text was updated successfully, but these errors were encountered: