Action normalization in pre/postprocessor

by xiaoBOSS - opened Jan 16

Jan 16

Why is the action normalized with std_mean in the preprocessor, but with identity in the postprocessor? Can someone help to explain this?

jadechoghari

Jan 17

The unnormalizer in the post-processor is only applied to action outputs (not visual or state outputs), since the policy (e.g., X-VLA) does not produce image or state logits.
That said, the action post-processor unnormalizer (MEAN_STD / IDENTITY / QUANTILES, etc.) must match the normalizer mode used in the pre-processor of course.
I’ve just updated the base model to fix this.
Note that we generally recommend users choose the normalizer they want at when finetuning the base model.

xiaoBOSS changed discussion status to closed Jan 19

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment