OpenMOSS-Team / MOSS-VL-Instruct-0408

sjzhou committed 30 days ago

Commit c3cd456 · verified · 1 parent: 3ac87d3

Update processing_moss_vl.py

Files changed (1)

processing_moss_vl.py +1 -1

@@ -116,7 +116,7 @@ class MossVLImageProcessor(Qwen2VLImageProcessor):
                 stacked_images = self.resize(
                     image=stacked_images,
                     size=SizeDict(height=resized_height, width=resized_width),
-                    interpolation=resample,
+                    resample=resample,
                 )
             resized_images_grouped[shape] = stacked_images
         resized_images = reorder_images(resized_images_grouped, grouped_images_index)
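
The change only renames the keyword under which the resampling filter is passed to `self.resize`. The diff does not show the signature of `resize`, so the following is a minimal sketch using two hypothetical stand-in functions (`resize_strict` and `resize_lenient` are illustrative names, not part of the repository or of any library). It shows the two ways a mismatched keyword name can behave in Python: an immediate `TypeError`, or a silently ignored argument when the callee accepts `**kwargs`.

```python
def resize_strict(image, size, resample=None):
    """Hypothetical stand-in: accepts the filter only as `resample`."""
    return {"size": size, "resample": resample}


def resize_lenient(image, size, resample=None, **kwargs):
    """Hypothetical stand-in: unknown keywords are swallowed by **kwargs."""
    return {"size": size, "resample": resample}


def call_with_keyword(fn, keyword, value):
    """Call fn, passing the filter under the given keyword name."""
    try:
        return fn(image=None, size=(224, 224), **{keyword: value})
    except TypeError as exc:
        # A strict signature rejects an unknown keyword outright.
        return f"TypeError: {exc}"


# Old spelling against a strict signature: fails loudly.
strict_old = call_with_keyword(resize_strict, "interpolation", "bicubic")

# Old spelling against a **kwargs signature: the filter is dropped
# and the default (None here) is used instead.
lenient_old = call_with_keyword(resize_lenient, "interpolation", "bicubic")

# New spelling: the filter reaches the parameter that is actually used.
new = call_with_keyword(resize_strict, "resample", "bicubic")

print(strict_old)
print(lenient_old)
print(new)
```

Which of the two failure modes applied before this commit depends on the actual `resize` signature in the installed library version, which this page does not show.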