Spaces:

broadfield-dev
/

HF-Dataset-Commander

Sleeping

broadfield-dev commited on Dec 30, 2025

Commit

e1c75d3

verified ·

1 Parent(s): 0020ded

Update processor.py

Files changed (1) hide show

processor.py CHANGED Viewed

@@ -334,8 +334,10 @@ The following operations were applied to the source data:
             ds_stream = load_dataset(source_id, name=conf, split=split, streaming=True, token=self.token)
             count = 0
             for i, row in enumerate(ds_stream):
                 if max_rows and count >= int(max_rows):
                     break
                 # 1. Filter
                 if recipe.get('filter_rule'):
@@ -389,6 +391,7 @@ The following operations were applied to the source data:
             for i, row in enumerate(ds_stream):
                 if len(processed) >= 5: break
                 # Check Filter
                 passed = True

             ds_stream = load_dataset(source_id, name=conf, split=split, streaming=True, token=self.token)
             count = 0
             for i, row in enumerate(ds_stream):
                 if max_rows and count >= int(max_rows):
                     break
+                row = dict(row)
                 # 1. Filter
                 if recipe.get('filter_rule'):
             for i, row in enumerate(ds_stream):
                 if len(processed) >= 5: break
+                row = dict(row)
                 # Check Filter
                 passed = True