Spaces:

vidore
/

README

Running

manu commited on Jun 27, 2024

Commit

4426238

verified ·

1 Parent(s): 0351994

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -51,13 +51,14 @@ We organized datasets into collections to constitute our benchmark ViDoRe and it
 - [*Captioning Baseline*](https://huggingface.co/collections/vidore/vidore-captioning-baseline-6658a2a62d857c7a345195fd):  Datasets in this collection are the same as in ViDoRe but preprocessed for textual retrieving. The original ViDoRe benchmark was passed to Unstructured to partition each page into chunks. Visual chunks are captioned using Claude Sonnet.
 ## Intended use
 You can either load a specific dataset using the standard `load_dataset` function from huggingface.
 ```python
   from datasets import load_dataset
-  dataset = load_dataset(dataset_item.item_id)
 ```
 To use the whole benchmark, you can list the datasets in the collection using the following snippet.

 - [*Captioning Baseline*](https://huggingface.co/collections/vidore/vidore-captioning-baseline-6658a2a62d857c7a345195fd):  Datasets in this collection are the same as in ViDoRe but preprocessed for textual retrieving. The original ViDoRe benchmark was passed to Unstructured to partition each page into chunks. Visual chunks are captioned using Claude Sonnet.
 ## Intended use
 You can either load a specific dataset using the standard `load_dataset` function from huggingface.
 ```python
   from datasets import load_dataset
+  dataset = load_dataset(<dataset>)
 ```
 To use the whole benchmark, you can list the datasets in the collection using the following snippet.