Spaces: rag-datasets/README
tillwenke committed on Oct 28, 2023

Commit f0ccb80

1 Parent(s): d3bdfe1

Update README.md

Files changed (1)

README.md CHANGED

@@ -7,4 +7,17 @@ sdk: static
 pinned: false
 ---
-Edit this `README.md` markdown file to author your organization card.
+To test a RAG solution, it helps to have a dataset that consists of a text corpus,
+correct responses to queries (e.g. question-answer pairs) to test the solution end-to-end, and ideally a set of relevant passages
+from the text corpus for each query to test the retrieval component separately.
+We call this a question-answer-passages dataset.
+There are plenty of large-scale datasets of this kind, such as [Google's Natural Questions](https://ai.google.com/research/NaturalQuestions/).
+Still, we lack such datasets that are **small-scale** and **narrow-domain**, for testing a RAG solution quickly or seeing how it performs
+in a specific domain.
+We created this space to build a collection of such datasets to boost the development of RAG solutions.
+Datasets consist of:
+* asdf