
2 Getting Started with HuggingFace

HuggingFace documentation resources

Gating access to dataset

Dataset Card

The dataset card is a standardized description of the dataset that documents its intended use and its limitations. For context, Mitchell et al. (2018) provide guidance for the related concept of Model Cards.

Model Cards for Model Reporting
Margaret Mitchell, Simone Wu, Andrew Zaldivar, Parker Barnes, Lucy Vasserman, Ben Hutchinson, Elena Spitzer, Inioluwa Deborah Raji, Timnit Gebru (2018) https://arxiv.org/abs/1810.03993

Some aspects of the dataset card that are specific to Rosetta Data Bazaar Datasets:

  • Quickstart Usage section that describes how to install the HuggingFace Datasets package, load model datasets, and use different parts of the dataset
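
As a rough illustration of what a Quickstart Usage section could contain, the sketch below renders one as Markdown text. The helper `render_quickstart`, the repository id `example-org/example_dataset`, and the configuration names are hypothetical placeholders and not part of any Rosetta Data Bazaar dataset. The rendered text relies only on `pip install datasets` and `datasets.load_dataset(path, name=...)` from the HuggingFace Datasets package.

```python
def render_quickstart(repo_id: str, configs: list[str]) -> str:
    """Render a draft Quickstart Usage section for a dataset card.

    repo_id and configs are supplied by the caller; nothing is
    downloaded here, the function only builds Markdown text.
    """
    lines = [
        "## Quickstart Usage",
        "",
        "Install the HuggingFace Datasets package:",
        "",
        "    pip install datasets",
        "",
        "Load the parts of the dataset:",
        "",
        "    import datasets",
    ]
    # One load_dataset call per named configuration (part) of the dataset.
    for config in configs:
        lines.append(
            f'    dataset = datasets.load_dataset("{repo_id}", name="{config}")'
        )
    return "\n".join(lines)


# Hypothetical repository id and configuration names, for illustration only.
print(render_quickstart("example-org/example_dataset", ["part_a", "part_b"]))
```

Generating the section from the repository id and configuration names keeps the install and load instructions consistent across dataset cards; the actual wording of each card remains up to the dataset curator.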