Buckets:
| license: openrail | |
| dataset_info: | |
| features: | |
| - name: ads_id | |
| dtype: string | |
| - name: arxiv_id | |
| dtype: string | |
| - name: title | |
| dtype: string | |
| - name: abstract | |
| dtype: string | |
| - name: embed | |
| sequence: float64 | |
| - name: umap_x | |
| dtype: float64 | |
| - name: umap_y | |
| dtype: float64 | |
| - name: date | |
| dtype: date32 | |
| - name: cites | |
| dtype: int64 | |
| - name: bibcode | |
| dtype: string | |
| - name: keywords | |
| sequence: string | |
| - name: ads_keywords | |
| sequence: string | |
| - name: read_count | |
| dtype: int64 | |
| - name: doi | |
| sequence: string | |
| - name: authors | |
| sequence: string | |
| - name: aff | |
| sequence: string | |
| - name: cite_bibcodes | |
| sequence: string | |
| - name: ref_bibcodes | |
| sequence: string | |
| splits: | |
| - name: train | |
| num_bytes: 20400431642 | |
| num_examples: 1149771 | |
| download_size: 12212217773 | |
| dataset_size: 20400431642 | |
| configs: | |
| - config_name: default | |
| data_files: | |
| - split: train | |
| path: data/train-* | |
| This dataset is associated with the Pathfinder app (https://pfdr.app, paper at https://arxiv.org/abs/2408.01556) and is updated roughly monthly to keep pace with new literature in astrophysics and cosmology. | |
Xet Storage Details
- Size:
- 1.16 kB
- Xet hash:
- 193f99cad7c0ee5199eea4e264d722daf8a22b14ba5c5b3e347a1b3cc003eea3
·
Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.