Upload plugins/mlintern/skills/hf-paper-search/SKILL.md with huggingface_hub
plugins/mlintern/skills/hf-paper-search/SKILL.md
ADDED
@@ -0,0 +1,60 @@
---
name: hf-paper-search
description: "Search for ML research papers on Hugging Face and arXiv, read methodology sections, trace citation graphs, and find linked datasets and models."
disable-model-invocation: false
---

# hf-paper-search — Paper Research

## Purpose

Find and analyze ML research papers to extract training recipes, datasets, hyperparameters, and known failure modes before implementing.

## Tools

- `paper_search`: Search HF papers by query. Good for quick discovery.
- For deeper operations (details, section reading, citation graphs, snippet search, recommendations, linked resources), use the paper research script:

```bash
python skills/ml-intern-harness/scripts/papers.py <operation> [args]
```
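
For scripted use, the invocation above can be assembled as an argument list. This is a minimal sketch, not the script's documented interface: the helper name `build_papers_command` is hypothetical, and it assumes every option is passed as a `--kebab-case value` pair, as the `--arxiv-id` and `--section` flags used in this skill suggest.

```python
import sys

# Script path taken from the invocation above.
PAPERS_SCRIPT = "skills/ml-intern-harness/scripts/papers.py"


def build_papers_command(operation, **options):
    """Return an argv list for papers.py (hypothetical helper).

    Assumes each keyword maps to a `--kebab-case value` flag pair.
    """
    argv = [sys.executable, PAPERS_SCRIPT, operation]
    for name, value in options.items():
        if value is None:
            continue  # skip options that were left unset
        argv += ["--" + name.replace("_", "-"), str(value)]
    return argv


cmd = build_papers_command("read_paper", arxiv_id="2305.18290", section=3)
print(" ".join(cmd[1:]))
```

Passing the list to `subprocess.run(cmd, check=True)` would run it without going through shell quoting.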

## Paper Script Operations

| Operation | Description |
|---|---|
| `search` | Search papers with Semantic Scholar filters (date, citations, categories) |
| `trending` | Get daily trending papers |
| `paper_details` | Get abstract, AI summary, GitHub link |
| `read_paper` | Read arXiv/ar5iv HTML sections |
| `citation_graph` | Find references or citations |
| `snippet_search` | Semantic search across paper passages |
| `recommend` | Find similar papers |
| `find_datasets` | Find HF datasets linked to a paper |
| `find_models` | Find HF models linked to a paper |
| `find_collections` | Find HF collections linked to a paper |
| `find_all_resources` | All linked resources at once |
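
Because the operation name is a free-form string, a caller may want to check it against the table before running the script. The sketch below is only an illustration: the `OPERATIONS` tuple copies the names from the table, `check_operation` is a hypothetical helper, and the script itself may validate its arguments differently.

```python
import difflib

# Operation names copied from the table above.
OPERATIONS = (
    "search", "trending", "paper_details", "read_paper", "citation_graph",
    "snippet_search", "recommend", "find_datasets", "find_models",
    "find_collections", "find_all_resources",
)


def check_operation(name):
    """Return name if it is listed, else raise ValueError with a near-match hint."""
    if name in OPERATIONS:
        return name
    close = difflib.get_close_matches(name, OPERATIONS, n=1)
    hint = f" Did you mean '{close[0]}'?" if close else ""
    raise ValueError(f"Unknown operation '{name}'.{hint}")


print(check_operation("citation_graph"))
```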

## Workflow

1. Use `paper_search` for quick discovery or `papers.py search` for filtered Semantic Scholar search.
2. Read methodology and results sections with `papers.py read_paper --arxiv-id <id> --section 3`.
3. Trace citation graphs with `papers.py citation_graph --arxiv-id <id> --direction citations`.
4. Extract concrete recipes: dataset, preprocessing, method, hyperparameters, model, metric, result.
5. Find linked HF datasets/models/collections with `papers.py find_all_resources --arxiv-id <id>`.
6. Inspect promising datasets with `inspect_dataset.py`.
7. Read current HF docs and GitHub examples before implementing.
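
Steps 2, 3, and 5 can be laid out as a command plan for a single paper. This is a rough sketch that assumes the flags shown in this skill: `plan_commands` is a hypothetical name, and passing `--arxiv-id` to `find_all_resources` follows the usage in the Example section.

```python
import shlex

# Script path as used elsewhere in this skill.
SCRIPT = "skills/ml-intern-harness/scripts/papers.py"


def plan_commands(arxiv_id, section=3, direction="citations"):
    """Return shell command strings for workflow steps 2, 3, and 5."""
    steps = [
        ["read_paper", "--arxiv-id", arxiv_id, "--section", str(section)],
        ["citation_graph", "--arxiv-id", arxiv_id, "--direction", direction],
        ["find_all_resources", "--arxiv-id", arxiv_id],
    ]
    # shlex.join quotes any argument that would need it in a POSIX shell.
    return [shlex.join(["python", SCRIPT, *step]) for step in steps]


for line in plan_commands("2305.18290"):
    print(line)
```

Step 4 stays manual: the recipe details have to be read out of the sections the commands return.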

## Example

```
paper_search(query="direct preference optimization")
python skills/ml-intern-harness/scripts/papers.py read_paper --arxiv-id 2305.18290 --section 3
python skills/ml-intern-harness/scripts/papers.py citation_graph --arxiv-id 2305.18290 --direction citations --limit 10
python skills/ml-intern-harness/scripts/papers.py find_all_resources --arxiv-id 2305.18290
```

## Research Standard

For paper-backed tasks, attribute important choices to a source:
- "Dataset X + method Y + model Z produced metric M on benchmark B" (source: arXiv:2305.18290, Section 4.2).
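
One way to keep such attributions consistent is to store each recipe as a small record and render the sentence from it. The sketch below is illustrative only: the `Recipe` dataclass and its field names are hypothetical, and the placeholder values mirror the template above rather than any real result.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Recipe:
    """Hypothetical record for one paper-backed claim."""
    dataset: str
    method: str
    model: str
    metric: str
    benchmark: str
    arxiv_id: str
    section: str

    def attribution(self):
        # Render the claim in the template's shape, followed by its source.
        return (
            f'"{self.dataset} + {self.method} + {self.model} produced '
            f'{self.metric} on {self.benchmark}" '
            f"(source: arXiv:{self.arxiv_id}, Section {self.section})."
        )


# Placeholder values taken from the template, not a real result.
example = Recipe("Dataset X", "method Y", "model Z", "metric M",
                 "benchmark B", "2305.18290", "4.2")
print(example.attribution())
```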