razvan commited on
Commit
75519f0
·
verified ·
1 Parent(s): c0744db

Upload plugins/mlintern/skills/hf-paper-search/SKILL.md with huggingface_hub

Browse files
plugins/mlintern/skills/hf-paper-search/SKILL.md ADDED
@@ -0,0 +1,60 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ name: hf-paper-search
3
+ description: "Search for ML research papers on Hugging Face and arXiv, read methodology sections, trace citation graphs, and find linked datasets and models."
4
+ disable-model-invocation: false
5
+ ---
6
+
7
+ # hf-paper-search — Paper Research
8
+
9
+ ## Purpose
10
+
11
+ Find and analyze ML research papers to extract training recipes, datasets, hyperparameters, and known failure modes before implementing.
12
+
13
+ ## Tools
14
+
15
+ - `paper_search`: Search HF papers by query. Good for quick discovery.
16
+ - For deeper operations (details, section reading, citation graphs, snippet search, recommendations, linked resources), use the paper research script:
17
+
18
+ ```bash
19
+ python skills/ml-intern-harness/scripts/papers.py <operation> [args]
20
+ ```
21
+
22
+ ## Paper Script Operations
23
+
24
+ | Operation | Description |
25
+ |---|---|
26
+ | `search` | Search papers with Semantic Scholar filters (date, citations, categories) |
27
+ | `trending` | Get daily trending papers |
28
+ | `paper_details` | Get abstract, AI summary, GitHub link |
29
+ | `read_paper` | Read arXiv/ar5iv HTML sections |
30
+ | `citation_graph` | Find references or citations |
31
+ | `snippet_search` | Semantic search across paper passages |
32
+ | `recommend` | Find similar papers |
33
+ | `find_datasets` | Find HF datasets linked to a paper |
34
+ | `find_models` | Find HF models linked to a paper |
35
+ | `find_collections` | Find HF collections linked to a paper |
36
+ | `find_all_resources` | All linked resources at once |
37
+
38
+ ## Workflow
39
+
40
+ 1. Use `paper_search` for quick discovery or `papers.py search` for filtered Semantic Scholar search.
41
+ 2. Read methodology and results sections with `papers.py read_paper --arxiv-id <id> --section 3`.
42
+ 3. Trace citation graphs with `papers.py citation_graph --arxiv-id <id> --direction citations`.
43
+ 4. Extract concrete recipes: dataset, preprocessing, method, hyperparameters, model, metric, result.
44
+ 5. Find linked HF datasets/models/collections with `papers.py find_all_resources`.
45
+ 6. Inspect promising datasets with `inspect_dataset.py`.
46
+ 7. Read current HF docs and GitHub examples before implementing.
47
+
48
+ ## Example
49
+
50
+ ```
51
+ _paper_search(query="direct preference optimization")
52
+ python skills/ml-intern-harness/scripts/papers.py read_paper --arxiv-id 2305.18290 --section 3
53
+ python skills/ml-intern-harness/scripts/papers.py citation_graph --arxiv-id 2305.18290 --direction citations --limit 10
54
+ python skills/ml-intern-harness/scripts/papers.py find_all_resources --arxiv-id 2305.18290
55
+ ```
56
+
57
+ ## Research Standard
58
+
59
+ For paper-backed tasks, attribute important choices to a source:
60
+ - "Dataset X + method Y + model Z produced metric M on benchmark B" (source: arXiv:2305.18290, Section 4.2)."