Pretrained LLMs from scratch.
Y. Yu
PursuitOfDataScience
AI & ML interests
LLM, GPU Computing, PyTorch
Recent Activity
updated
a collection
1 day ago
ArgonneAI
updated
a model
1 day ago
PursuitOfDataScience/Argonne-2.0
published
a model
1 day ago
PursuitOfDataScience/Argonne-2.0
Organizations
None yet
models
22
PursuitOfDataScience/Argonne-2.0
Text Generation
•
6B
•
Updated
•
34
PursuitOfDataScience/llama3.2-1b-thinking
Text Generation
•
1B
•
Updated
PursuitOfDataScience/llama-3-2-1b-open-r1-mot-sft
Text Generation
•
1B
•
Updated
PursuitOfDataScience/qwen2.5-0.5b-r1-dpo
Text Generation
•
0.5B
•
Updated
PursuitOfDataScience/qwen2.5-0.5b-dpo
Text Generation
•
0.5B
•
Updated
•
1
PursuitOfDataScience/qwen2.5-0.5b-open-r1-mot-cot-sft
Text Generation
•
0.5B
•
Updated
PursuitOfDataScience/llama3.2-1b-dpo
Text Generation
•
1B
•
Updated
PursuitOfDataScience/qwen2.5-0.5b-ultrachat-sft-multi-turn
0.5B
•
Updated
•
1
PursuitOfDataScience/finetuned-llama-3.2-3b-math-reasoning
3B
•
Updated
•
1
PursuitOfDataScience/finetuned-llama-3.2-3b-dpo
Text Generation
•
3B
•
Updated
•
2
datasets
44
PursuitOfDataScience/toucan-agentic-thinking
Viewer
•
Updated
•
119k
•
27
PursuitOfDataScience/arxiv-qa-thinking
Viewer
•
Updated
•
215k
•
20
PursuitOfDataScience/0.9M-thinking
Viewer
•
Updated
•
898k
•
118
PursuitOfDataScience/0.5M-thinking
Viewer
•
Updated
•
499k
•
195
PursuitOfDataScience/MiniMax-M2.1-Mixture-of-Thoughts
Viewer
•
Updated
•
349k
•
360
•
2
PursuitOfDataScience/gsm8k-thinking
Viewer
•
Updated
•
8.79k
•
9
PursuitOfDataScience/bbc-news-llama4-maverick-summary
Viewer
•
Updated
•
174k
•
17
PursuitOfDataScience/govreport-llama4-maverick-summary
Viewer
•
Updated
•
19.5k
•
14
•
1
PursuitOfDataScience/arxiv-llama4-maverick-abstract
Viewer
•
Updated
•
198k
•
23
PursuitOfDataScience/xsum-llama4-maverick-summary
Viewer
•
Updated
•
227k
•
14