AI & ML interests
Insanely fast LLM pre-training and fine-tuning for modern NVIDIA GPUs.
Recent Activity
models
11
surogate/Qwen3-14B-NVFP4
Text Generation
•
8B
•
Updated
•
6
surogate/Qwen3-32B-NVFP4
Text Generation
•
17B
•
Updated
•
6
surogate/Llama-3.3-70B-Instruct-NVFP4
41B
•
Updated
•
5
surogate/Qwen3-Next-80B-A3B-Instruct-NVFP4
Text Generation
•
Updated
•
5
surogate/Qwen3-Next-80B-A3B-Thinking-NVFP4
Text Generation
•
Updated
•
4
surogate/Qwen3-30B-A3B-NVFP4
Text Generation
•
16B
•
Updated
•
5
surogate/Qwen3-8B-NVFP4
Text Generation
•
5B
•
Updated
•
6
surogate/gemma-3-270m-it-NVFP4
0.4B
•
Updated
•
1
surogate/Qwen3-0.6B-NVFP4
0.6B
•
Updated
surogate/Qwen3-0.6B-AWQ
0.4B
•
Updated
•
43
datasets
11
surogate/hellaswag-ro
Viewer
•
Updated
•
9.25k
•
8
surogate/cc-pretrain
Viewer
•
Updated
•
981
•
3
surogate/brd-en
Viewer
•
Updated
•
143
•
1
surogate/brd
Viewer
•
Updated
•
143
•
1
surogate/densemax-self-cognition
Viewer
•
Updated
•
124
•
3
surogate/self-cognition-dan
Viewer
•
Updated
•
2k
•
2
surogate/self-cognition-generated
Viewer
•
Updated
•
2k
•
3
surogate/self-cognition-qwen3
Viewer
•
Updated
•
50
•
1
surogate/self-cognition
Viewer
•
Updated
•
50
•
9
surogate/alpaca-gpt4-data-en
Viewer
•
Updated
•
52k
•
12