-
Shekswess/tiny-think-dpo-math-stem-dpo-beta0-5-lr2e-6-e1-bs8
Text Generation • 0.1B • Updated • 123 -
Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr2e-6-e1-bs8
Text Generation • 0.1B • Updated • 69 -
Shekswess/tiny-think-dpo-math-stem-dpo-beta2-lr2e-6-e1-bs8
Text Generation • 0.1B • Updated • 67 -
Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr1e-6-e1-bs8
Text Generation • 0.1B • Updated • 72
Bojan Jakimovski
Shekswess
AI & ML interests
AWS Ambassador | Machine Learning Lead | College Professor | GenAI | MLOps
Recent Activity
updated
a collection
about 15 hours ago
Tiny Think
updated
a collection
about 15 hours ago
Tiny Think DPO Checkpoints
updated
a collection
about 15 hours ago
Tiny Think DPO Checkpoints
Organizations
Tiny Think SFT Checkpoints
-
Shekswess/tiny-think-sft-math-stem-loss-dft-bf16-e3-bs8
Text Generation • 0.1B • Updated • 287 -
Shekswess/tiny-think-sft-math-stem-loss-nll-bf16-e3-bs8
Text Generation • 0.1B • Updated • 190 -
Shekswess/tiny-think-sft-math-stem-loss-dft-bf16-lr2e-5-e2-bs8
Text Generation • 0.1B • Updated • 120 -
Shekswess/tiny-think-sft-math-stem-loss-dft-bf16-lr5e-5-e2-bs8
Text Generation • 0.1B • Updated • 117
Tiny Think DPO Checkpoints
-
Shekswess/tiny-think-dpo-math-stem-dpo-beta0-5-lr2e-6-e1-bs8
Text Generation • 0.1B • Updated • 123 -
Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr2e-6-e1-bs8
Text Generation • 0.1B • Updated • 69 -
Shekswess/tiny-think-dpo-math-stem-dpo-beta2-lr2e-6-e1-bs8
Text Generation • 0.1B • Updated • 67 -
Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr1e-6-e1-bs8
Text Generation • 0.1B • Updated • 72
Tiny Think SFT Checkpoints
-
Shekswess/tiny-think-sft-math-stem-loss-dft-bf16-e3-bs8
Text Generation • 0.1B • Updated • 287 -
Shekswess/tiny-think-sft-math-stem-loss-nll-bf16-e3-bs8
Text Generation • 0.1B • Updated • 190 -
Shekswess/tiny-think-sft-math-stem-loss-dft-bf16-lr2e-5-e2-bs8
Text Generation • 0.1B • Updated • 120 -
Shekswess/tiny-think-sft-math-stem-loss-dft-bf16-lr5e-5-e2-bs8
Text Generation • 0.1B • Updated • 117
models
31
Shekswess/tiny-think-dpo-math-stem-apo_zero-beta0_3-lr3e-6-e1-bs8
Text Generation
•
0.1B
•
Updated
•
67
Shekswess/tiny-think-dpo-math-stem-apo_zero-beta1-lr3e-6-e1-bs8
Text Generation
•
0.1B
•
Updated
•
69
Shekswess/tiny-think-dpo-math-stem-apo_zero-beta0_5-lr3e-6-e1-bs8
Text Generation
•
0.1B
•
Updated
•
69
Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr3e-6-e1-bs8
Text Generation
•
0.1B
•
Updated
•
144
Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr5e-6-e1-bs8
Text Generation
•
0.1B
•
Updated
•
72
Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr1e-6-e1-bs8
Text Generation
•
0.1B
•
Updated
•
72
Shekswess/tiny-think-dpo-math-stem-dpo-beta2-lr2e-6-e1-bs8
Text Generation
•
0.1B
•
Updated
•
67
Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr2e-6-e1-bs8
Text Generation
•
0.1B
•
Updated
•
69
Shekswess/tiny-think-dpo-math-stem-dpo-beta0-5-lr2e-6-e1-bs8
Text Generation
•
0.1B
•
Updated
•
123
Shekswess/tiny-think-sft-math-stem-loss-nll-bf16-e2-bs8
0.1B
•
Updated
•
59
datasets
34
Shekswess/tiny-think-dpo-math-n-stem
Viewer
•
Updated
•
2.86k
•
91
Shekswess/tiny-think-sft-math-n-stem
Viewer
•
Updated
•
29.1k
•
82
Shekswess/trlm-sft-stage-1-final-2
Viewer
•
Updated
•
58k
•
20
Shekswess/trlm-sft-stage-2-final-2
Viewer
•
Updated
•
78k
•
98
Shekswess/trlm-dpo-stage-3-final-2
Viewer
•
Updated
•
50k
•
53
Shekswess/customer-support
Viewer
•
Updated
•
1k
•
28
•
1
Shekswess/scientific-research
Viewer
•
Updated
•
1k
•
10
•
3
Shekswess/technical-manuals
Viewer
•
Updated
•
1k
•
48
•
3
Shekswess/legal-documents
Viewer
•
Updated
•
1k
•
45
•
4
Shekswess/financial-reports
Viewer
•
Updated
•
1k
•
12
•
1