Bojan Jakimovski
Shekswess
AI & ML interests
AWS Ambassador | Machine Learning Lead | College Professor | GenAI | MLOps
Recent Activity
Updated a collection about 17 hours ago: Tiny Think
Updated a collection about 17 hours ago: Tiny Think DPO Checkpoints
Tiny Think
Collection dedicated to all the datasets, checkpoints and any additional artifacts for Tiny Think
Tiny Language Model Datasets
Collection of synthetic datasets that can be used to pretrain any Tiny Language Model
Stable Diffusion XL Neuron Models
Collection of Stable Diffusion XL Models that can run on AWS Silicon Chips (specifically AWS Inferentia 2)
Tiny Think SFT Checkpoints
- Shekswess/tiny-think-sft-math-stem-loss-dft-bf16-e3-bs8
  Text Generation • 0.1B • Updated • 287
- Shekswess/tiny-think-sft-math-stem-loss-nll-bf16-e3-bs8
  Text Generation • 0.1B • Updated • 190
- Shekswess/tiny-think-sft-math-stem-loss-dft-bf16-lr2e-5-e2-bs8
  Text Generation • 0.1B • Updated • 120
- Shekswess/tiny-think-sft-math-stem-loss-dft-bf16-lr5e-5-e2-bs8
  Text Generation • 0.1B • Updated • 117
Tiny Reasoning Language Model
Collection dedicated to the development of the Tiny Reasoning Language Model (trlm)
SynthGenAI Datasets
Collection of synthetic datasets created with SynthGenAI
Medical Instruct Models
Collection of all the medical instruct fine-tuned LLMs with 7B parameters
Tiny Think DPO Checkpoints
- Shekswess/tiny-think-dpo-math-stem-dpo-beta0-5-lr2e-6-e1-bs8
  Text Generation • 0.1B • Updated • 123
- Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr2e-6-e1-bs8
  Text Generation • 0.1B • Updated • 69
- Shekswess/tiny-think-dpo-math-stem-dpo-beta2-lr2e-6-e1-bs8
  Text Generation • 0.1B • Updated • 67
- Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr1e-6-e1-bs8
  Text Generation • 0.1B • Updated • 72