Fine-tuned models from the MLSFT pipeline: MLSFT-Models-E5-S9
Kai Rawal
kairawal
AI & ML interests
None yet
Recent Activity
updated a collection about 1 month ago
MLSFT-Models-E3-S3407 updated a model about 1 month ago
kairawal/Qwen3-8B-HI-SynthDolly-r16alpha32-E3-S3407 published a model about 1 month ago
kairawal/Qwen3-8B-HI-SynthDolly-r16alpha32-E3-S3407Organizations
MLSFT-Models-E1-S9
Fine-tuned models from the MLSFT pipeline: MLSFT-Models-E1-S9
MLSFT-Models-E5-S3407
Fine-tuned models from the MLSFT pipeline: MLSFT-Models-E5-S3407
-
kairawal/Qwen3-8B-EN-SynthDolly-r16alpha32-E5-S3407
Text Generation • 8B • Updated • 50 -
kairawal/Llama-3.1-8B-Instruct-EN-SynthDolly-r16alpha32-E5-S3407
Text Generation • 8B • Updated • 51 -
kairawal/Qwen3-0.6B-EN-SynthDolly-r16alpha128-E5-S3407
Text Generation • 0.6B • Updated • 5 -
kairawal/Qwen3-0.6B-DA-SynthDolly-r16alpha128-E5-S3407
Text Generation • 0.6B • Updated • 3
MLSFT-Models-E1-S3407
Fine-tuned models from the MLSFT pipeline: MLSFT-Models-E1-S3407
-
kairawal/Llama-3.1-8B-Instruct-EN-SynthDolly-r16alpha32-E1-S3407
Text Generation • 8B • Updated • 50 -
kairawal/Qwen3-8B-EN-SynthDolly-r16alpha32-E1-S3407
Text Generation • 8B • Updated • 51 -
kairawal/Qwen3-8B-HI-SynthDolly-r16alpha32-E1-S3407
Text Generation • 8B • Updated • 48 -
kairawal/Qwen3-14B-EN-SynthDolly-r16alpha32-E1-S3407
Text Generation • 15B • Updated • 52
MLSFT-Models-E8-S73
Fine-tuned models from the MLSFT pipeline: MLSFT-Models-E8-S73
-
kairawal/Qwen3-0.6B-DA-SynthDolly-r16alpha32-E8-S73
Text Generation • 0.6B • Updated • 1 -
kairawal/Qwen3-0.6B-ZH-SynthDolly-r16alpha32-E8-S73
Text Generation • 0.6B • Updated • 2 -
kairawal/Qwen3-0.6B-GA-SynthDolly-r16alpha32-E8-S73
Text Generation • 0.6B • Updated • 2 -
kairawal/Qwen3-0.6B-PT-SynthDolly-r16alpha32-E8-S73
Text Generation • 0.6B • Updated • 2
MLSFT-Models-E3-S73
Fine-tuned models from the MLSFT pipeline: MLSFT-Models-E3-S73
-
kairawal/Qwen3-32B-HI-SynthDolly-r16alpha32-E3-S73
Text Generation • 33B • Updated • 38 -
kairawal/Qwen3-32B-DA-SynthDolly-r16alpha32-E3-S73
Text Generation • 33B • Updated • 40 -
kairawal/Qwen3-32B-ZH-SynthDolly-r16alpha32-E3-S73
Text Generation • 33B • Updated • 36 -
kairawal/Qwen3-32B-EL-SynthDolly-r16alpha32-E3-S73
Text Generation • 33B • Updated • 40
MLSFT-LLMs-E01
Benign Multilingual Fine-tuning: Single Epoch fine-tuning with SynthDolly data. Models available: Qwen8B, Qwen14B, and Qwen32B.
MLSFT-SmallLMs-E05
Benign Multilingual Fine-tuning: Five Epochs with SynthDolly data. Models available: Qwen0.6B, Qwen4B, Llama1B, Llama3B, Gemma1B and Gemma4B.
-
kairawal/Qwen3-0.6B-HI-SynthDolly-1A-E5
Text Generation • 0.6B • Updated • 5 -
kairawal/Qwen3-0.6B-DA-SynthDolly-1A-E5
Text Generation • 0.6B • Updated • 5 -
kairawal/Qwen3-0.6B-ZH-SynthDolly-1A-E5
Text Generation • 0.6B • Updated • 4 • 1 -
kairawal/Qwen3-0.6B-PT-SynthDolly-1A-E5
Text Generation • 0.6B • Updated • 7
MLSFT-SmallLMs-E01
Benign Multilingual Fine-tuning: One Epoch with SynthDolly data. Models available: Qwen0.6B, Qwen4B, Llama1B, Llama3B, Gemma1B and Gemma4B.
-
kairawal/Qwen3-4B-DA-SynthDolly-1A-E1
Text Generation • 4B • Updated • 10 • 1 -
kairawal/Qwen3-4B-ES-SynthDolly-1A-E1
Text Generation • 4B • Updated • 4 • 1 -
kairawal/Qwen3-0.6B-HI-SynthDolly-1A-E1
Text Generation • 0.6B • Updated • 5 -
kairawal/Qwen3-0.6B-DA-SynthDolly-1A-E1
Text Generation • 0.6B • Updated • 5
MLSFT-Models-E3-S9
Fine-tuned models from the MLSFT pipeline: MLSFT-Models-E3-S9
MLSFT-Models-E8-S3407
Fine-tuned models from the MLSFT pipeline: MLSFT-Models-E8-S3407
-
kairawal/Qwen3-8B-EN-SynthDolly-r16alpha32-E8-S3407
Text Generation • 8B • Updated • 53 -
kairawal/Llama-3.1-8B-Instruct-EN-SynthDolly-r16alpha32-E8-S3407
Text Generation • 8B • Updated • 48 -
kairawal/Qwen3-0.6B-EN-SynthDolly-r16alpha128-E8-S3407
Text Generation • 0.6B • Updated • 6 -
kairawal/Qwen3-0.6B-DA-SynthDolly-r16alpha128-E8-S3407
Text Generation • 0.6B • Updated • 3
MLSFT-Models-E3-S3407
Fine-tuned models from the MLSFT pipeline: MLSFT-Models-E3-S3407
MLSFT-Models-E8-S9
Fine-tuned models from the MLSFT pipeline: MLSFT-Models-E8-S9
-
kairawal/Qwen3-32B-DA-SynthDolly-r16alpha32-E8-S9
Text Generation • 33B • Updated • 42 -
kairawal/Qwen3-32B-ZH-SynthDolly-r16alpha32-E8-S9
Text Generation • 33B • Updated • 56 -
kairawal/Qwen3-32B-EL-SynthDolly-r16alpha32-E8-S9
Text Generation • 33B • Updated • 57 -
kairawal/Qwen3-8B-EN-SynthDolly-r16alpha32-E8-S9
Text Generation • 8B • Updated • 51
MLSFT-Models-E5-S73
Fine-tuned models from the MLSFT pipeline: MLSFT-Models-E5-S73
-
kairawal/Qwen3-32B-HI-SynthDolly-r16alpha32-E5-S73
Text Generation • 33B • Updated • 44 -
kairawal/Qwen3-32B-DA-SynthDolly-r16alpha32-E5-S73
Text Generation • 33B • Updated • 45 -
kairawal/Qwen3-32B-ZH-SynthDolly-r16alpha32-E5-S73
Text Generation • 33B • Updated • 43 -
kairawal/Qwen3-0.6B-HI-SynthDolly-r16alpha32-E5-S73
Text Generation • 0.6B • Updated • 2
MLSFT-Models-E1-S73
Fine-tuned models from the MLSFT pipeline: MLSFT-Models-E1-S73
-
kairawal/Qwen3-0.6B-HI-SynthDolly-r16alpha32-E1-S73
Text Generation • 0.6B • Updated • 3 -
kairawal/Qwen3-0.6B-DA-SynthDolly-r16alpha32-E1-S73
Text Generation • 0.6B • Updated • 2 -
kairawal/Qwen3-0.6B-ZH-SynthDolly-r16alpha32-E1-S73
Text Generation • 0.6B • Updated • 2 -
kairawal/Qwen3-0.6B-EL-SynthDolly-r16alpha32-E1-S73
Text Generation • 0.6B • Updated • 3
MLSFT-SmallLMs-E08
Benign Multilingual Fine-tuning: Eight Epochs with SynthDolly data. Models available: Qwen0.6B, Qwen4B, Llama1B, Llama3B, Gemma1B and Gemma4B.
-
kairawal/Qwen3-0.6B-DA-SynthDolly-1A-E8
Text Generation • 0.6B • Updated • 4 -
kairawal/Qwen3-0.6B-ZH-SynthDolly-1A-E8
Text Generation • 0.6B • Updated • 8 -
kairawal/Qwen3-0.6B-PT-SynthDolly-1A-E8
Text Generation • 0.6B • Updated • 5 -
kairawal/Qwen3-0.6B-ES-SynthDolly-1A-E8
Text Generation • 0.6B • Updated • 5
MLSFT-SmallLMs-E03
Benign Multilingual Fine-tuning: Three Epochs with SynthDolly data. Models available: Qwen0.6B, Qwen4B, Llama1B, Llama3B, Gemma1B and Gemma4B.
-
kairawal/Qwen3-0.6B-PT-SynthDolly-1A-E3
Text Generation • 0.6B • Updated • 5 • 1 -
kairawal/Qwen3-0.6B-TL-SynthDolly-1A-E3
Text Generation • 0.6B • Updated • 5 • 1 -
kairawal/Qwen3-0.6B-HI-SynthDolly-1A-E3
Text Generation • 0.6B • Updated • 5 -
kairawal/Qwen3-0.6B-DA-SynthDolly-1A-E3
Text Generation • 0.6B • Updated • 5
mlsft-datasets
SynthDolly benign multilingual finetuning data; and Multilingual SorryBench evaluation data. Languages available: ZH, DA, EL, HI, GA, PT, ES, & TL.
MLSFT-Models-E5-S9
Fine-tuned models from the MLSFT pipeline: MLSFT-Models-E5-S9
MLSFT-Models-E3-S9
Fine-tuned models from the MLSFT pipeline: MLSFT-Models-E3-S9
MLSFT-Models-E1-S9
Fine-tuned models from the MLSFT pipeline: MLSFT-Models-E1-S9
MLSFT-Models-E8-S3407
Fine-tuned models from the MLSFT pipeline: MLSFT-Models-E8-S3407
-
kairawal/Qwen3-8B-EN-SynthDolly-r16alpha32-E8-S3407
Text Generation • 8B • Updated • 53 -
kairawal/Llama-3.1-8B-Instruct-EN-SynthDolly-r16alpha32-E8-S3407
Text Generation • 8B • Updated • 48 -
kairawal/Qwen3-0.6B-EN-SynthDolly-r16alpha128-E8-S3407
Text Generation • 0.6B • Updated • 6 -
kairawal/Qwen3-0.6B-DA-SynthDolly-r16alpha128-E8-S3407
Text Generation • 0.6B • Updated • 3
MLSFT-Models-E5-S3407
Fine-tuned models from the MLSFT pipeline: MLSFT-Models-E5-S3407
-
kairawal/Qwen3-8B-EN-SynthDolly-r16alpha32-E5-S3407
Text Generation • 8B • Updated • 50 -
kairawal/Llama-3.1-8B-Instruct-EN-SynthDolly-r16alpha32-E5-S3407
Text Generation • 8B • Updated • 51 -
kairawal/Qwen3-0.6B-EN-SynthDolly-r16alpha128-E5-S3407
Text Generation • 0.6B • Updated • 5 -
kairawal/Qwen3-0.6B-DA-SynthDolly-r16alpha128-E5-S3407
Text Generation • 0.6B • Updated • 3
MLSFT-Models-E3-S3407
Fine-tuned models from the MLSFT pipeline: MLSFT-Models-E3-S3407
MLSFT-Models-E1-S3407
Fine-tuned models from the MLSFT pipeline: MLSFT-Models-E1-S3407
-
kairawal/Llama-3.1-8B-Instruct-EN-SynthDolly-r16alpha32-E1-S3407
Text Generation • 8B • Updated • 50 -
kairawal/Qwen3-8B-EN-SynthDolly-r16alpha32-E1-S3407
Text Generation • 8B • Updated • 51 -
kairawal/Qwen3-8B-HI-SynthDolly-r16alpha32-E1-S3407
Text Generation • 8B • Updated • 48 -
kairawal/Qwen3-14B-EN-SynthDolly-r16alpha32-E1-S3407
Text Generation • 15B • Updated • 52
MLSFT-Models-E8-S9
Fine-tuned models from the MLSFT pipeline: MLSFT-Models-E8-S9
-
kairawal/Qwen3-32B-DA-SynthDolly-r16alpha32-E8-S9
Text Generation • 33B • Updated • 42 -
kairawal/Qwen3-32B-ZH-SynthDolly-r16alpha32-E8-S9
Text Generation • 33B • Updated • 56 -
kairawal/Qwen3-32B-EL-SynthDolly-r16alpha32-E8-S9
Text Generation • 33B • Updated • 57 -
kairawal/Qwen3-8B-EN-SynthDolly-r16alpha32-E8-S9
Text Generation • 8B • Updated • 51
MLSFT-Models-E8-S73
Fine-tuned models from the MLSFT pipeline: MLSFT-Models-E8-S73
-
kairawal/Qwen3-0.6B-DA-SynthDolly-r16alpha32-E8-S73
Text Generation • 0.6B • Updated • 1 -
kairawal/Qwen3-0.6B-ZH-SynthDolly-r16alpha32-E8-S73
Text Generation • 0.6B • Updated • 2 -
kairawal/Qwen3-0.6B-GA-SynthDolly-r16alpha32-E8-S73
Text Generation • 0.6B • Updated • 2 -
kairawal/Qwen3-0.6B-PT-SynthDolly-r16alpha32-E8-S73
Text Generation • 0.6B • Updated • 2
MLSFT-Models-E5-S73
Fine-tuned models from the MLSFT pipeline: MLSFT-Models-E5-S73
-
kairawal/Qwen3-32B-HI-SynthDolly-r16alpha32-E5-S73
Text Generation • 33B • Updated • 44 -
kairawal/Qwen3-32B-DA-SynthDolly-r16alpha32-E5-S73
Text Generation • 33B • Updated • 45 -
kairawal/Qwen3-32B-ZH-SynthDolly-r16alpha32-E5-S73
Text Generation • 33B • Updated • 43 -
kairawal/Qwen3-0.6B-HI-SynthDolly-r16alpha32-E5-S73
Text Generation • 0.6B • Updated • 2
MLSFT-Models-E3-S73
Fine-tuned models from the MLSFT pipeline: MLSFT-Models-E3-S73
-
kairawal/Qwen3-32B-HI-SynthDolly-r16alpha32-E3-S73
Text Generation • 33B • Updated • 38 -
kairawal/Qwen3-32B-DA-SynthDolly-r16alpha32-E3-S73
Text Generation • 33B • Updated • 40 -
kairawal/Qwen3-32B-ZH-SynthDolly-r16alpha32-E3-S73
Text Generation • 33B • Updated • 36 -
kairawal/Qwen3-32B-EL-SynthDolly-r16alpha32-E3-S73
Text Generation • 33B • Updated • 40
MLSFT-Models-E1-S73
Fine-tuned models from the MLSFT pipeline: MLSFT-Models-E1-S73
-
kairawal/Qwen3-0.6B-HI-SynthDolly-r16alpha32-E1-S73
Text Generation • 0.6B • Updated • 3 -
kairawal/Qwen3-0.6B-DA-SynthDolly-r16alpha32-E1-S73
Text Generation • 0.6B • Updated • 2 -
kairawal/Qwen3-0.6B-ZH-SynthDolly-r16alpha32-E1-S73
Text Generation • 0.6B • Updated • 2 -
kairawal/Qwen3-0.6B-EL-SynthDolly-r16alpha32-E1-S73
Text Generation • 0.6B • Updated • 3
MLSFT-LLMs-E01
Benign Multilingual Fine-tuning: Single Epoch fine-tuning with SynthDolly data. Models available: Qwen8B, Qwen14B, and Qwen32B.
MLSFT-SmallLMs-E08
Benign Multilingual Fine-tuning: Eight Epochs with SynthDolly data. Models available: Qwen0.6B, Qwen4B, Llama1B, Llama3B, Gemma1B and Gemma4B.
-
kairawal/Qwen3-0.6B-DA-SynthDolly-1A-E8
Text Generation • 0.6B • Updated • 4 -
kairawal/Qwen3-0.6B-ZH-SynthDolly-1A-E8
Text Generation • 0.6B • Updated • 8 -
kairawal/Qwen3-0.6B-PT-SynthDolly-1A-E8
Text Generation • 0.6B • Updated • 5 -
kairawal/Qwen3-0.6B-ES-SynthDolly-1A-E8
Text Generation • 0.6B • Updated • 5
MLSFT-SmallLMs-E05
Benign Multilingual Fine-tuning: Five Epochs with SynthDolly data. Models available: Qwen0.6B, Qwen4B, Llama1B, Llama3B, Gemma1B and Gemma4B.
-
kairawal/Qwen3-0.6B-HI-SynthDolly-1A-E5
Text Generation • 0.6B • Updated • 5 -
kairawal/Qwen3-0.6B-DA-SynthDolly-1A-E5
Text Generation • 0.6B • Updated • 5 -
kairawal/Qwen3-0.6B-ZH-SynthDolly-1A-E5
Text Generation • 0.6B • Updated • 4 • 1 -
kairawal/Qwen3-0.6B-PT-SynthDolly-1A-E5
Text Generation • 0.6B • Updated • 7
MLSFT-SmallLMs-E03
Benign Multilingual Fine-tuning: Three Epochs with SynthDolly data. Models available: Qwen0.6B, Qwen4B, Llama1B, Llama3B, Gemma1B and Gemma4B.
-
kairawal/Qwen3-0.6B-PT-SynthDolly-1A-E3
Text Generation • 0.6B • Updated • 5 • 1 -
kairawal/Qwen3-0.6B-TL-SynthDolly-1A-E3
Text Generation • 0.6B • Updated • 5 • 1 -
kairawal/Qwen3-0.6B-HI-SynthDolly-1A-E3
Text Generation • 0.6B • Updated • 5 -
kairawal/Qwen3-0.6B-DA-SynthDolly-1A-E3
Text Generation • 0.6B • Updated • 5
MLSFT-SmallLMs-E01
Benign Multilingual Fine-tuning: One Epoch with SynthDolly data. Models available: Qwen0.6B, Qwen4B, Llama1B, Llama3B, Gemma1B and Gemma4B.
-
kairawal/Qwen3-4B-DA-SynthDolly-1A-E1
Text Generation • 4B • Updated • 10 • 1 -
kairawal/Qwen3-4B-ES-SynthDolly-1A-E1
Text Generation • 4B • Updated • 4 • 1 -
kairawal/Qwen3-0.6B-HI-SynthDolly-1A-E1
Text Generation • 0.6B • Updated • 5 -
kairawal/Qwen3-0.6B-DA-SynthDolly-1A-E1
Text Generation • 0.6B • Updated • 5
mlsft-datasets
SynthDolly benign multilingual finetuning data; and Multilingual SorryBench evaluation data. Languages available: ZH, DA, EL, HI, GA, PT, ES, & TL.