Kshitij Thakkar PRO
kshitijthakkar
AI & ML interests
AI observability + MoE efficiency engineer. Building tools that make GenAI traceable, measurable, and production-ready.
Recent Activity
- Updated a model 1 day ago: kshitijthakkar/loggenix-moe-1b-sft-s4-checkpoints
- Published a model 1 day ago: kshitijthakkar/loggenix-moe-1b-sft-s4-checkpoints
- Updated a dataset 1 day ago: kshitijthakkar/loggenix-moe-1b-pretrain-data
Large MoE Architecture Search (1B-2B)
Systematic search over 1B-2B MoE models. Best configuration: bs=1, ctx=2048, which reaches a loss of 0.32. Top-8 routing outperforms top-2.
OutageOdyssey
My submission to Agents-MCP-Hackathon
Loggenix-MOE
Collection of Loggenix models, the eval dataset, and a demo playground. The training dataset will be added soon.
- Loggenix Moe 0.3B A0.1B Demo (Build error) • 🏢 2
  Demo Space for my model loggenix-moe-0.3B-A0.1B
- kshitijthakkar/loggenix-synthetic-ai-tasks-eval-with-outputs
  Viewer • Updated • 28 • 8
- kshitijthakkar/loggenix-synthetic-ai-tasks-eval_v6-with-outputs
  Viewer • Updated • 170 • 11
- kshitijthakkar/loggenix-synthetic-ai-tasks-eval_v5-with-outputs-v7-sft-v1
  Viewer • Updated • 170 • 11
Qwen3.5 Dense-to-MoE Weight Transfer
Qwen3.5 MoE models from dual-source weight transfer (dense backbone + 35B-A3B experts). Hybrid DeltaNet + GQA attention.
- kshitijthakkar/qwen3.5-moe-0.87B-d0.8B
  Image-Text-to-Text • 1B • Updated • 139
- kshitijthakkar/qwen3.5-moe-2.3B-d2B
  Image-Text-to-Text • 3B • Updated • 75
- kshitijthakkar/qwen3.5-moe-4.7B-d4B
  Image-Text-to-Text • 5B • Updated • 84
- kshitijthakkar/qwen3.5-tiny-test
  Image-Text-to-Text • 0.1B • Updated • 286
Mobile MoE Architecture Search
32 MoE models from 41 experiments exploring expert count, routing, and learning rates for mobile deployment.
TraceMind-AI
Collection of my submissions to MCP-1st-Birthday: the TraceMind Agent and MCP Server, plus the smoltrace datasets generated for running evals with smoltrace.
- TraceMind MCP Server (Running) • 🤖 10
  MCP server for agent evaluation with Gemini 2.5 Flash
- TraceMind AI (Running) • 🧠 21
  AI agent evaluation with MCP-powered intelligence
- MCP-1st-Birthday/smoltrace-recruitment-tasks
  Viewer • Updated • 101 • 27
- MCP-1st-Birthday/smoltrace-smart-home-tasks
  Viewer • Updated • 100 • 10
mcp-server-bench
A collection of benchmarking results comparing Gradio and FastMCP.