Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
marcos gabriel's picture
2 1 1

marcos gabriel

marcosstable
·

AI & ML interests

None yet

Recent Activity

new activity 19 days ago
Tesslate/OmniCoder-9B-GGUF:Will there be a version based on the new QWEN 3.6?
liked a model 2 months ago
fal/Qwen-Image-Edit-2511-Multiple-Angles-LoRA
reacted to OzTianlu's post with 🤗 2 months ago
Scaling UP in Kai! 🌊 https://huggingface.co/NoesisLab/Kai-3B-Instruct Introducing NoesisLab/Kai-3B-Instruct What happens when you force a 3B model to reason entirely in its latent space ? Meet Kai-3B, our latest industrial-grade reasoning model fine-tuned using the Adaptive Dual Search (ADS) algorithm. GSM8K (0-shot, Direct Answer): 39.27% 🤯 (Llama-2-7B is ~14.6%) HumanEval (Pass@1): 39.02% 💻 (Overtakes Gemma-2-2B's 30%) MMLU (5-shot): 53.62% 📚 (Crushing the 50% barrier) ARC-Challenge: 51.88%🎯 PIQA: 77.53% HellaSwag: 69.53% Kai-3B proves that reasoning density doesn't strictly require parameter bloat or verbose generation. It acts as a perfect, cold-blooded Agent action-engine—ideal for JSON routing, SWE-bench patch generation, and anywhere you need absolute structured certainty without token waste.
View all activity

Organizations

None yet

models 1

marcosstable/SmolLM2-FT-MyDataset

Text Generation • 0.1B • Updated Jun 7, 2025 • 2

datasets 0

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs