Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
marcos gabriel's picture
2 1 1

marcos gabriel

marcosstable
·

AI & ML interests

None yet

Recent Activity

new activity 19 days ago
Tesslate/OmniCoder-9B-GGUF:Will there be a version based on the new QWEN 3.6?
liked a model 2 months ago
fal/Qwen-Image-Edit-2511-Multiple-Angles-LoRA
reacted to OzTianlu's post with 🤗 2 months ago
Scaling UP in Kai! 🌊 https://huggingface.co/NoesisLab/Kai-3B-Instruct Introducing NoesisLab/Kai-3B-Instruct What happens when you force a 3B model to reason entirely in its latent space ? Meet Kai-3B, our latest industrial-grade reasoning model fine-tuned using the Adaptive Dual Search (ADS) algorithm. GSM8K (0-shot, Direct Answer): 39.27% 🤯 (Llama-2-7B is ~14.6%) HumanEval (Pass@1): 39.02% 💻 (Overtakes Gemma-2-2B's 30%) MMLU (5-shot): 53.62% 📚 (Crushing the 50% barrier) ARC-Challenge: 51.88%🎯 PIQA: 77.53% HellaSwag: 69.53% Kai-3B proves that reasoning density doesn't strictly require parameter bloat or verbose generation. It acts as a perfect, cold-blooded Agent action-engine—ideal for JSON routing, SWE-bench patch generation, and anywhere you need absolute structured certainty without token waste.
View all activity

Organizations

None yet

marcosstable 's datasets

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs