GRM2, the small one that surpasses the big ones. What if a 3-billion-parameter model could beat a 32-billion-parameter model on every benchmark? We show that it can. GRM2 is a 3B-parameter model based on the Llama architecture, trained for long reasoning and high performance on complex tasks. It is the first 3B-parameter model to outperform Qwen3-32B on ALL benchmarks, and it outperforms o3-mini on almost all of them. 🤗 Model: OrionLLM/GRM2-3b. It is also the first 3B-parameter model to generate over 1,000 lines of code and to score 39.0 on xBench-DeepSearch-2510.
Managing 16 different machine learning pipelines (from Expected Goals to Space Creation) across Databricks Serverless and HF Jobs is a logistical challenge. To solve this, we built a dynamic operations center (the 13th page in our app).
It features:
   • An interactive dependency DAG: Powered by Cytoscape.js, it visually maps exactly how our models and data grids feed into each other.
   • Real-time monitoring: Tracks run volumes and data-freshness SLAs across the entire platform.
   • A 3-tier hybrid cost engine: Merges "cold" Databricks billing data with "warm/hot" live HF Jobs estimates to give a unified view of pipeline expenses.
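The cost-engine idea above can be sketched in a few lines. This is a minimal, hypothetical illustration (the record names, pipeline keys, and dollar figures are invented, not from our actual system): settled "cold" billing rows and in-flight "warm/hot" estimates are keyed by pipeline name and merged into one per-pipeline view.

```python
# Hypothetical inputs: "cold" = finalized Databricks billing per pipeline,
# "warm" = live HF Jobs cost estimates for runs still in flight.
cold_billing = {"expected_goals": 12.40, "space_creation": 7.10}
live_estimates = {"expected_goals": 0.85, "pass_networks": 1.20}

def unified_costs(cold: dict, warm: dict) -> dict:
    """Merge settled billing with live estimates into one view per pipeline."""
    merged = {}
    for name in sorted(set(cold) | set(warm)):
        billed = cold.get(name, 0.0)      # settled spend (may be absent)
        estimated = warm.get(name, 0.0)   # live estimate (may be absent)
        merged[name] = {
            "billed": billed,
            "estimated": estimated,
            "total": billed + estimated,
        }
    return merged

view = unified_costs(cold_billing, live_estimates)
```

The key design point is that neither source is authoritative alone: billing lags behind reality, and estimates are noisy, so the unified view keeps both columns side by side rather than collapsing them early.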