JT

jtvino

18 8 47

AI & ML interests

None yet

Recent Activity

new activity 7 days ago

nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-NVFP4:Tool Calling Catastrophic Forgetfulness

new activity 9 days ago

google/diffusiongemma-26B-A4B-it:VLLM setup? Getting malformed responses. What is correct configuration

liked a model 10 days ago

LiquidAI/LFM2.5-350M

View all activity

Organizations

upvoted an article 2 months ago

Article

Granite 4.1 LLMs: How They’re Built

ibm-granite

•

Apr 29

• 83

upvoted a paper 8 months ago

UltraCUA: A Foundation Model for Computer Use Agents with Hybrid Action

Paper • 2510.17790 • Published Oct 20, 2025 • 6

upvoted a paper about 1 year ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6, 2025 • 191

upvoted a collection about 1 year ago

NextCoder

Collection

NextCoder family of code-editing LMs developed with Selective Knowledge Transfer and its training data. • 6 items • Updated Jul 9, 2025 • 79

upvoted 2 articles about 1 year ago

Article

Uncensor any LLM with abliteration

mlabonne

•

Jun 13, 2024

• 870

Article

Custom Vibe Coding Quest Part 2: 🚙 Fine-Tuning Gemma 3 for Code Reasoning

burtenshaw

•

Apr 1, 2025

• 25

upvoted 2 papers over 2 years ago

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 264

3D-LFM: Lifting Foundation Model

Paper • 2312.11894 • Published Dec 19, 2023 • 15

JT

AI & ML interests

Recent Activity

Organizations

jtvino's activity

Granite 4.1 LLMs: How They’re Built

Uncensor any LLM with abliteration

Custom Vibe Coding Quest Part 2: 🚙 Fine-Tuning Gemma 3 for Code Reasoning