Kalyan KS PRO

kalyan-ks

AI & ML interests

NLP (LLMs)

Recent Activity

liked a model 3 days ago

LiquidAI/LFM2.5-230M

liked a model 3 days ago

l3cube-pune/IndicGuard

upvoted an article 8 days ago

Beyond LoRA: Can you beat the most popular fine-tuning technique?

View all activity

Organizations

liked 2 models 3 days ago

LiquidAI/LFM2.5-230M

Text Generation • 0.2B • Updated 2 days ago • 12.4k • 137

l3cube-pune/IndicGuard

Text Generation • Updated about 7 hours ago • 9 • 2

upvoted an article 8 days ago

Article

Beyond LoRA: Can you beat the most popular fine-tuning technique?

BenjaminB, sayakpaul, hubnemo, kashif

•

11 days ago

• 64

liked a model 10 days ago

LiquidAI/LFM2.5-ColBERT-350M

upvoted 2 articles 11 days ago

Article

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

aaditya, pminervini, clefourrier

•

Apr 19, 2024

• 202

Article

Let's talk about LLM evaluation

clefourrier

•

May 23, 2024

• 212

liked a model 12 days ago

MoritzLaurer/mDeBERTa-v3-base-xnli-multilingual-nli-2mil7

Zero-Shot Classification • 0.3B • Updated Apr 11, 2024 • 312k • • 376

liked a model 21 days ago

tensorfiend/DotLM-165M

Text Generation • 0.2B • Updated Apr 4 • 5 • 1

liked a dataset 21 days ago

tensorfiend/SimpleThoughts

Viewer • Updated Apr 5 • 391k • 87 • 1

liked a model 21 days ago

principled-intelligence/scope-guard-4B-q-2601

Text Classification • 4B • Updated May 8 • 1.01k • 12

upvoted a paper 22 days ago

Advancing Semantic Caching for LLMs with Domain-Specific Embeddings and Synthetic Data

Paper • 2504.02268 • Published Apr 3, 2025 • 5

liked a Space 22 days ago

Open SLM Leaderboard

🏆

Open Small Language Model Leaderboard

replied to AxionLab-official's post 22 days ago

Good work. Can you share the following details regarding the pretraining of Supra-50M base model?

GPU(s) used for pretraining
Total GPU hours and cost
Cloud platform (GPU) used for pretraining

reacted to AxionLab-official's post with 👍 22 days ago

Post

6604

We're happy to announce that we released a Reasoning tuned version of Supra-50M!

SupraLabs/Supra-50M-Reasoning

liked a model 22 days ago

SupraLabs/Supra-50M-Instruct

Text Generation • 51.8M • Updated 9 days ago • 7.55k • 59

upvoted a collection 22 days ago

Supra-50M

Collection

All Supra-50M models • 9 items • Updated 7 days ago • 5

liked a model 23 days ago

dslim/bert-large-NER

Token Classification • 0.3B • Updated Oct 8, 2024 • 144k • • 163

liked 2 models 24 days ago

LiquidAI/LFM2.5-VL-450M-Extract

Image-Text-to-Text • 0.4B • Updated 23 days ago • 4.56k • 49

google/gemma-4-12B-it

Any-to-Any • 12B • Updated 24 days ago • 2.51M • 1.21k

reacted to danielhanchen's post with 👍 24 days ago

Post

9227

Gemma 4 12B can now run locally on just 8GB RAM via Dynamic GGUFs.

Google's new model, Gemma 4 12B Unified supports image, audio and 256K context.
You can run and train the model via Unsloth Studio.

GGUF: unsloth/gemma-4-12b-it-GGUF
Guide: https://unsloth.ai/docs/models/gemma-4

5 replies

Kalyan KS PRO

AI & ML interests

Recent Activity

Organizations

kalyan-ks's activity

Beyond LoRA: Can you beat the most popular fine-tuning technique?

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

Let's talk about LLM evaluation

Open SLM Leaderboard