🏗️ Building on HF

Sthenno PRO

sthenno · neoheartbeats

AI & ML interests

To contact me: sthenno@sthenno.com.

Recent Activity

liked a model 6 days ago

internlm/Intern-S2-Preview

liked a model 10 days ago

nvidia/diffusiongemma-26B-A4B-it-NVFP4

liked a model 10 days ago

zai-org/SCAIL-2

upvoted a paper about 1 month ago

Qwen-Scope: Turning Sparse Features into Development Tools for Large Language Models

Paper • 2605.11887 • Published May 12 • 17

upvoted a collection 3 months ago

Scalably Extracting Latent Representations of Users

Models and datasets for "Scalably Extracting Latent Representations of Users" • 10 items • Updated Mar 9 • 1

upvoted a paper 3 months ago

QuantLRM: Quantization of Large Reasoning Models via Fine-Tuning Signals

Paper • 2602.02581 • Published Jan 31 • 10

upvoted a paper 4 months ago

Functionality-Oriented LLM Merging on the Fisher–Rao Manifold

Paper • 2603.04972 • Published Mar 5 • 3

upvoted a collection 5 months ago

Qwen3-TTS

7 items • Updated Jan 22 • 367

upvoted a collection 8 months ago

GPT-2 models fine-tuned on tasks from GLUE Benchmark

if you find these models helpful, consider citing [our paper](https://arxiv.org/abs/2406.03280) • 7 items • Updated Aug 27, 2024 • 3

upvoted a paper 8 months ago

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29, 2025 • 233

upvoted a collection 8 months ago

story writing favourites

Models I personally liked for generating stories in the past. Not a recommendation, most of these are outdated. • 19 items • Updated 12 days ago • 117

upvoted a collection 9 months ago

Qwen3-Omni

6 items • Updated Dec 31, 2025 • 204

upvoted a paper 11 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 320

upvoted a collection about 1 year ago

Qwen3

84 items • Updated Dec 31, 2025 • 1.82k

upvoted 3 collections over 1 year ago

Synthetic Data Generation

SDG papers • 86 items • Updated Jul 11, 2025 • 15

Unsloth 4-bit Dynamic Quants

Unsloth's Dynamic 4-bit Quants selectively skip quantizing certain parameters, greatly improving accuracy while using <10% more VRAM than BnB 4-bit • 28 items • Updated 11 days ago • 97

miscii-14b-dev

Known stable releases of the miscii-1020 based models • 3 items • Updated Mar 2 • 2