lack

Hosseinlack123

1 18 203

AI & ML interests

None yet

Recent Activity

liked a model about 4 hours ago

fishaudio/s2-pro

liked a model 2 days ago

hexgrad/Kokoro-82M

liked a model 21 days ago

Thomcles/Chatterbox-TTS-Persian-Farsi

View all activity

Organizations

None yet

upvoted a collection 23 days ago

Nemotron-Post-Training-v3

Collection

Collection of datasets used in the post-training phase of Nemotron Nano, Super, and Ultra v3. • 50 items • Updated 18 days ago • 168

upvoted a paper 4 months ago

Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts

Paper • 2602.13367 • Published Feb 13 • 36

upvoted 2 articles 4 months ago

Article

Building an AI-powered search engine from scratch

as-cle-bert

•

Dec 12, 2024

• 12

Article

Search the Web with AI

as-cle-bert

•

Jan 10, 2025

• 6

upvoted an article 5 months ago

Article

Uncensor any LLM with abliteration

mlabonne

•

Jun 13, 2024

• 868

upvoted a paper 5 months ago

Quantization-Aware Distillation for NVFP4 Inference Accuracy Recovery

Paper • 2601.20088 • Published Jan 27 • 4

upvoted an article 5 months ago

Article

AutoThink: Adaptive Reasoning for Large Language Models

codelion

•

May 27, 2025

• 8

upvoted a collection 6 months ago

Dataset Mix for Pre-Training SLMs

Collection

11 items • Updated Mar 25, 2025 • 2

upvoted 3 articles 6 months ago

Article

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

codelion

•

Nov 3, 2025

• 65

Article

The Optimal Architecture for Small Language Models

codelion

•

Dec 26, 2025

• 121

Article

SmolLM3: smol, multilingual, long-context reasoner

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf

•

Jul 8, 2025

• 780

upvoted 2 collections 7 months ago

Awesome SFT datasets

Collection

A curated list of interesting datasets to fine-tune language models with. • 41 items • Updated Mar 2 • 154

Ministral 3

Collection

A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated Dec 2, 2025 • 169

upvoted an article 7 months ago

Article

SmolLM - blazingly fast and remarkably powerful

loubnabnl, anton-l, eliebak

•

Jul 16, 2024

• 460

upvoted a paper 7 months ago

Craw4LLM: Efficient Web Crawling for LLM Pretraining

Paper • 2502.13347 • Published Feb 19, 2025 • 30

upvoted an article 9 months ago

Article

Releasing Common Corpus: the largest public domain dataset for training LLMs

Pclanglais

•

Mar 20, 2024

• 34

upvoted a paper 9 months ago

Essential-Web v1.0: 24T tokens of organized web data

Paper • 2506.14111 • Published Jun 17, 2025 • 48

upvoted an article 9 months ago

Article

🥬 TinyLettuce: Efficient Hallucination Detection with 17–68M Encoders

adaamko

•

Aug 31, 2025

• 16

lack

AI & ML interests

Recent Activity

Organizations

Hosseinlack123's activity

Building an AI-powered search engine from scratch

Search the Web with AI

Uncensor any LLM with abliteration

AutoThink: Adaptive Reasoning for Large Language Models

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

The Optimal Architecture for Small Language Models

SmolLM3: smol, multilingual, long-context reasoner

SmolLM - blazingly fast and remarkably powerful

Releasing Common Corpus: the largest public domain dataset for training LLMs

🥬 TinyLettuce: Efficient Hallucination Detection with 17–68M Encoders