Aleksei Dorkin PRO

adorkin

AI & ML interests

Computational Linguistics

Recent Activity

upvoted a collection about 20 hours ago

Luciole LLM

upvoted a changelog about 23 hours ago

Share your feedback with us

liked a dataset about 23 hours ago

utter-project/LongBlocks

View all activity

Organizations

upvoted a collection about 20 hours ago

Luciole LLM

Collection

Open Source LLM in French, English, German, Spanish, Italian, Portuguese, Dutch and Arabic • 9 items • Updated about 23 hours ago • 10

upvoted a changelog about 23 hours ago

Hugging Face Changelog

Share your feedback with us

about 24 hours ago

• 44

upvoted a collection 11 days ago

Latxa Instruct

Collection

Instructing Large Language Models for Low-Resource Languages: A Systematic Study for Basque • 17 items • Updated 1 day ago • 2

upvoted an article 24 days ago

Article

Introducing BERTopic Integration with the Hugging Face Hub

MaartenGr, davanstrien

•

May 31, 2023

• 12

upvoted a changelog 30 days ago

Hugging Face Changelog

Filter Models page by Base Models only

30 days ago

• 171

upvoted 2 changelogs about 1 month ago

Hugging Face Changelog

Copy Repo Contents to Buckets Instantly

May 22

• 87

Hugging Face Changelog

Filter Leaderboards by Model Size

May 20

• 136

upvoted a collection about 2 months ago

NVIDIA Nemotron v3

Collection

Open, Production-ready Enterprise Models • 23 items • Updated 15 days ago • 330

upvoted an article 2 months ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego

•

Mar 10

• 165

upvoted a changelog 2 months ago

Hugging Face Changelog

Introducing Kernels

Apr 15

• 201

upvoted a changelog 3 months ago

Hugging Face Changelog

ZeroGPU overquota

Apr 10

• 153

upvoted a paper 3 months ago

BidirLM: From Text to Omnimodal Bidirectional Encoders by Adapting and Composing Causal LLMs

Paper • 2604.02045 • Published Apr 2 • 38

upvoted a collection 3 months ago

BidirLM-Embedding

Collection

BidirLM is a family of 5 frontier bidirectional encoders, including an omnimodal variant at 2.5B. • 6 items • Updated Apr 7 • 7

upvoted an article 3 months ago

Article

How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs

nielsr

•

Apr 7

• 62

upvoted a collection 3 months ago

Nemotron OCR and Object Detection

Collection

4 items • Updated 15 days ago • 18

upvoted an article 3 months ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift

•

Apr 2

• 910

upvoted a collection 3 months ago

Gemma 4

Collection

15 items • Updated 16 days ago • 994

upvoted a changelog 3 months ago

Hugging Face Changelog

Storage Buckets for Spaces

Mar 31

• 142

upvoted an article 3 months ago

Article

Introducing Cohere-transcribe: state-of-the-art speech recognition

CohereLabs

•

Mar 26

• 46

upvoted a collection 3 months ago

MolmoWeb-Data

Collection

This is the collection of all datasets in MolmoWebMix. • 6 items • Updated Mar 24 • 30

Aleksei Dorkin PRO

AI & ML interests

Recent Activity

Organizations

adorkin's activity

Share your feedback with us

Introducing BERTopic Integration with the Hugging Face Hub

Filter Models page by Base Models only

Copy Repo Contents to Buckets Instantly

Filter Leaderboards by Model Size

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

Introducing Kernels

ZeroGPU overquota

How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs

Welcome Gemma 4: Frontier multimodal intelligence on device

Storage Buckets for Spaces

Introducing Cohere-transcribe: state-of-the-art speech recognition