Aleksei Dorkin PRO
adorkin
AI & ML interests
Computational Linguistics
Recent Activity
liked a model 8 days ago: LiquidAI/LFM2-ColBERT-350M
upvoted a collection 9 days ago: Latxa Instruct
liked a model 9 days ago: HiTZ/Latxa-Llama-3.1-70B-Instruct-v2
Math Datasets
Code SFT Datasets
Reward Models
- nvidia/Llama-3.3-Nemotron-70B-Reward-Multilingual
  Text Generation • 71B • Updated • 70 • 10
- nvidia/Llama-3.3-Nemotron-70B-Reward-Principle
  Text Generation • 71B • Updated • 665 • 7
- nvidia/Qwen-3-Nemotron-32B-Reward
  Text Classification • 32B • Updated • 26 • 20
- Skywork/Skywork-Reward-V2-Llama-3.1-8B
  Text Classification • 8B • Updated • 16.9k • 47
My Shared Task Papers
- TartuNLP @ SIGTYP 2024 Shared Task: Adapting XLM-RoBERTa for Ancient and Historical Languages
  Paper • 2404.12845 • Published
- TartuNLP at EvaLatin 2024: Emotion Polarity Detection
  Paper • 2405.01159 • Published
- TartuNLP @ AXOLOTL-24: Leveraging Classifier Output for New Sense Detection in Lexical Semantics
  Paper • 2407.03861 • Published
- TartuNLP at SemEval-2025 Task 5: Subject Tagging as Two-Stage Information Retrieval
  Paper • 2504.21547 • Published
Multilingual Text Embedding Models
- tencent/KaLM-Embedding-Gemma3-12B-2511
  Sentence Similarity • 12B • Updated • 9.32k • 97
- nvidia/llama-embed-nemotron-8b
  Feature Extraction • 8B • Updated • 42k • 167
- Qwen/Qwen3-Embedding-8B
  Feature Extraction • 8B • Updated • 2.43M • 718
- Qwen/Qwen3-Embedding-4B
  Feature Extraction • 4B • Updated • 2.39M • 291
Code RL Datasets
Multi-Turn Chat
Llama 3(.1) 8B Finetunes
- nvidia/Llama-3.1-Nemotron-Nano-8B-v1
  Text Generation • 8B • Updated • 16.9k • 221
- nvidia/llama-3.1-nemoguard-8b-topic-control
  Text Classification • Updated • 2.33k • 18
- nvidia/llama-3.1-nemoguard-8b-content-safety
  Text Classification • Updated • 1.69k • 35
- nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1
  Image-Text-to-Text • 9B • Updated • 1.19M • 181
Multilingual Text Encoders