Jin

dsjinx

AI & ML interests

None yet

Recent Activity

upvoted an article 22 days ago

Efficient LLM Pretraining: Packed Sequences and Masked Attention

liked a model 2 months ago

Twitter/twhin-bert-base

liked a model 6 months ago

inclusionAI/Ling-1T

Organizations

None yet

upvoted an article 22 days ago

Article

Efficient LLM Pretraining: Packed Sequences and Masked Attention

Oct 7, 2024

liked a model 2 months ago

Twitter/twhin-bert-base

Fill-Mask • 0.3B • Updated Jul 7, 2023 • 70k • 45

liked a model 6 months ago

inclusionAI/Ling-1T

Text Generation • Updated Nov 4, 2025 • 2.14k • 532

upvoted an article 10 months ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Dec 9, 2022 • 407

liked a Space 12 months ago

LLM Embeddings Explained: A Visual and Intuitive Guide

🚀

330

How Language Models Turn Text into Meaning, From Traditional

liked a dataset about 1 year ago

qihoo360/Light-R1-SFTData

Viewer • Updated Mar 17, 2025 • 79.4k • 429 • 60

liked a model about 1 year ago

qihoo360/TinyR1-32B-Preview

Text Generation • 33B • Updated Sep 24, 2025 • 104 • 324

upvoted 2 collections about 1 year ago

TinyR1

Collection

4 items • Updated about 1 month ago • 4

Light-R1

Collection

Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond • 7 items • Updated Oct 15, 2025 • 12

upvoted an article about 1 year ago

Article

Open R1: Update #2

Feb 10, 2025 • 218

liked a Space about 1 year ago

README

📈

upvoted an article about 1 year ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28, 2025 • 888

liked a Space about 1 year ago

MTEB Leaderboard

🥇

7.2k

Embedding Leaderboard

liked 2 models about 1 year ago

bespokelabs/Bespoke-Stratos-32B

Text Generation • 33B • Updated Jan 24, 2025 • 25 • 44

whyhow-ai/PatientSeek

Question Answering • 8B • Updated Jan 27, 2025 • 4 • 71
