SII-jzhao

3 10 2

AI & ML interests

SII is an institution dedicated to innovation in education and research in the field of AI.

Recent Activity

new activity 21 days ago

nvidia/LocateAnything-3B:About Dataset release

upvoted a paper about 2 months ago

ACC: Compiling Agent Trajectories for Long-Context Training

new activity about 2 months ago

HelloKKMe/grounding_dataset:Improve dataset card: Add metadata, links, abstract, and sample usage

View all activity

Organizations

None yet

New activity in nvidia/LocateAnything-3B 21 days ago

About Dataset release

#22 opened 21 days ago by

SII-jzhao

upvoted a paper about 2 months ago

ACC: Compiling Agent Trajectories for Long-Context Training

Paper • 2605.21850 • Published May 21 • 60

New activity in HelloKKMe/grounding_dataset about 2 months ago

Improve dataset card: Add metadata, links, abstract, and sample usage

#3 opened 12 months ago by

nielsr

upvoted a paper 3 months ago

CiQi-Agent: Aligning Vision, Tools and Aesthetics in Multimodal Agent for Cultural Reasoning on Chinese Porcelains

Paper • 2603.28474 • Published Mar 30 • 9

New activity in stepfun-ai/Step-3.5-Flash-SFT 4 months ago

The use of open-source datasets

👍 2

#9 opened 4 months ago by

Ken0102030405

upvoted a paper 4 months ago

RetroAgent: From Solving to Evolving via Retrospective Dual Intrinsic Feedback

Paper • 2603.08561 • Published Mar 9 • 12

upvoted a collection 5 months ago

💧 LFM2.5

Collection

Collection of post-trained and base LFM2.5 models. • 14 items • Updated 12 days ago • 172

liked a model 6 months ago

openai/gpt-oss-20b

Text Generation • 22B • Updated Aug 26, 2025 • 7.01M • • 4.77k

upvoted a paper 9 months ago

RLFR: Extending Reinforcement Learning for LLMs with Flow Environment

Paper • 2510.10201 • Published Oct 11, 2025 • 36

upvoted an article about 1 year ago

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

ariG23498, lusxvr, andito, sergiopaniego, merve, pcuenq, reach-vb

•

May 21, 2025

• 262

liked a model about 1 year ago

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 8.59M • • 13.4k

upvoted 2 articles over 1 year ago

Article

From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate

mirinflim, aldopareja, muellerzr, stas

•

Jun 13, 2024

• 62

Article

From PyTorch DDP to Accelerate to Trainer, mastery of distributed training with ease

muellerzr

•

Oct 21, 2022

• 44

upvoted a paper over 1 year ago

CoRe^2: Collect, Reflect and Refine to Generate Better and Faster

Paper • 2503.09662 • Published Mar 12, 2025 • 33

upvoted a collection over 1 year ago

SmolLM2

Collection

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5, 2025 • 309

SII-jzhao

AI & ML interests

Recent Activity

Organizations

SII-jzhao's activity

About Dataset release

Improve dataset card: Add metadata, links, abstract, and sample usage

The use of open-source datasets

nanoVLM: The simplest repository to train your VLM in pure PyTorch

From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate

From PyTorch DDP to Accelerate to Trainer, mastery of distributed training with ease