hyeong's picture

hyeong

kimhyeongjun

·

https://hypro2.github.io/

hypro2

AI & ML interests

None yet

Recent Activity

liked a model 5 days ago

baidu/Unlimited-OCR

liked a model 9 days ago

microsoft/FastContext-1.0-4B-SFT

liked a model 14 days ago

openai/privacy-filter

View all activity

Organizations

upvoted a collection 3 months ago

Open-AgentRL

RLAnything & DemyAgent: Open-Source RL for LLMs and Agentic Scenarios • 12 items • Updated Feb 3 • 7

upvoted a collection 8 months ago

Cerebras REAP

Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 30 items • Updated Feb 25 • 150

upvoted an article 9 months ago

Article

PP-OCRv5 on Hugging Face: A Specialized Approach to OCR

baidu

•

Sep 10, 2025

• 112

upvoted a collection 9 months ago

mmBERT: a modern multilingual encoder

mmBERT is trained on 3T tokens from over 1800 languages, showing SoTA scores on benchmarks and exceptional low-resource performance • 17 items • Updated 12 days ago • 54

upvoted an article 10 months ago

Article

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

+3

smohammadi, siro1, winglian, marcsun13, djsaunde

•

Aug 8, 2025

• 99

upvoted 3 collections about 1 year ago

Qwen3

84 items • Updated Dec 31, 2025 • 1.82k

Llama 4

Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth! • 15 items • Updated 13 days ago • 56

Llama 4

Llama 4 release • 13 items • Updated Apr 29, 2025 • 739

upvoted a paper over 1 year ago

On Teacher Hacking in Language Model Distillation

Paper • 2502.02671 • Published Feb 4, 2025 • 19

upvoted 2 collections over 1 year ago

DeepSeek-R1

10 items • Updated Nov 27, 2025 • 854

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19, 2025 • 190

upvoted an article over 1 year ago

Article

Improving Hugging Face Training Efficiency Through Packing with Flash Attention 2

+4

RQlee, ArthurZ, achikundu, lwtr, rganti, mayank-mishra

•

Aug 21, 2024

• 41

upvoted 4 collections almost 2 years ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 43 items • Updated Mar 2 • 728

The Big Benchmarks Collection

Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) • 13 items • Updated Nov 18, 2024 • 267

Standard-format-preference-dataset

We collect the open-source datasets and process them into the standard format. • 12 items • Updated Mar 2 • 26

FP8 LLMs for vLLM

Accurate FP8 quantized models by Neural Magic, ready for use with vLLM! • 42 items • Updated Mar 2 • 81

upvoted a collection about 2 years ago

Korean Datasets I've released so far.

지금까지 업로드한 한국어 데이터셋 콜렉션입니다. • 8 items • Updated May 24, 2024 • 21