kenyano (Ken Yano)

liked 2 datasets about 1 year ago

eriktks/conll2003

Updated Jan 18, 2024 • 25.4k • 171

DFKI-SLT/conll04

Viewer • Updated Jun 7, 2024 • 1.44k • 393 • 4

liked 2 models about 1 year ago

nari-labs/Dia-1.6B

Text-to-Speech • 2B • Updated Jun 1, 2025 • 3.32k • • 2.89k

MohamedRashad/arabic-small-nougat

Image-to-Text • 0.2B • Updated Nov 28, 2024 • 87 • 26

liked a dataset about 1 year ago

Anthropic/hh-rlhf

Viewer • Updated May 26, 2023 • 169k • 30.8k • 1.82k

liked a Space about 1 year ago

The Ultra-Scale Playbook

🌌

3.93k

The ultimate guide to training LLM on large GPU Clusters

liked 6 datasets about 1 year ago

liked a model about 1 year ago

google/gemma-3-12b-it

Image-Text-to-Text • 12B • Updated Mar 21, 2025 • 1.29M • • 778

liked a dataset over 1 year ago

HuggingFaceFW/fineweb-edu

Viewer • Updated Jul 11, 2025 • 3.5B • 380k • 1.19k

liked a Space over 1 year ago

FineWeb: decanting the web for the finest text data at scale

🍷

1.38k

Explore and download the FineWeb web‑scale text dataset

liked 5 datasets over 1 year ago

HuggingFaceFW/fineweb-2

Viewer • Updated Oct 27, 2025 • 4.48B • 94.1k • 832

lmsys/lmsys-chat-1m

Viewer • Updated Jul 27, 2024 • 1M • 6.15k • 937

toloka/mu-math

Viewer • Updated Jan 30 • 1.08k • 44 • 24

openai/gsm8k

Benchmark • Updated Mar 23 • 17.6k • 977k • 1.44k

csebuetnlp/xlsum

Updated Apr 18, 2023 • 23k • 153

Ken Yano

AI & ML interests

Organizations

eriktks/conll2003

DFKI-SLT/conll04

nari-labs/Dia-1.6B

MohamedRashad/arabic-small-nougat

Anthropic/hh-rlhf

The Ultra-Scale Playbook

teknium/OpenHermes-2.5

openbmb/UltraChat

HuggingFaceFW/fineweb

bigcode/the-stack-v2

HuggingFaceTB/cosmopedia

allenai/WildChat-1M

google/gemma-3-12b-it

HuggingFaceFW/fineweb-edu

FineWeb: decanting the web for the finest text data at scale

HuggingFaceFW/fineweb-2

lmsys/lmsys-chat-1m

toloka/mu-math

openai/gsm8k

csebuetnlp/xlsum

Ken Yano

AI & ML interests

Organizations

kenyano's activity

The Ultra-Scale Playbook

FineWeb: decanting the web for the finest text data at scale