Ave's picture

Ave

oxuwu

·

oxxuwu
oxuwu

AI & ML interests

None yet

Recent Activity

new activity 29 days ago

stabilityai/stable-audio-3:Having problems with the library in the colab notebook

liked a Space 29 days ago

stabilityai/stable-audio-3

upvoted a collection 29 days ago

View all activity

Organizations

upvoted a collection 29 days ago

Stable Audio 3

Stable Audio 3 Post-trained models • 3 items • Updated May 20 • 41

upvoted a collection about 1 month ago

Qwen3-TTS

7 items • Updated Jan 22 • 367

upvoted a paper 4 months ago

Mercury: Ultra-Fast Language Models Based on Diffusion

Paper • 2506.17298 • Published Jun 17, 2025 • 11

upvoted a collection 4 months ago

Datasets - CoT Synthetic Reasoning

8 items • Updated May 14, 2025 • 2

upvoted an article 5 months ago

Article

BigCodeArena: Judging code generations end to end with code executions

bigcode

•

Oct 7, 2025

• 21

upvoted 2 collections 6 months ago

Qwen3

84 items • Updated Dec 31, 2025 • 1.82k

Nemotron-Pre-Training-Datasets

Large scale pre-training datasets used in the Nemotron family of models. • 15 items • Updated 15 days ago • 172

upvoted a paper 6 months ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Paper • 2508.14444 • Published Aug 20, 2025 • 51

upvoted 2 collections 7 months ago

Deepseek v3.2 Speciale

Distilled models and datasets for Deepseek v3.2 Speciale. • 11 items • Updated Dec 20, 2025 • 8

Gemini 3 Pro

Distilled models and datasets for Gemini 3 Pro. • 9 items • Updated Dec 20, 2025 • 7

upvoted an article 7 months ago

Article

Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset

+1

HugoLaurencon, Leyo, VictorSanh

•

Mar 15, 2024

• 13

upvoted 2 collections 7 months ago

Reasoning datasets

24 items • Updated May 22, 2025 • 11

Qwen3-VL

37 items • Updated Dec 31, 2025 • 747

upvoted a paper 7 months ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 269

upvoted a collection 7 months ago

GPT-4 generated datasets

Collection of some GPT-4 generated datasets. It may be useful for those looking for the best-quality datasets to train competitive LLMs. • 18 items • Updated Apr 16, 2024 • 10

upvoted a paper 10 months ago

Hierarchical Reasoning Model

Paper • 2506.21734 • Published Jun 26, 2025 • 54

upvoted a collection 10 months ago

DINOv3

DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 15 items • Updated Mar 10 • 674

upvoted a collection 12 months ago

Gemma 3n

4 items • Updated Mar 12 • 272

upvoted an article about 1 year ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

+1

merve, andsteing, pcuenq

•

May 14, 2024

• 287

upvoted a paper about 1 year ago

Textbooks Are All You Need

Paper • 2306.11644 • Published Jun 20, 2023 • 158