kevineen

AI & ML interests

None yet

Recent Activity

upvoted an article 3 days ago

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

liked a model 7 days ago

llm-jp/layoutlmv3-japanese-preview

liked a model 7 days ago

llm-jp/Jagle-VL-2.2B-Jagle-FineVision

Organizations

upvoted an article 3 days ago

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

Article • danaaubakirova, andito, merve, ariG23498, fracapuano, loubnabnl, pcuenq, mshukor, cadene • Jun 3, 2025 • 356

upvoted a paper 26 days ago

QUEST: Training Frontier Deep Research Agents with Fully Synthetic Tasks

Paper • 2605.24218 • Published May 22 • 46

upvoted an article 28 days ago

PaddleOCR 3.5: Running OCR and Document Parsing Tasks with a Transformers Backend

Article • PaddlePaddle • May 18 • 37

upvoted a collection about 1 month ago

Ettin Rerankers

Collection • 8 items • Updated May 19 • 8

upvoted an article about 1 month ago

SSE Retrieval MRL v2: Regularization of Representation Space and Performance Improvement via Hyperparameter Optimization

Article • RikkaBotan • May 13 • 2

upvoted 2 papers about 2 months ago

RLDX-1 Technical Report

Paper • 2605.03269 • Published May 5 • 126

MolmoAct2: Action Reasoning Models for Real-world Deployment

Paper • 2605.02881 • Published May 4 • 355

upvoted an article about 2 months ago

Magpie Speech: Applying an LLM Data Synthesis Method to an LLM-Based TTS Model to Synthesize a Speech Dataset

Article • Aratako • Aug 14, 2025 • 13

upvoted a paper 2 months ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2, 2025 • 161

upvoted a paper 3 months ago

daVinci-LLM: Towards the Science of Pretraining

Paper • 2603.27164 • Published Mar 28 • 32

upvoted a paper 4 months ago

Search More, Think Less: Rethinking Long-Horizon Agentic Search for Efficiency and Generalization

Paper • 2602.22675 • Published Feb 26 • 23

upvoted 2 articles 4 months ago

Mixture of Experts (MoEs) in Transformers

Article • ariG23498, pcuenq, merve, IlyasMoutawwakil, ArthurZ, sergiopaniego, Molbap • Feb 26 • 169

Making LLMs Smaller Without Breaking Them: A GLU-Aware Pruning Approach

Article • oopere • Nov 24, 2024 • 20

upvoted a collection 4 months ago

GPT-OSS-Swallow-v0.1

Collection • 6 items • Updated 30 days ago • 13

upvoted 3 papers 5 months ago

SWE-Universe: Scale Real-World Verifiable Environments to Millions

Paper • 2602.02361 • Published Feb 2 • 61

Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow

Paper • 2601.14243 • Published Jan 20 • 24

Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision

Paper • 2601.19798 • Published Jan 27 • 44

upvoted an article 5 months ago

Introducing Waypoint-1: Real-time interactive video diffusion from Overworld

Article • lapp0, LouisCastricato, ScottieFox, shahbuland, xAesthetics • Jan 20 • 43

upvoted a paper 5 months ago

The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models

Paper • 2601.15165 • Published Jan 21 • 75

upvoted a collection 5 months ago

TranslateGemma

Collection • 3 items • Updated Mar 12 • 245
