jasonjiang

mikinyaa

jasonjiang8866

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper about 6 hours ago

Unlimited OCR Works

Paper • 2606.23050 • Published 3 days ago • 23

liked a dataset 9 days ago

lazarus19/Vibe-Coding-Instruct

Viewer • Updated 6 days ago • 1.1M • 1.87k • 157

liked a model 12 days ago

Jackrong/Qwopus3.6-27B-Coder-MTP-GGUF

Text Generation • 0.5B • Updated about 4 hours ago • 245k • 289

upvoted a paper 15 days ago

FlashMemory-DeepSeek-V4: Lightning Index Ultra-Long Context via Lookahead Sparse Attention

Paper • 2606.09079 • Published 17 days ago • 62

upvoted a paper 16 days ago

When Tools Fail: Benchmarking Dynamic Replanning and Anomaly Recovery in LLM Agents

Paper • 2606.05806 • Published 21 days ago • 23

upvoted an article 20 days ago

How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

Article • nvidia • 20 days ago • 64

upvoted 6 papers about 1 month ago

Self-Distilled Agentic Reinforcement Learning

Paper • 2605.15155 • Published May 14 • 115

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

Paper • 2605.15178 • Published May 14 • 91

AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation

Paper • 2605.13724 • Published May 13 • 105

RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards

Paper • 2605.10899 • Published May 11 • 79

δ-mem: Efficient Online Memory for Large Language Models

Paper • 2605.12357 • Published May 12 • 131

Qwen-Image-2.0 Technical Report

Paper • 2605.10730 • Published May 11 • 114

upvoted 8 papers about 2 months ago

Audio-Visual Intelligence in Large Foundation Models

Paper • 2605.04045 • Published May 5 • 35

Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL

Paper • 2604.28123 • Published May 1 • 49

OceanPile: A Large-Scale Multimodal Ocean Corpus for Foundation Models

Paper • 2605.00877 • Published Apr 25 • 15

Web2BigTable: A Bi-Level Multi-Agent LLM System for Internet-Scale Information Search and Extraction

Paper • 2604.27221 • Published Apr 29 • 40

Stochastic KV Routing: Enabling Adaptive Depth-Wise Cache Sharing

Paper • 2604.22782 • Published Apr 3 • 8

Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows

Paper • 2604.28139 • Published Apr 30 • 42

Mutual Forcing: Dual-Mode Self-Evolution for Fast Autoregressive Audio-Video Character Generation

Paper • 2604.25819 • Published Apr 28 • 17

SketchVLM: Vision language models can annotate images to explain thoughts and guide users

Paper • 2604.22875 • Published Apr 23 • 38
