spectacle's picture

spectacle

spectaclecs

·

spectaclecs

AI & ML interests

Multimodal LLM, Agent

Recent Activity

upvoted a paper 10 days ago

Grammar-Constrained Decoding Can Jailbreak LLMs into Generating Malicious Code

upvoted a paper 25 days ago

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation

liked a dataset about 1 month ago

Kun-Xiang/PhysRL

View all activity

Organizations

upvoted a paper 10 days ago

Grammar-Constrained Decoding Can Jailbreak LLMs into Generating Malicious Code

Paper • 2606.11817 • Published 16 days ago • 18

upvoted a paper 25 days ago

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation

Paper • 2605.31264 • Published 28 days ago • 118

upvoted a collection 2 months ago

Qwen3-VL

37 items • Updated Dec 31, 2025 • 747

upvoted 2 papers 3 months ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 352

MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

Paper • 2603.22458 • Published Mar 23 • 138

upvoted a paper 4 months ago

BitDance: Scaling Autoregressive Generative Models with Binary Tokens

Paper • 2602.14041 • Published Mar 13 • 55

upvoted a collection 4 months ago

Qwen3-Next

4 items • Updated Dec 31, 2025 • 187

upvoted a paper 5 months ago

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

Paper • 2602.09443 • Published Feb 10 • 59

upvoted a collection 5 months ago

Qwen3

84 items • Updated Dec 31, 2025 • 1.82k

upvoted a paper 5 months ago

Training-Free Group Relative Policy Optimization

Paper • 2510.08191 • Published Oct 9, 2025 • 46

upvoted a collection 6 months ago

DeepSeek-V3.2

4 items • Updated Dec 1, 2025 • 544

upvoted 4 papers 7 months ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 269

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Paper • 2412.10302 • Published Dec 13, 2024 • 24

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

Paper • 2511.22570 • Published Nov 27, 2025 • 96

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published Nov 17, 2025 • 135

upvoted a collection 8 months ago

CapRL

Data & Models for CapRL1.0 series &2.0 series • 14 items • Updated 16 days ago • 6

upvoted a paper 11 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18, 2025 • 146

upvoted an article over 1 year ago

Article

From PyTorch DDP to Accelerate to Trainer, mastery of distributed training with ease

muellerzr

•

Oct 21, 2022

• 44

upvoted a collection over 1 year ago

Emu3

Emu3: Next-Token Prediction is All You Need • 7 items • Updated Feb 4 • 81

upvoted an article almost 2 years ago

Article

How to generate text: using different decoding methods for language generation with Transformers

patrickvonplaten

•

Mar 1, 2020

• 301