31 88

Lucas Rose-Winters

paperboygold

https://www.sanguinehost.com

paperboygold

AI & ML interests

None yet

Recent Activity

liked a model 8 days ago

Mia-AiLab/Qwable-3.6-35b

liked a model 11 days ago

Qwen/Qwen3.6-35B-A3B-FP8

upvoted a collection 11 days ago

Qwen3.5

View all activity

Organizations

upvoted a collection 11 days ago

Qwen3.5

Collection

21 items • Updated Mar 9 • 1.69k

upvoted a paper 4 months ago

Efficient Memory Management for Large Language Model Serving with PagedAttention

Paper • 2309.06180 • Published Sep 12, 2023 • 60

upvoted 2 collections 4 months ago

Nemotron-Post-Training-v3

Collection

Collection of datasets used in the post-training phase of Nemotron Nano, Super, and Ultra v3. • 50 items • Updated 15 days ago • 168

Dark / Evil / NSFW Reasoning Models (gguf/source)

Collection

Models that are dark/evil/corrupt (and many times NSFW!) to begin with then I add reasoning/thinking to them to make them even... ahh... better. • 134 items • Updated 5 days ago • 180

upvoted 4 papers 4 months ago

HeartMuLa: A Family of Open Sourced Music Foundation Models

Paper • 2601.10547 • Published Jan 15 • 49

BitDance: Scaling Autoregressive Generative Models with Binary Tokens

Paper • 2602.14041 • Published Mar 13 • 55

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published Mar 14, 2025 • 164

A decoder-only foundation model for time-series forecasting

Paper • 2310.10688 • Published Oct 14, 2023 • 37

upvoted 2 papers 8 months ago

InteractComp: Evaluating Search Agents With Ambiguous Queries

Paper • 2510.24668 • Published Oct 28, 2025 • 100

DeepAgent: A General Reasoning Agent with Scalable Toolsets

Paper • 2510.21618 • Published Oct 24, 2025 • 103

upvoted an article 8 months ago

Article

Australian-made LLM beats OpenAI and Google at legal retrieval

isaacus

•

Oct 23, 2025

• 28

upvoted 4 papers 9 months ago

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Paper • 2509.24002 • Published Sep 28, 2025 • 180

LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published Sep 26, 2025 • 189

VCRL: Variance-based Curriculum Reinforcement Learning for Large Language Models

Paper • 2509.19803 • Published Sep 24, 2025 • 122

Video models are zero-shot learners and reasoners

Paper • 2509.20328 • Published Sep 24, 2025 • 101

upvoted an article over 1 year ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

eliebak, lvwerra, lewtun

•

Jan 28, 2025

• 889

upvoted 2 articles about 2 years ago

Article

Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset

HugoLaurencon, Leyo, VictorSanh

•

Mar 15, 2024

• 13

Article

Hugging Face partners with Wiz Research to Improve AI Security

JJoe206, GuillaumeSalouHF, michellehbn, XciD, mcpotato, Narsil, julien-c

•

Apr 4, 2024

• 14

upvoted 2 papers over 2 years ago

Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference

Paper • 2403.14520 • Published Mar 21, 2024 • 35

Mixtral of Experts

Paper • 2401.04088 • Published Jan 8, 2024 • 162

Lucas Rose-Winters

AI & ML interests

Recent Activity

Organizations

paperboygold's activity

Australian-made LLM beats OpenAI and Google at legal retrieval

Open-R1: a fully open reproduction of DeepSeek-R1

Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset

Hugging Face partners with Wiz Research to Improve AI Security