Monishwaran's picture

4

Monishwaran

Nietzsche6700

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 days ago

EfficientRollout: System-Aware Self-Speculative Decoding for RL Rollouts

upvoted a paper about 1 month ago

Learning, Fast and Slow: Towards LLMs That Adapt Continually

published a model 3 months ago

Nietzsche6700/qwen-prm

View all activity

Organizations

upvoted a paper 7 days ago

EfficientRollout: System-Aware Self-Speculative Decoding for RL Rollouts

Paper • 2606.18967 • Published 9 days ago • 24

upvoted a paper about 1 month ago

Learning, Fast and Slow: Towards LLMs That Adapt Continually

Paper • 2605.12484 • Published May 12 • 18

published a model 3 months ago

Nietzsche6700/qwen-prm

updated a model 3 months ago

Nietzsche6700/NW1110-draft-Llama-3.1-8B-Instruct-target-Llama-3.1-70B-Instruct-732

2B • Updated Mar 27 • 5

published a model 3 months ago

Nietzsche6700/NW1110-draft-Llama-3.1-8B-Instruct-target-Llama-3.1-70B-Instruct-732

2B • Updated Mar 27 • 5

updated a model 5 months ago

Nietzsche6700/Qwen3-32B-long-no-rope-scaling

Text Generation • 33B • Updated Jan 24 • 3

published a model 5 months ago

Nietzsche6700/Qwen3-32B-long-no-rope-scaling

Text Generation • 33B • Updated Jan 24 • 3

updated a model 5 months ago

Nietzsche6700/qwen3-4b-long-no-rope-scaling

Text Generation • 4B • Updated Jan 24 • 1

published a model 5 months ago

Nietzsche6700/qwen3-4b-long-no-rope-scaling

Text Generation • 4B • Updated Jan 24 • 1

updated a model 5 months ago

Nietzsche6700/Qwen-2.5-Instruct-32B-long

Text Generation • 33B • Updated Jan 22 • 2

published a model 5 months ago

Nietzsche6700/Qwen-2.5-Instruct-32B-long

Text Generation • 33B • Updated Jan 22 • 2

updated a model 5 months ago

Nietzsche6700/Qwen-2.5-7B-Instruct-long

Text Generation • 8B • Updated Jan 22 • 2

published a model 5 months ago

Nietzsche6700/Qwen-2.5-7B-Instruct-long

Text Generation • 8B • Updated Jan 22 • 2

updated a model 5 months ago

Nietzsche6700/Qwen-2.5-3B-Instruct-long

Text Generation • 3B • Updated Jan 22 • 2

published a model 5 months ago

Nietzsche6700/Qwen-2.5-3B-Instruct-long

Text Generation • 3B • Updated Jan 22 • 2

authored 3 papers 7 months ago

Squeezed Attention: Accelerating Long Context Length LLM Inference

Paper • 2411.09688 • Published Nov 14, 2024 • 1

ETS: Efficient Tree Search for Inference-Time Scaling

Paper • 2502.13575 • Published Feb 19, 2025

Arbitrage: Efficient Reasoning via Advantage-Aware Speculation

Paper • 2512.05033 • Published Dec 4, 2025 • 17

upvoted a paper 7 months ago

Arbitrage: Efficient Reasoning via Advantage-Aware Speculation

Paper • 2512.05033 • Published Dec 4, 2025 • 17

upvoted a paper 8 months ago

AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders

Paper • 2510.19779 • Published Oct 22, 2025 • 62