Alessamo

AI & ML interests

None yet

Organizations

None yet

Alessamo's collections (6)

OysterAI/Qwen2.5-3B-Instruct-SAEs

Updated Sep 23, 2025
adamkarvonen/qwen3-32b-saes

Updated Nov 24, 2025 • 4
Jammies-io/sae-Llama-3.1-8B-Instruct-layer18-sycophancy-v2

0.1B • Updated Jul 19, 2025 • 7

Reasoning with Exploration: An Entropy Perspective

Paper • 2506.14758 • Published Jun 17, 2025 • 30

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18, 2025 • 141
TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22, 2025 • 123
What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models

Paper • 2503.24235 • Published Mar 31, 2025 • 55
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2, 2025 • 190

time series LLM

Foundation Models for Time Series: A Survey

Paper • 2504.04011 • Published Apr 5, 2025

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17, 2025 • 98
Which Data Attributes Stimulate Math and Code Reasoning? An Investigation via Influence Functions

Paper • 2505.19949 • Published May 26, 2025 • 16

Pre-DPO: Improving Data Utilization in Direct Preference Optimization Using a Guiding Reference Model

Paper • 2504.15843 • Published Apr 22, 2025 • 16
COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published Apr 7, 2025 • 44
