Skander Moalla's picture

6 2

Skander Moalla

skandermoalla

·

https://skandermoalla.com/

AI & ML interests

DeepRL, RL finetuning

Recent Activity

upvoted a paper about 1 month ago

Apertus: Democratizing Open and Compliant LLMs for Global Language Environments

upvoted a paper about 1 month ago

Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions

liked a dataset about 1 month ago

LukeBailey181Pub/D_3k

View all activity

Organizations

upvoted 2 papers about 1 month ago

Apertus: Democratizing Open and Compliant LLMs for Global Language Environments

Paper • 2509.14233 • Published Sep 17, 2025 • 21

Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions

Paper • 2507.08068 • Published Jul 10, 2025 • 1

upvoted a paper 2 months ago

Efficient RL Training for LLMs with Experience Replay

Paper • 2604.08706 • Published Apr 9 • 23

upvoted a paper 7 months ago

Investigating Low-Rank Training in Transformer Language Models: Efficiency and Scaling Analysis

Paper • 2407.09835 • Published Jul 13, 2024 • 1

upvoted a collection almost 2 years ago

Tulu V1 Suite

The set of models associated with the paper "How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources". • 34 items • Updated Mar 4, 2025 • 3