Oleksiy Ostapenko

ostapeno

oleksost

AI & ML interests

Continual Learning, Transfer Learning, Modularity

Recent Activity

liked a model 27 days ago

ServiceNow-AI/SuperApriel-15B-Instruct

liked a Space about 1 month ago

nanotron/ultrascale-playbook

authored a paper about 2 months ago

Apriel-H1: Towards Efficient Enterprise Reasoning Models

Organizations

liked a model 27 days ago

ServiceNow-AI/SuperApriel-15B-Instruct

Text Generation • Updated Apr 23 • 32 • 10

liked a Space about 1 month ago

The Ultra-Scale Playbook

🌌

3.9k

The ultimate guide to training LLM on large GPU Clusters

authored 3 papers about 2 months ago

upvoted a paper 2 months ago

Super Apriel: One Checkpoint, Many Speeds

Paper • 2604.19877 • Published Apr 21 • 2

updated 2 models 2 months ago

ServiceNow-AI/SuperApriel-15b-Base

Text Generation • Updated Apr 23 • 19 • 3

ServiceNow-AI/SuperApriel-15B-Instruct

Text Generation • Updated Apr 23 • 32 • 10

published a model 2 months ago

ServiceNow-AI/SuperApriel-15B-Instruct

Text Generation • Updated Apr 23 • 32 • 10

upvoted a paper 3 months ago

EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings

Paper • 2603.13594 • Published Mar 13 • 149

updated a model 7 months ago

ostapeno/ap1p5_wasft_200steps

15B • Updated Nov 20, 2025

published a model 7 months ago

ostapeno/ap1p5_wasft_200steps

15B • Updated Nov 20, 2025

published an article 7 months ago

Article

Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models

ServiceNow-AI • Nov 19, 2025 • 34

upvoted an article 7 months ago

Article

Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models

ServiceNow-AI • Nov 19, 2025 • 34

authored 2 papers 8 months ago

Challenging Common Assumptions about Catastrophic Forgetting

Paper • 2207.04543 • Published Jul 10, 2022

Apriel-Nemotron-15B-Thinker

Paper • 2508.10948 • Published Aug 13, 2025 • 6

upvoted 3 papers 8 months ago

Apriel-1.5-15b-Thinker

Paper • 2510.01141 • Published Oct 1, 2025 • 125

Apriel-Nemotron-15B-Thinker

Paper • 2508.10948 • Published Aug 13, 2025 • 6

Towards Modular LLMs by Building and Reusing a Library of LoRAs

Paper • 2405.11157 • Published May 18, 2024 • 31

liked a model 8 months ago

ServiceNow-AI/Apriel-H1-15b-Thinker-SFT

Text Generation • 16B • Updated Nov 3, 2025 • 31 • 29
