1 3 8

Astarag Mohapatra

Athekunal

AI & ML interests

None yet

Recent Activity

updated a dataset 26 days ago

Athekunal/Agent-Skills-Retriever

updated a model 26 days ago

Athekunal/Qwen3-0.6B-Agent-Skills-Retriever

published a model 26 days ago

Athekunal/Qwen3-0.6B-Agent-Skills-Retriever

View all activity

Organizations

updated a dataset 26 days ago

Athekunal/Agent-Skills-Retriever

Viewer • Updated 26 days ago • 100k • 117

updated a model 26 days ago

Athekunal/Qwen3-0.6B-Agent-Skills-Retriever

published a model 26 days ago

Athekunal/Qwen3-0.6B-Agent-Skills-Retriever

published a dataset 26 days ago

Athekunal/Agent-Skills-Retriever

Viewer • Updated 26 days ago • 100k • 117

upvoted an article about 1 month ago

Article

DualPipe Explained: A Comprehensive Guide to DualPipe That Anyone Can Understand—Even Without a Distributed Training Background

NormalUhr

•

Feb 28, 2025

• 19

upvoted an article 11 months ago

Article

You could have designed state of the art positional encoding

FL33TW00D-HF

•

Nov 25, 2024

• 479

upvoted a paper 12 months ago

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published May 23, 2025 • 88

liked a model 12 months ago

google/medgemma-27b-text-it

Text Generation • Updated Sep 16, 2025 • 71.5k • • 430

liked a Space about 1 year ago

The Distill Template

🌌

Craft Beautiful Blogs

New activity in nanotron/predict_memory about 1 year ago

How is activation memory calculated?

#1 opened about 1 year ago by

trungtvu

liked 2 Spaces about 1 year ago

Predict Memory

🧮

108

Calculate and visualize memory usage for model training

The Ultra-Scale Playbook

🌌

3.85k

The ultimate guide to training LLM on large GPU Clusters

liked a model about 2 years ago

gorilla-llm/gorilla-openfunctions-v2

Text Generation • Updated Apr 18, 2024 • 1.11k • 245

updated a collection over 2 years ago

Papers to read

Collection

1 item • Updated Feb 9, 2024

liked 2 models over 2 years ago

ProsusAI/finbert

Text Classification • Updated May 23, 2023 • 6.67M • • 1.16k

google-t5/t5-small

Translation • 60.5M • Updated Jun 30, 2023 • 2.86M • • 544

liked a dataset over 2 years ago

google-research-datasets/cfq

Viewer • Updated Jan 18, 2024 • 865k • 632 • 6

Astarag Mohapatra

AI & ML interests

Recent Activity

Organizations

Athekunal's activity

DualPipe Explained: A Comprehensive Guide to DualPipe That Anyone Can Understand—Even Without a Distributed Training Background

You could have designed state of the art positional encoding

The Distill Template

How is activation memory calculated?

Predict Memory

The Ultra-Scale Playbook