8 3 5

Trilok Padhi

tpadhi1

trilokpadhi

AI & ML interests

None yet

Recent Activity

liked a Space 19 days ago

nanotron/ultrascale-playbook

upvoted an article 11 months ago

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

upvoted a collection about 1 year ago

PaliGemma 2 Release

View all activity

Organizations

None yet

liked a Space 19 days ago

The Ultra-Scale Playbook

🌌

3.61k

The ultimate guide to training LLM on large GPU Clusters

upvoted an article 11 months ago

Article

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

May 7, 2024

•

111

upvoted a collection about 1 year ago

PaliGemma 2 Release

Collection

Vision-Language Models available in multiple 3B, 10B and 28B variants. • 32 items • Updated Jul 10 • 151

liked a model about 1 year ago

MBZUAI/GLaMM-RefSeg

Text Generation • Updated Dec 26, 2023 • 20 • 1

New activity in IDEA-Research/grounding-dino-base over 1 year ago

the sample code in the README its not working

#2 opened over 1 year ago by

Javierquin

upvoted an article over 1 year ago

Article

seemore: Implement a Vision Language Model from Scratch

Jun 23, 2024

•

104

New activity in microsoft/Phi-3-small-128k-instruct over 1 year ago

AssertionError: Flash Attention is not available, but is needed for dense attention

#30 opened over 1 year ago by

tpadhi1

New activity in meta-llama/Meta-Llama-3-8B over 1 year ago

Error with Anaconda with Pycharm

#205 opened over 1 year ago by

yiwens

New activity in numind/NuExtract-large over 1 year ago

AssertionError: Flash Attention is not available, but is needed for dense attention

#7 opened over 1 year ago by

tpadhi1

New activity in microsoft/Phi-3-small-128k-instruct over 1 year ago

Target_module of this phi-3-small model

#3 opened over 1 year ago by

hackint0sh

liked a Space almost 2 years ago

OpenCodeInterpreter Demo

🚀

updated a model almost 2 years ago

tpadhi1/llama-2-7b-chat-hf-finetuned-mental-health-reddit-trilok

Text Generation • 7B • Updated Feb 28, 2024

liked a model about 2 years ago

openai/whisper-large-v3

Automatic Speech Recognition • 2B • Updated Aug 12, 2024 • 6.42M • • 5.24k

liked a model over 2 years ago

meta-llama/Llama-2-7b

Text Generation • Updated Apr 17, 2024 • 433 • 4.44k

Trilok Padhi

AI & ML interests

Recent Activity

Organizations

tpadhi1's activity

The Ultra-Scale Playbook

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

the sample code in the README its not working

seemore: Implement a Vision Language Model from Scratch

AssertionError: Flash Attention is not available, but is needed for dense attention

Error with Anaconda with Pycharm

AssertionError: Flash Attention is not available, but is needed for dense attention

Target_module of this phi-3-small model

OpenCodeInterpreter Demo