Ayman Mohammed

Pegasus10

·

AI & ML interests

Interest in foundation, multimodal, LLMs , and ML, deep and reinforcement learning

Recent Activity

liked a model 2 days ago

LiquidAI/LFM2.5-230M

upvoted a paper 4 days ago

Running the Gauntlet: Re-evaluating the Capabilities of Agents Beyond Familiar Environments

upvoted an article 6 days ago

Run a vLLM Server on HF Jobs in One Command

View all activity

Organizations

None yet

upvoted a paper 4 days ago

Running the Gauntlet: Re-evaluating the Capabilities of Agents Beyond Familiar Environments

Paper • 2606.14397 • Published 7 days ago • 18

upvoted 3 articles 6 days ago

Article

Run a vLLM Server on HF Jobs in One Command

qgallouedec

•

6 days ago

• 11

Article

Accelerating Transformers Fine-Tuning with NVIDIA NeMo AutoModel

nvidia

•

8 days ago

• 34

Article

Build real agentic apps using CUGA: two dozen working examples on a lightweight harness

ibm-research

•

9 days ago

• 36

upvoted 2 articles 11 days ago

Article

MosaicLeaks: Can your research agent keep a secret?

ServiceNow

•

14 days ago

• 13

Article

MolmoMotion: Language-guided 3D motion forecasting

allenai

•

15 days ago

• 10

upvoted an article 17 days ago

Article

olmo-eval: An evaluation workbench for the model development loop

allenai

•

20 days ago

• 17

upvoted an article 20 days ago

Article

Profiling in PyTorch (Part 2): From nn.Linear to a Fused MLP

+3

ariG23498, ror, sergiopaniego, pcuenq, sayakpaul

•

21 days ago

• 50

upvoted 2 articles 25 days ago

Article

EVA-Bench Data 2.0: 3 Domains, 121 Tools, 213 Scenarios

ServiceNow-AI

•

28 days ago

• 41

Article

Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI

nvidia

•

28 days ago

• 12

upvoted an article about 1 month ago

Article

Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL

+6

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, lvwerra, sergiopaniego

•

May 27

• 42

upvoted 2 collections about 1 month ago

Stable Audio 3

Stable Audio 3 Post-trained models • 3 items • Updated May 20 • 43

Nemotron-Labs-Diffusion

A Tri-Mode Language Model Family Unifying Autoregressive, Diffusion, and Self-Speculation Decoding • 7 items • Updated 20 days ago • 51

upvoted 2 articles about 1 month ago

Article

Introducing the Ettin Reranker Family

tomaarsen

•

May 19

• 53

Article

Fine-Tuning NVIDIA Cosmos Predict 2.5 with LoRA/DoRA for Robot Video Generation

nvidia

•

May 18

• 21

upvoted 3 articles about 2 months ago

Article

Unlocking asynchronicity in continuous batching

+1

ror, pcuenq, ariG23498

•

May 14

• 61

Article

Building Blocks for Foundation Model Training and Inference on AWS

amazon

•

May 11

• 24

Article

CyberSecQwen-4B: Why Defensive Cyber Needs Small, Specialized, Locally-Runnable Models

lablab-ai-amd-developer-hackathon

•

May 8

• 10

upvoted 2 collections about 2 months ago

Qwen-Scope

16 items • Updated May 14 • 75

Mistral Medium 3.5

Our first flaship models handling instruction-following, reasoning, and coding in a single set of opened-weights. • 2 items • Updated Apr 29 • 19