222 113

Ougrid Dumdang

Ougrid-D

ougrid

AI & ML interests

None yet

Recent Activity

liked a model 7 days ago

baidu/Unlimited-OCR

upvoted an article 8 days ago

Beyond LoRA: Can you beat the most popular fine-tuning technique?

liked a model 10 days ago

Boogu/Boogu-Image-0.1-Edit

View all activity

Organizations

upvoted an article 8 days ago

Article

Beyond LoRA: Can you beat the most popular fine-tuning technique?

BenjaminB, sayakpaul, hubnemo, kashif

•

11 days ago

• 66

upvoted a paper 17 days ago

Workflow-GYM: Towards Long-Horizon Evaluation of Computer-use Agentic tasks in Real-World Professional Fields

Paper • 2606.11042 • Published 20 days ago • 22

upvoted 4 papers about 1 month ago

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Paper • 2605.30280 • Published May 28 • 146

upvoted 4 papers about 2 months ago

Qwen-Image-2.0 Technical Report

Paper • 2605.10730 • Published May 11 • 114

Lightning Unified Video Editing via In-Context Sparse Attention

Paper • 2605.04569 • Published May 6 • 18

From Context to Skills: Can Language Models Learn from Context Skillfully?

Paper • 2604.27660 • Published May 3 • 171

EditCrafter: Tuning-free High-Resolution Image Editing via Pretrained Diffusion Model

Paper • 2604.10268 • Published Apr 11 • 12

upvoted an article 2 months ago

Article

Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents

nvidia

•

Apr 28

• 62

upvoted a paper 2 months ago

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Paper • 2604.24764 • Published Apr 27 • 119

upvoted an article 2 months ago

Article

How to Use Transformers.js in a Chrome Extension

nico-martin

•

Apr 23

• 39

upvoted a paper 2 months ago

Qwen3.5-Omni Technical Report

Paper • 2604.15804 • Published Apr 17 • 59

upvoted an article 2 months ago

Article

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

tomaarsen

•

Apr 16

• 72

upvoted 2 papers 3 months ago

FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling

Paper • 2604.06916 • Published Apr 8 • 34

Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory

Paper • 2604.01007 • Published Apr 2 • 31

upvoted an article 3 months ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift

•

Apr 2

• 910

upvoted 2 papers 3 months ago

Voxtral TTS

Paper • 2603.25551 • Published Mar 26 • 63

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Paper • 2603.25040 • Published Mar 26 • 134

Ougrid Dumdang

AI & ML interests

Recent Activity

Organizations

Ougrid-D's activity

Beyond LoRA: Can you beat the most popular fine-tuning technique?

Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents

How to Use Transformers.js in a Chrome Extension

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

Welcome Gemma 4: Frontier multimodal intelligence on device