
Mariusj G

MariusjG

AI & ML interests

None yet

Organizations

None yet

upvoted 2 articles 3 months ago

Mixture of Experts (MoEs) in Transformers

Article • ariG23498, pcuenq, merve, IlyasMoutawwakil, ArthurZ, sergiopaniego, Molbap • Feb 26 • 169

Build a Domain-Specific Embedding Model in Under a Day

Article • nvidia • Mar 20 • 74

upvoted 2 articles 8 months ago

Welcome EmbeddingGemma, Google's new efficient embedding model

Article • tomaarsen, Xenova, alvarobartt, ariG23498, pcuenq, sergiopaniego • Sep 4, 2025 • 275

Supercharge your OCR Pipelines with Open Models

Article • merve, ariG23498, davanstrien, hynky, andito, reach-vb, pcuenq • Oct 21, 2025 • 315

upvoted an article 10 months ago

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

Article • drbh, danieldk • Aug 18, 2025 • 104

upvoted a paper 10 months ago

Prompt Orchestration Markup Language

Paper • 2508.13948 • Published Aug 19, 2025 • 48

upvoted 9 papers 11 months ago

Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs

Paper • 2507.09477 • Published Jul 13, 2025 • 89

SingLoRA: Low Rank Adaptation Using a Single Matrix

Paper • 2507.05566 • Published Jul 8, 2025 • 116

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1, 2025 • 257

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17, 2025 • 264

upvoted 5 papers 12 months ago

Sekai: A Video Dataset towards World Exploration

Paper • 2506.15675 • Published Jun 18, 2025 • 67

ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development

Paper • 2506.05010 • Published Jun 5, 2025 • 82

PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers

Paper • 2506.05573 • Published Jun 5, 2025 • 83

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2, 2025 • 161

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 265
