2 13 16

Debashish C PRO

d3bach

AI & ML interests

omni-modal inference and training. GPUs

Recent Activity

upvoted a paper 6 days ago

Accelerating Streaming Video Large Language Models via Hierarchical Token Compression

upvoted a paper 13 days ago

VL-JEPA: Joint Embedding Predictive Architecture for Vision-language

upvoted a paper 15 days ago

Scaling Audio-Text Retrieval with Multimodal Large Language Models

View all activity

Organizations

upvoted a paper 6 days ago

Accelerating Streaming Video Large Language Models via Hierarchical Token Compression

Paper • 2512.00891 • Published Nov 30, 2025 • 17

upvoted a paper 13 days ago

VL-JEPA: Joint Embedding Predictive Architecture for Vision-language

Paper • 2512.10942 • Published Dec 11, 2025 • 61

upvoted a paper 15 days ago

Scaling Audio-Text Retrieval with Multimodal Large Language Models

Paper • 2602.18010 • Published Feb 20 • 1

authored 2 papers 20 days ago

MARQUIS: A Three-Stage Pipeline for Video Retrieval-Augmented Generation

Paper • 2605.17640 • Published May 17

Principled Context Engineering for RAG: Statistical Guarantees via Conformal Prediction

Paper • 2511.17908 • Published Jan 19

liked a model about 2 months ago

LiquidAI/LFM2.5-Audio-1.5B

Audio-to-Audio • 1B • Updated Mar 30 • 1.53k • 426

upvoted a paper 4 months ago

Multi-Vector Index Compression in Any Modality

Paper • 2602.21202 • Published Feb 24 • 22

liked a model 4 months ago

nvidia/Qwen3.5-397B-A17B-NVFP4

Text Generation • Updated Mar 30 • 635k • 100

upvoted an article 4 months ago

Article

ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models?

lightonai

•

Feb 19

• 22

upvoted a paper 5 months ago

RANKVIDEO: Reasoning Reranking for Text-to-Video Retrieval

Paper • 2602.02444 • Published Feb 2 • 18

liked a model 5 months ago

allenai/SERA-32B

677k • Updated Feb 2 • 304 • 113

New activity in nvidia/KVzap-linear-Llama-3.1-8B-Instruct 5 months ago

docs: update readme to include GitHub url

#2 opened 5 months ago by

d3bach

liked 2 models 6 months ago

nvidia/audio-flamingo-3-hf

Audio-Text-to-Text • 8B • Updated Apr 13 • 216k • 187

Qwen/Qwen3-Omni-30B-A3B-Instruct

Any-to-Any • 35B • Updated Sep 22, 2025 • 2.06M • 943

upvoted an article 8 months ago

Article

Visualize and understand GPU memory in PyTorch

qgallouedec

•

Dec 24, 2024

• 273

updated a collection 9 months ago

VLM training

Collection

List of VLM papers • 3 items • Updated Sep 15, 2025

upvoted an article 9 months ago

Article

mmBERT: ModernBERT goes Multilingual

mmarone, orionweller, will-fleshman, eugene-yang, dlawrie, vandurme

•

Sep 9, 2025

• 147

liked a Space 10 months ago

The Ultra-Scale Playbook

🌌

3.9k

The ultimate guide to training LLM on large GPU Clusters

upvoted an article 10 months ago

Article

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

nvidia

•

Aug 11, 2025

• 76

Debashish C PRO

AI & ML interests

Recent Activity

Organizations

d3bach's activity

**ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models?**

docs: update readme to include GitHub url

Visualize and understand GPU memory in PyTorch

mmBERT: ModernBERT goes Multilingual

The Ultra-Scale Playbook

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models?