Computer Vision Lab, CAIDAS, University of Würzburg

university

Verified

https://www.informatik.uni-wuerzburg.de/computervision/home/

Activity Feed Request to join this org

AI & ML interests

Artificial Intelligence, Computer Vision, Machine Learning, Computational Photography, Image Enhancement, Super-Resolution, Compression, Streaming

Recent Activity

nielsr submitted a paper 11 days ago

Duration Aware Scheduling for ASR Serving Under Workload Drift

nielsr submitted a paper 27 days ago

Ultralytics YOLO26: Unified Real-Time End-to-End Vision Models

nielsr submitted a paper about 1 month ago

Gemini Embedding 2: A Native Multimodal Embedding Model from Gemini

View all activity

submitted a paper to Daily Papers 11 days ago

Duration Aware Scheduling for ASR Serving Under Workload Drift

Paper • 2603.11273 • Published Mar 11 • 3

submitted a paper to Daily Papers 27 days ago

Ultralytics YOLO26: Unified Real-Time End-to-End Vision Models

Paper • 2606.03748 • Published 29 days ago • 15

submitted 2 papers to Daily Papers about 1 month ago

Gemini Embedding 2: A Native Multimodal Embedding Model from Gemini

Paper • 2605.27295 • Published May 26 • 23

Stable Audio 3

Paper • 2605.17991 • Published May 18 • 21

submitted a paper to Daily Papers 2 months ago

Scaling Test-Time Compute for Agentic Coding

Paper • 2604.16529 • Published Apr 16 • 12

submitted 6 papers to Daily Papers 3 months ago

Geometric Context Transformer for Streaming 3D Reconstruction

Paper • 2604.14141 • Published Apr 15 • 22

A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens

Paper • 2604.04913 • Published Apr 6 • 12

MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios

Paper • 2603.28130 • Published Mar 30 • 11

Do VLMs Need Vision Transformers? Evaluating State Space Models as Vision Encoders

Paper • 2603.19209 • Published Mar 19 • 6

V-JEPA 2.1: Unlocking Dense Features in Video Self-Supervised Learning

Paper • 2603.14482 • Published Mar 15 • 36

Omnilingual MT: Machine Translation for 1,600 Languages

Paper • 2603.16309 • Published Mar 17 • 24

authored a paper 4 months ago

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

Paper • 2603.12180 • Published Mar 12 • 65

submitted 2 papers to Daily Papers 4 months ago

VidEoMT: Your ViT is Secretly Also a Video Segmentation Model

Paper • 2602.17807 • Published Feb 19 • 7

Causal-JEPA: Learning World Models through Object-Level Latent Interventions

Paper • 2602.11389 • Published Feb 11 • 11

submitted a paper to Daily Papers 5 months ago

UPLiFT: Efficient Pixel-Dense Feature Upsampling with Local Attenders

Paper • 2601.17950 • Published Jan 25 • 4

submitted 2 papers to Daily Papers 6 months ago

TCAndon-Router: Adaptive Reasoning Router for Multi-Agent Collaboration

Paper • 2601.04544 • Published Jan 8 • 6

CASA: Cross-Attention via Self-Attention for Efficient Vision-Language Fusion

Paper • 2512.19535 • Published Dec 22, 2025 • 13

authored 2 papers about 1 year ago

NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation

Paper • 2112.02721 • Published Dec 6, 2021

ModernGBERT: German-only 1B Encoder Model Trained from Scratch

Paper • 2505.13136 • Published May 19, 2025 • 22

authored a paper over 1 year ago

AIS 2024 Challenge on Video Quality Assessment of User-Generated Content: Methods and Results

Paper • 2404.16205 • Published Apr 24, 2024