arxiv:2511.07253
Umberto Cappellazzo
hisoka94
AI & ML interests
Multimodal Large Language Models and audio-visual speech processing at @ Imperial College London.
Recent Activity
submitted
a paper
about 3 hours ago
Dr. SHAP-AV: Decoding Relative Modality Contributions via Shapley Attribution in Audio-Visual Speech Recognition upvoted a paper 4 days ago
Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders Organizations
None yet