Ingrid Tveten
ingridtv
·
AI & ML interests
Medical image analysis and machine learning
Recent Activity
updated a collection 29 days ago
Multimodal/VLM updated a collection 29 days ago
Multimodal/VLM updated a collection 4 months ago
GenAI/LLMOrganizations
None yet
Medical images, encoding
Multimodal/VLM
-
microsoft/Phi-4-multimodal-instruct
Automatic Speech Recognition • 6B • Updated • 410k • 1.6k -
microsoft/Phi-4-mini-instruct
Text Generation • Updated • 1.55M • • 735 -
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion
Paper • 2503.11576 • Published • 157 -
Emerging Properties in Unified Multimodal Pretraining
Paper • 2505.14683 • Published • 134
Medical LM, Specific
GenAI/LLM
-
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Text Generation • 2B • Updated • 427k • • 1.5k -
Qwen/CodeQwen1.5-7B-Chat
Text Generation • 7B • Updated • 23.8k • 353 -
lmstudio-community/gemma-3-12b-it-GGUF
Image-Text-to-Text • 12B • Updated • 39.3k • 44 -
google/gemma-3-12b-it-qat-q4_0-gguf
Image-Text-to-Text • 12B • Updated • 4.55k • 270
Document understanding
Medical LM, Specific
Medical images, encoding
GenAI/LLM
-
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Text Generation • 2B • Updated • 427k • • 1.5k -
Qwen/CodeQwen1.5-7B-Chat
Text Generation • 7B • Updated • 23.8k • 353 -
lmstudio-community/gemma-3-12b-it-GGUF
Image-Text-to-Text • 12B • Updated • 39.3k • 44 -
google/gemma-3-12b-it-qat-q4_0-gguf
Image-Text-to-Text • 12B • Updated • 4.55k • 270
Multimodal/VLM
-
microsoft/Phi-4-multimodal-instruct
Automatic Speech Recognition • 6B • Updated • 410k • 1.6k -
microsoft/Phi-4-mini-instruct
Text Generation • Updated • 1.55M • • 735 -
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion
Paper • 2503.11576 • Published • 157 -
Emerging Properties in Unified Multimodal Pretraining
Paper • 2505.14683 • Published • 134