Ingrid Tveten
ingridtv
AI & ML interests
Medical image analysis and machine learning
Recent Activity
Updated a collection 22 days ago: GenAI/LLM
Updated a collection about 2 months ago: Document understanding
Updated a collection 2 months ago: Document understanding
Organizations
None yet
Medical images, encoding
Multimodal/VLM
microsoft/Phi-4-multimodal-instruct
Automatic Speech Recognition • 6B • Updated • 214k • 1.56k
microsoft/Phi-4-mini-instruct
Text Generation • 4B • Updated • 147k • 667
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion
Paper • 2503.11576 • Published • 136
Emerging Properties in Unified Multimodal Pretraining
Paper • 2505.14683 • Published • 133
Medical LM, Specific
GenAI/LLM
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Text Generation • 2B • Updated • 808k • 1.44k
Qwen/CodeQwen1.5-7B-Chat
Text Generation • 7B • Updated • 1.11k • 351
lmstudio-community/gemma-3-12b-it-GGUF
Image-Text-to-Text • 12B • Updated • 45.7k • 40
google/gemma-3-12b-it-qat-q4_0-gguf
Image-Text-to-Text • 12B • Updated • 17.5k • 237
Document understanding