KUAN-TING KE's picture

9 3

KUAN-TING KE

RFTFT

·

AI & ML interests

NLP

Organizations

upvoted an article about 1 year ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

+2

ariG23498, merve, pcuenq, reach-vb

•

Mar 12, 2025

• 497

upvoted a paper over 1 year ago

LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation

Paper • 2411.04997 • Published Nov 7, 2024 • 39

upvoted a paper almost 2 years ago

OLMoE: Open Mixture-of-Experts Language Models

Paper • 2409.02060 • Published Sep 3, 2024 • 81

upvoted a collection almost 2 years ago

Multimodal RAG

9 items • Updated Mar 2 • 31

upvoted 4 papers over 2 years ago

ReNoise: Real Image Inversion Through Iterative Noising

Paper • 2403.14602 • Published Mar 21, 2024 • 21

Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference

Paper • 2403.14520 • Published Mar 21, 2024 • 35

FlashTex: Fast Relightable Mesh Texturing with LightControlNet

Paper • 2402.13251 • Published Feb 20, 2024 • 14

Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models

Paper • 2402.13064 • Published Feb 20, 2024 • 52

upvoted a collection over 2 years ago

ICCV 2023 Demos

Demos for ICCV 2023 papers • 38 items • Updated Oct 5, 2023 • 9