FastVLM Collection Efficient Vision Encoding for Vision Language Models • 8 items • Updated Mar 2 • 114
MM-RLHF: The Next Step Forward in Multimodal LLM Alignment Paper • 2502.10391 • Published Feb 14, 2025 • 34
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper • 2502.11089 • Published Feb 16, 2025 • 170
Fast Inference from Transformers via Speculative Decoding Paper • 2211.17192 • Published Nov 30, 2022 • 11
Aria: An Open Multimodal Native Mixture-of-Experts Model Paper • 2410.05993 • Published Oct 8, 2024 • 111
Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation Paper • 2410.13848 • Published Oct 17, 2024 • 37
WaveFit: An Iterative and Non-autoregressive Neural Vocoder based on Fixed-Point Iteration Paper • 2210.01029 • Published Oct 3, 2022 • 1
PaLI: A Jointly-Scaled Multilingual Language-Image Model Paper • 2209.06794 • Published Sep 14, 2022 • 2
view article Article A Chatbot on your Laptop: Phi-2 on Intel Meteor Lake +4 juliensimon, echarlaix, ofirzaf, imargulis, guybd, moshew • Mar 20, 2024 • 7
view article Article Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon +6 juliensimon, Haihao, antonyvance, MatrixYao, lianglv, gserochi, Debbh, kding1 • May 9, 2024 • 12
AI PC: Text Generation Collection Text generation LLMs that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. • 158 items • Updated Apr 20 • 17
view article Article From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate +2 mirinflim, aldopareja, muellerzr, stas • Jun 13, 2024 • 62
view article Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models +1 andito, merve, SkalskiP • Jun 24, 2024 • 207
view article Article Welcome Gemma 2 - Google’s new open LLM +4 philschmid, osanseviero, pcuenq, lewtun, tomaarsen, reach-vb • Jun 27, 2024 • 132
Indic Alpaca Datasets Collection This collection comprises an alpaca datasets that encompasses a wide range of Indian languages. • 18 items • Updated Mar 21, 2024 • 10
view article Article License to Call: Introducing Transformers Agents 2.0 +1 m-ric, lysandre, pcuenq • May 13, 2024 • 137