gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. β’ 2 items β’ Updated Aug 7 β’ 391
view article Article π¦Έπ»#14: What Is MCP, and Why Is Everyone β Suddenly!β Talking About It? Mar 17 β’ 345
If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents Paper β’ 2401.00812 β’ Published Jan 1, 2024 β’ 11
Llama 3.3 (All Versions) Collection Meta's new Llama 3.3 (70B) model in all formats. Includes GGUF, 4-bit bnb and original versions. β’ 3 items β’ Updated 10 days ago β’ 37
view article Article A failed experiment: Infini-Attention, and why we should keep trying? +1 Aug 14, 2024 β’ 71
Model Merging Collection Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! β’ 30 items β’ Updated Jun 12, 2024 β’ 248
view article Article Fine-Tune Whisper For Multilingual ASR with π€ Transformers Nov 3, 2022 β’ 335
LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models Paper β’ 2407.07895 β’ Published Jul 10, 2024 β’ 42
Gemma 2: Improving Open Language Models at a Practical Size Paper β’ 2408.00118 β’ Published Jul 31, 2024 β’ 79