Article • Total noob’s intro to Hugging Face Transformers • 2legit2overfit • Mar 22, 2024 • 97
Agentar-Fin-R1: Enhancing Financial Intelligence through Domain Expertise, Training Efficiency, and Advanced Reasoning Paper • 2507.16802 • Published Jul 22, 2025 • 9
MiniCPM4 Collection MiniCPM4: Ultra-Efficient LLMs on End Devices • 30 items • Updated 8 days ago • 86
Article • Everything You Need to Know about Knowledge Distillation • Kseniase • Mar 6, 2025 • 80
LMDX: Language Model-based Document Information Extraction and Localization Paper • 2309.10952 • Published Sep 19, 2023 • 67
DocLLM: A layout-aware generative language model for multimodal document understanding Paper • 2401.00908 • Published Dec 31, 2023 • 191
google-bert/bert-large-uncased-whole-word-masking-finetuned-squad Question Answering • 0.3B • Updated Feb 19, 2024 • 45.1k • 188