Article Reverse Engineering a $500M Mystery: From HashHop to Memory-Augmented Language Models Jan 23 • 10
Is this still usable without a Pro account? Will it be able to output everything up to "Submit the job to Hugging Face Jobs"?
Article The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix Nov 3, 2025 • 65
RedHatAI/Llama-4-Scout-17B-16E-Instruct-FP8-dynamic Image-Text-to-Text • 109B • Updated Sep 22, 2025 • 9.32k • 28
NousResearch/DeepHermes-3-Llama-3-8B-Preview Text Generation • 8B • Updated Apr 10, 2025 • 346 • 359
RedHatAI/Llama-3.3-70B-Instruct-quantized.w8a8 Text Generation • 71B • Updated Sep 22, 2025 • 2.43k • 13