The Smol Training Playbook 📚: The secrets to building world-class LLMs Space • Running on CPU Upgrade • Featured • 2.95k
AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models Paper • 2505.16211 • Published May 22, 2025 • 18
DMind Benchmark: The First Comprehensive Benchmark for LLM Evaluation in the Web3 Domain Paper • 2504.16116 • Published Apr 18, 2025 • 12