llm-jp/optimal-sparsity-math-d512-E128-k4-3.3B-A220M
Text Generation • 3B • Updated • 5
Jagle: Building a Large-Scale Japanese Multimodal Post-Training Dataset for Vision-Language Models
JAMMEval: A Refined Collection of Japanese Benchmarks for Reliable VLM Evaluation