Track, rank and evaluate open LLMs and chatbots
Generate and rate instruction-response pairs
Experiment with and compare different tokenizers