Evaluate LLMs using Kazakh MC tasks
VLMEvalKit Evaluation Results Collection
Explore and submit LLM benchmark evaluations
Display and run auto evaluation logs