🛠️ Steven Gong
Mar 30, 2025, 1 min read
Measuring Massive Multitask Language Understanding (MMLU)
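MMLU is a multiple-choice benchmark for evaluating language models. Each question has four answer options, and the questions span 57 subjects.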
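As a rough illustration of how a multiple-choice benchmark of this kind is typically scored, here is a minimal sketch in Python. The `Question` type, the prompt layout, and the `predict` callback are all assumptions made for the example, not MMLU's official harness, and the two demo questions are made up rather than taken from the dataset.

```python
from dataclasses import dataclass
from typing import Callable

CHOICES = "ABCD"  # four options per question


@dataclass
class Question:
    # Hypothetical container for one benchmark item.
    prompt: str
    options: list[str]
    answer: str  # gold label, one of "A".."D"


def format_question(q: Question) -> str:
    """Render a question as text with lettered options (assumed layout)."""
    lines = [q.prompt]
    lines += [f"{label}. {opt}" for label, opt in zip(CHOICES, q.options)]
    lines.append("Answer:")
    return "\n".join(lines)


def accuracy(questions: list[Question], predict: Callable[[str], str]) -> float:
    """Fraction of questions where the predicted letter matches the gold label."""
    if not questions:
        return 0.0
    correct = 0
    for q in questions:
        # Keep only the first character of the reply, normalised to upper case.
        guess = predict(format_question(q)).strip().upper()[:1]
        correct += guess == q.answer
    return correct / len(questions)


# Made-up demo items, not drawn from MMLU.
demo = [
    Question("What is 2 + 2?", ["3", "4", "5", "6"], "B"),
    Question("Which planet is closest to the Sun?",
             ["Mercury", "Venus", "Earth", "Mars"], "A"),
]

# A trivial stand-in for a model: always answers "A".
always_a_score = accuracy(demo, lambda prompt: "A")
```

In practice `predict` would wrap a call to a language model. Random guessing on four options gives about 25% accuracy, which is the baseline to compare scores against.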
Related
Backlinks
LLM Evaluation