Best AI

Large Language Model Evaluation

Model    MMLU Score    GPQA Score
GPT-3    60.5          72.8
LaMDA    67.2          68.1
PaLM     63.8          75.4