CodeMMLU, a comprehensive multiple-choice question-answering benchmark for evaluating code understanding in LLMs
Share this post
CodeMMLU: A Multi-Task Benchmark for…
Share this post
CodeMMLU, a comprehensive multiple-choice question-answering benchmark for evaluating code understanding in LLMs