CodeMMLU, a comprehensive multiple-choice question-answering benchmark for evaluating code understanding in LLMs
CodeMMLU: A Multi-Task Benchmark for…
CodeMMLU, a comprehensive multiple-choice question-answering benchmark for evaluating code understanding in LLMs