You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
Stanislas0 38f16233c3 Refactor benchmark utils 2 years ago
..
cpp Add HumanEval-X benchmark 2 years ago
go Add HumanEval-X benchmark 2 years ago
java/data Add HumanEval-X benchmark 2 years ago
js/data Add HumanEval-X benchmark 2 years ago
python/data Add HumanEval-X benchmark 2 years ago
__init__.py Add HumanEval-X benchmark 2 years ago
evaluate_humaneval_x.py Refactor benchmark utils 2 years ago
generate_humaneval_x.py Refactor benchmark utils 2 years ago
translate_humaneval_x.py Add generation and translation scripts 2 years ago