-
Notifications
You must be signed in to change notification settings - Fork 181
Issues: openai/simple-evals
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
MMLU answer extraction regex fails with repeated "Answer: LETTER" pattern
#33
opened Dec 10, 2024 by
lucasresck
Simplified Standalone Version for SimpleQA with public result
#32
opened Nov 28, 2024 by
BorisLoveDev
What is the recommended way to access the simpleQA dataset?
#25
opened Oct 31, 2024 by
ZhangYiqun018
Run benchmarks for old GPT-4 models (GPT-4-0314 and GPT-4-0613) and all GPT-3.5-turbo models
#9
opened May 14, 2024 by
mikita-apollo
Run benchmarks also for GPT-3.5 versions and Claude Sonnet and Haiku
#7
opened Apr 17, 2024 by
zurferr
ProTip!
What’s not been updated in a month: updated:<2024-11-28.