Here I develop a framework using inspect-ai
for evaluating LLMs on the ETHICS dataset.
This project is a foundation for evaluating the behaviors of complex LLMs such as moral parliaments on ethical questions.
In machine ethics, the moral parliament is a leading idea among ethical decision-making algorithms.
Such algorithms are of particular interest in future LLMs.
-
Notifications
You must be signed in to change notification settings - Fork 0
aaron-sandoval/ethics_eval
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Evaluation of GPT-4o mini on the ETHICS dataset