Here I develop a framework using inspect-ai
for evaluating LLMs on the ETHICS dataset.
This project is a foundation for evaluating the behaviors of complex LLMs such as moral parliaments on ethical questions.
In machine ethics, the moral parliament is a leading idea among ethical decision-making algorithms.
Such algorithms are of particular interest in future LLMs.