Finding GPT-4’s mistakes with GPT-4
CriticGPT, a model based on GPT-4, writes critiques of ChatGPT responses to help human trainers spot mistakes during RLHF

CriticGPT, a model based on GPT-4, writes critiques of ChatGPT responses to help human trainers spot mistakes during RLHF

We’re partnering with TIME and its 101 years of archival content to enhance responses and provide links to stories on Time.com
