Put AI to work: Lessons from hundreds of successful deployments
Put AI to Work: Lessons from Hundreds of Successful Deployments

Put AI to Work: Lessons from Hundreds of Successful Deployments

LLM-based chatbots’ capabilities have been advancing every month. These improvements are mostly measured by benchmarks like MMLU, HumanEval, and MATH (e.g. sonnet 3.5, gpt-4o). However, as these measures get more and more saturated, is user experience increasing in proportion to these scores? If we envision a future

Ada uses GPT-4 to deliver a new customer service standard
