A new AI benchmark tests whether chatbots protect human wellbeing | TechCrunch
News Source: TechCrunch
News Summary
- HumaneBench measures whether chatbots prioritize user wellbeing and how easily those protections fail under pressure.
- Most AI benchmarks measure intelligence and instruction-following, rather than psychological safety.
- ChatGPT-maker OpenAI is currently facing several lawsuits after users died by suicide or suffered life-threatening delusions following prolonged conversations with the chatbot.
- The benchmark was created by a core team including Erika Anderson, Andalib Samandari, Jack Senechal, and Sarah Ladyman.
AI chatbots have been linked to serious mental health harms in heavy users, but there have been few standards for measuring whether they safeguard human wellbeing or just maximize engagement.