Study accuses LM Arena of helping top AI labs game its benchmark | TechCrunch

Image for article Study accuses LM Arena of helping top AI labs game its benchmark | TechCrunch
News Source : TechCrunch

News Summary

  • A new paper accuses LM Arena, the organization behind the popular crowdsourced AI benchmark Chatbot Arena, of helping a select group of AI companies achieve better leaderboard scores
  • The authors say LM Arena allowed some industry-leading AI companies like Meta, OpenAI, Google, and Amazon to privately test several variants of AI models
  • This made it easier for these companies to achieve a top spot on the platform’s leaderboard
A new paper from AI lab Cohere, Stanford, MIT, and Ai2 accuses LM Arena, the organization behind the popular crowdsourced AI benchmark Chatbot Arena, of helping a select group of AI companies achieve [+6190 chars]

Must read Articles