Study accuses LM Arena of helping top AI labs game its benchmark | TechCrunch

News Source: TechCrunch
News Summary
- A new paper accuses LM Arena, the organization behind the popular crowdsourced AI benchmark Chatbot Arena, of helping a select group of AI companies achieve better leaderboard scores
- The authors say LM Arena allowed some industry-leading AI companies like Meta, OpenAI, Google, and Amazon to privately test several variants of AI models
- According to the authors, this private testing made it easier for these companies to achieve a top spot on the platform’s leaderboard
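To illustrate why privately testing several variants could lift a leaderboard position, here is a minimal simulation sketch. It is not taken from the paper: it assumes every variant has the same underlying quality, that each measured score carries random noise, and that only the best-scoring variant is made public. The function names and the numbers (`true_skill`, `noise_sd`, trial counts) are hypothetical and chosen only for illustration.

```python
import random
import statistics


def measured_score(true_skill: float, noise_sd: float, rng: random.Random) -> float:
    """One noisy leaderboard measurement of a model with the given true skill."""
    return rng.gauss(true_skill, noise_sd)


def published_score(n_variants: int, true_skill: float, noise_sd: float,
                    rng: random.Random) -> float:
    """Score a lab would publish if it tested n_variants privately
    and released only the best one (an assumption of this sketch)."""
    return max(measured_score(true_skill, noise_sd, rng) for _ in range(n_variants))


def average_published_score(n_variants: int, trials: int = 2000,
                            true_skill: float = 1200.0, noise_sd: float = 15.0,
                            seed: int = 0) -> float:
    """Average published score over many simulated launches.

    All variants share the same true skill, so any gain over true_skill
    comes purely from selecting the luckiest measurement.
    """
    rng = random.Random(seed)
    return statistics.fmean(
        published_score(n_variants, true_skill, noise_sd, rng)
        for _ in range(trials)
    )


if __name__ == "__main__":
    for n in (1, 5, 10):
        print(f"variants tested: {n:2d}  average published score: "
              f"{average_published_score(n):.1f}")
```

Under these assumptions, the average published score rises with the number of privately tested variants even though no variant is actually better than the others, which is the kind of selection effect the bullets above describe.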
A new paper from AI lab Cohere, Stanford, MIT, and Ai2 accuses LM Arena, the organization behind the popular crowdsourced AI benchmark Chatbot Arena, of helping a select group of AI companies achieve …