Anthropic/OpenAI may be spending more than $1000 for every $100 you pay them

Image for article Anthropic/OpenAI may be spending more than $1000 for every $100 you pay them
News Source : Ea.rna.nl

News Summary

  • This is the first of two articles about ‘coding with Large ‘Language’ Models’ The benchmarks are suspect (a 50% or even 80% success rate on a coding task compared to a human is effectively completely useless in full-agentic (no human in the loop) coding.
  • Is checking in 8 times as many lines of code per day really a good thing?
  • What if LLMs edit in a way that Lines of Code becomes less trustworthy a measure as it is already?
  • All in all, it reminds me of Google’s deceptive talk about its ‘Willow’ QM computing chip.

Must read Articles