Anthropic/OpenAI may be spending more than $1000 for every $100 you pay them
News Source : Ea.rna.nl
News Summary
- This is the first of two articles about ‘coding with Large ‘Language’ Models’ The benchmarks are suspect (a 50% or even 80% success rate on a coding task compared to a human is effectively completely useless in full-agentic (no human in the loop) coding.
- Is checking in 8 times as many lines of code per day really a good thing?
- What if LLMs edit in a way that Lines of Code becomes less trustworthy a measure as it is already?
- All in all, it reminds me of Google’s deceptive talk about its ‘Willow’ QM computing chip.
Never miss a story from us, subscribe to our newsletter