So, what explains the shocking failures of basic reasoning that Simple Bench exposes? Let’s return to the vegetables — or fruits and vegetables, if you wish. Here’s my take on the first of my two questions: Why do models fail the question you saw above?
…
The clue is in their name: language models. They model language.
https://www.freethink.com/robots-ai/simple-bench
But sure, spend the next couple thousand days not trusting your own lying eyes. I swear, y’all want to be sucked into this BS. Grow. The. F**k. Up.
Leave a comment