LLMs struggle to explain even simple number patterns they can sometimes recognize. A new interactive demo shows how LLMs fail to provide coherent reasoning despite access to tools and multiple attempts. Are impressive benchmarks masking fundamental reasoning limitations?
Posted on 27 Sep 2024