BAbI: A Challenge for Commonsense Reasoning
The BAbI benchmark presents a challenging set of tasks designed to evaluate the skills of AI systems in interpreting commonsense knowledge. It contains a wide range of cases that require logic about everyday notions. By evaluating how well AI models can address these problems, researchers strive to