Benchmarks for evaluating intent-based chatbots

Benchmarks for evaluating intent-based chatbots

Given the growing importance of bots in all aspects of our digital live (including in software engineering!) and the well-known challenges to test any type of NLP intensive bot, we could definitely use a series of de facto standard datasets to soundly evaluate and...