Building a Biases LLM Leaderboard

Building a Biases LLM Leaderboard

We have released the first (AFAIK) leaderboard for LLMs specialized in assessing their ethical biases, such as ageism, racism, sexism,… The initiative aims to raise awareness about the status of the latest advances in development of ethical AI, and foster its...
Benchmarks for evaluating intent-based chatbots

Benchmarks for evaluating intent-based chatbots

Given the growing importance of bots in all aspects of our digital live (including in software engineering!) and the well-known challenges to test any type of NLP intensive bot, we could definitely use a series of de facto standard datasets to soundly evaluate and...