Building a Biases LLM Leaderboard

Building a Biases LLM Leaderboard

We have released the first (AFAIK) leaderboard for LLMs specialized in assessing their ethical biases, such as ageism, racism, sexism,… The initiative aims to raise awareness about the status of the latest advances in development of ethical AI, and foster its...
For a more transparent governance of open source

For a more transparent governance of open source

The long-term sustainability of FOSS is a complex and multi-dimensional problem (technical, economical, social, political, etc.). We believe more transparency in how projects are governed would be a significant improvement to all such dimensions. And one that it is...
Benchmarks for evaluating intent-based chatbots

Benchmarks for evaluating intent-based chatbots

Given the growing importance of bots in all aspects of our digital live (including in software engineering!) and the well-known challenges to test any type of NLP intensive bot, we could definitely use a series of de facto standard datasets to soundly evaluate and...