Blog

HFCommunity – A database to analyze the development of the Hugging Face community
We provide a relational database storing key information about the development and collaborations around the projects hosted on Hugging Face to better understand and improve the development of open source AI components
20+ tools to help you mine and analyze GitHub and Git data
List of tools to mine, analyze and visualize all the data around your software projects, including users, commits, issues… from Git, GitHub and other popular platforms
Testing challenges for NLP-intensive bots
We review and adapt current testing concepts to NLP-intensive bots in order to advance towards a unified reference framework to promote, classify, compare and evaluate future NLP-intensive bot testing research initiatives.
Bots against harassment in open source
Thanks to pretrained language models, bots can be used to detect and flag toxic comments. Bots could play a major role in our goal towards healthier and more inclusive open source communities.
Tools to visualize the history of a git repository
Best tools to visualize, understand and manage complex Git repositories with multiple parallel branches to deal with
Participation Inequality and the 90-9-1 Principle in Open Source
Participation inequality is a major challenge in any shared-resource system. This is also known as the “volunteer’s dilemma” or the tragedy of the commons. We set out to explore whether this problem also arises in open source communities.
Governance Readiness Checklist for Open Source projects
Our proposal can help the community to identify the main challenges they may face when governing Open Source Systems.
Lessons learned from building a commercial open source bot development platform
Summary of my keynote talk at BotSE 2020 where I discuss the organizational, technical, NLP and Business challenges of building a commercial bot platform
The Role of Foundations in Open Source Projects
A number of Open-Source Systems (OSS) have created parallel foundations, as legal instruments to better articulate the structure, collaboration and financial model for the project. We study over 80 foundations to analyze the impact they have on the open source projects the foundations aim to help.
Discussion in open source projects is strongly limited by the tools we use
Open source needs better tools to make sure the community can effectively discuss and make important decisions of the future of the project
Online division of labour: self-organization in Open Source
Sustainability of open source projects requires a better understanding of how contributors tend to organize themselves.