MsingiAI Data Challenge

Join us in building high-quality datasets for African language AI

About the Challenge

Help us create comprehensive datasets that will power the next generation of African language AI models

Data Quality

Ensure high-quality, culturally relevant data collection for African languages

Community Driven

Collaborate with native speakers and language experts

Open Source

All datasets will be freely available to the research community

How to Participate

Join our data collection initiative in three simple steps

1. Sign Up

Create an account and join our Discord community

Join Discord →

2. Choose Tasks

Select languages and data collection tasks that interest you

View Tasks →

3. Start Contributing

Begin collecting and validating data for your chosen tasks

View Guidelines →

Submit Your Dataset

Help us build the most comprehensive African language dataset collection

Data Challenge Submission Form

Use the button below to open the dataset submission form

Open Submission Form

Supported file types: .txt, .csv, .json, .xlsx
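
Before opening the form, contributors may want to confirm locally that their files use one of the supported types listed above. The following is a minimal sketch of such a check, not the form's own validation: the helper names `is_supported` and `unsupported_files` are hypothetical, and it assumes the file type is judged by extension alone, case-insensitively.

```python
from pathlib import Path

# Extensions listed on this page as supported by the submission form.
SUPPORTED_EXTENSIONS = {".txt", ".csv", ".json", ".xlsx"}


def is_supported(filename: str) -> bool:
    """Return True if the file's final extension is a supported type.

    Assumption: matching is by extension only and ignores case.
    """
    return Path(filename).suffix.lower() in SUPPORTED_EXTENSIONS


def unsupported_files(filenames):
    """Return the names whose extensions are not in the supported set."""
    return [name for name in filenames if not is_supported(name)]


if __name__ == "__main__":
    # Hypothetical file names, for illustration only.
    candidates = ["swahili_sentences.csv", "yoruba_proverbs.json", "recording.wav"]
    for name in unsupported_files(candidates):
        print(f"Not a supported file type: {name}")
```

A file flagged by this check would need to be converted to one of the four listed types before submission.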

Ready to Make an Impact?