Wikimedia's Structured Data Lands on Kaggle!

2025-04-16
Wikimedia's Structured Data Lands on Kaggle!

The Wikimedia Foundation and Kaggle are collaborating to release a beta version of structured datasets from Wikipedia in both French and English. This data, specifically formatted for machine learning, is perfect for data science training and development. Kaggle, home to over 461,000 publicly accessible datasets, provides a rich resource for researchers, students, and machine learning practitioners. This collaboration ensures data quality and provenance, and we're excited to see what people build with it.

AI