As I’m trying to learn more about machine learning, I spent some time looking for data that I can use. While GitHub is the place to get open source code, there doesn’t seem to be a counterpart for open data. Below are a couple of websites that help with finding data.
In Bluemix, the Analytics Exchange gives you access to free and open data in categories such as economy and business, leisure, and transportation. The screenshot shows a sample dataset that contains reviews from Airbnb.
Several other websites also help with finding datasets. Unfortunately, a lot of datasets are not open, so don’t forget to check the licenses before you use them.
mldata.org provides a repository with a lot of datasets that can be used for machine learning.