1, Kaggle datasets
Kaggle data set address
https://www.kaggle.com/datasets
Each data set corresponds to a small community, where you can discuss the data and look for common code, or in which to create their own projects. This contains a number of different types, different structures of content data sets. Meanwhile, also in which you can get to the data associated with each data set, which contains a set of data analysis notes provided by many scientists and other data.
2, Amazon datasets
AWS Open Data Address
https://registry.opendata.aws/
This data set contains the data content in different areas, such as: public transportation, ecological resources, satellite imagery. While providing a search function to help users find the information needed to describe the data set, there are a variety of data sets and use cases, very easy to use.
Set of data stored in the Amazon Web Services (AWS) resources, for the use of machine learning to build their own experiments AWS users, the transmission speed will be very blocks.
3, UCI machine learning datasets
UCI datasets Address:
https://archive.ics.uci.edu/ml/datasets.html
This dataset comes from the School of Information and Computer Science, University of California, which contains more than 100 data sets. According to the type of machine learning problem of classification of data sets, can be found in univariate or multivariate time series data sets, and classification, regression or recommended system data sets.
4, Google search engine data collection
Search engine Google data collection
https://toolbox.google.com/datasetsearch
In late 2018, Google launched a set of data search services. This is a search engine can search by the name of the data set, the goal is to provide a unified search portal for the tens of thousands of different sets of data repositories, very easy to use.
5, Microsoft datasets
In July 2018, Microsoft and the research community with the outside world, Microsoft released a research and development data.
Microsoft datasets Address:
https://msropendata.com/
It includes a cloud server data store, dedicated to promoting collaborative global research community, and which provides a series of published studies for the content of the data set.
6, Awesome open data sets favorites list
Awesom Public Datasets
https://github.com/awesomedata/awesome-public-datasets
This data set list, compiled by subject large data sets, for example: biology, economics, education and so on. Most of the data sets listed are free, but before using any data set, the data set required to check licensing requirements.
7, government data sets
Many countries have provided the government data sets available to the public in a variety of content on the network, such as:
European governments datasets
https://data.europa.eu/euodp/data/dataset
US government data sets
https://www.data.gov/
New Zealand government datasets
https://catalogue.data.govt.nz/dataset
Indian government datasets
https://data.gov.in/
Northern Ireland public data sets
https://www.opendatani.gov.uk/
8, VisualData datasets
VisualData data set
https://www.visualdata.io/
Visual data contains some excellent data set used to build the model of computer vision, users can query by a CV topics such as semantic segmentation, image title, image generation, automatic driving cars and other content.