Datasets for Machine Learning and Deep Learning
- 01. The UC Irvine Machine Learning Repository
- 02. GitHub - A list of 650+ datasets available via public APIs
- 03. GitHub - These datasets cover a broad range of applications and include binary and multi-class classification problems as well as regression problems
- 04. 3,250 Machine Learning Datasets
- 05. OpenML - Thousands of Data Sets for Better Machine Learning
- 06. VisualData Discovery - Best place to find and share computer vision datasets
- 07. Roboflow - Computer Vision Datasets
- 08. Kaggle datasets: Search engine for machine learning datasets
- 09. Jupyter datasets: A list of commonly available datasets and data search engines
- 10. IBM Data Asset eXchange - Explore useful and relevant data sets for enterprise data science
- 11. Dataset Search in Google
- 12. GitHub - Awesome Public Datasets
- 13. DBpedia is a community-driven initiative that extracts structured content from the resources of Wikimedia projects. It forms an open knowledge graph that can be used to enrich datasets.
- 14. WordNet is a large lexical database of English. Nouns, verbs, adjectives and adverbs are grouped into sets of cognitive synonyms (synsets), each expressing a distinct concept.
- 15. FactForge.net is a hub of Linked Open Data (LOD) and news articles about people, organizations and locations
- 16. World Facts - A database of information about nations, languages, currencies, and similar topics.
- 17. GLEIF – Global Legal Entity Identifier Foundation - The global online source for open, standardized and high-quality legal entity reference data
- 18. ConceptNet is a freely-available semantic network, designed to help computers understand the meanings of words that people use.
- 19. ImageNet is an image database organized according to the WordNet hierarchy (currently only the nouns), in which each node of the hierarchy is depicted by hundreds and thousands of images.
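
The WordNet and ImageNet entries above both rely on the idea of a synset hierarchy, which may be easier to picture with a small example. The following is a minimal sketch in plain Python, not the actual WordNet or ImageNet API: the `Synset` class, the helper functions, the concept names, and the image file names are all hypothetical toy data chosen for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Synset:
    """A toy synset: a set of synonymous words that express one concept."""
    name: str
    words: frozenset
    hypernym: "Synset | None" = None            # parent (more general) concept
    images: list = field(default_factory=list)  # ImageNet-style images for this node


def hypernym_path(synset):
    """Return the concept names from the given synset up to its root."""
    path = []
    node = synset
    while node is not None:
        path.append(node.name)
        node = node.hypernym
    return path


def synsets_for(word, synsets):
    """Return the names of all synsets that contain the given word."""
    return [s.name for s in synsets if word in s.words]


# Hypothetical toy data; real WordNet and ImageNet are far larger.
animal = Synset("animal", frozenset({"animal", "creature"}))
canine = Synset("canine", frozenset({"canine", "canid"}), hypernym=animal)
dog = Synset("dog", frozenset({"dog", "domestic dog"}), hypernym=canine,
             images=["dog_0001.jpg", "dog_0002.jpg"])
food = Synset("food", frozenset({"food"}))
hot_dog = Synset("hot_dog", frozenset({"hot dog", "frankfurter", "dog"}),
                 hypernym=food, images=["hot_dog_0001.jpg"])

ALL_SYNSETS = [animal, canine, dog, food, hot_dog]

print(hypernym_path(dog))               # walks dog -> canine -> animal
print(synsets_for("dog", ALL_SYNSETS))  # one word can belong to several synsets
```

In this sketch a word such as "dog" maps to more than one synset because each synset stands for a distinct concept, and attaching an image list to each node mirrors how ImageNet associates images with noun synsets in the hierarchy.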