Awesome Public Datasets
This is a list of topic-centric public data sources in high quality. They are collected and tidied from blogs, answers, and user responses. Most of the data sets listed below are free, however, some are not. This project was incubated at OMNILab, Shanghai Jiao Tong University during Xiaming Chen's Ph.D. studies. OMNILab is now part of the BaiYuLan Open AI community.
Other amazingly awesome lists can be found in sindresorhus's awesome list.
Agriculture
The global dataset of historical yields for major crops
1981–2016 - The Global Dataset of
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Agriculture/Global-dataset-of-historical-yields-for-major-crops.yml)\]
Hyperspectral benchmark dataset on soil moisture - This dataset was
measured in a five-day
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Agriculture/Hyperspectral-Benchmark-Dataset-On-Soil-Moisture.yml)\]
Lemons quality control dataset - Lemon dataset has been prepared to
investigate the
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Agriculture/Lemon-Dataset.yml)\]
Optimized Soil Adjusted Vegetation Index - The IDB is a tool for
working with remote sensing
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Agriculture/Optimized%20Soil%20Adjusted%20Vegetation%20Index)\]
U.S. Department of Agriculture's Nutrient Database - USDA National
Nutrient Database for
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Agriculture/U.S.-Department-of-Agricultures-Nutrient-Database.yml)\]
U.S. Department of Agriculture's PLANTS Database - The Complete
PLANTS Checklist is nearly 7
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Agriculture/U.S.-Department-of-Agricultures-PLANTS-Database.yml)\]
Architecture
Swiss Apartment Models - This dataset contains detailed data on
42,207 apartments (242,257
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Architecture/appartment-models.yml)\]
Biology
1000 Genomes - The 1000 Genomes Project ran between 2008 and 2015,
creating the largest
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/1000-Genomes.yml)\]
ANHIR - Automatic Non-rigid Histological Image Registration (ANHIR)
consists of 2D \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/ANHIR.yml)\]
American Gut (Microbiome Project) - The American Gut project is the
largest crowdsourced
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/American-Gut-Microbiome-Project.yml)\]
BCNB - There are WSIs of 1058 patients, part of tumor regions are
annotated in WSIs. Except
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/BCNB.yml)\]
Broad Bioimage Benchmark Collection (BBBC) - The Broad Bioimage
Benchmark Collection (BBBC)
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/Broad-Bioimage-Benchmark-Collection-BBBC.yml)\]
Broad Cancer Cell Line Encyclopedia
(CCLE)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/Broad-Cancer-Cell-Line-Encyclopedia-CCLE.yml)\]
CIMA - CIMA dataset includes images of 2D histological microscopy
tissue slices.
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/CIMA.yml)\]
Cell Image Library - This library is a public and easily accessible
resource database of \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/Cell-Image-Library.yml)\]
Complete Genomics Public Data - A diverse data set of whole human
genomes are freely
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/Complete-Genomics-Public-Data.yml)\]
CytoImageNet - A large-scale dataset of microscopy images. Contains
890,737 total grayscale
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/CytoImageNet.yml)\]
EBI ArrayExpress - ArrayExpress Archive of Functional Genomics Data
stores data from high- \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/EBI-ArrayExpress.yml)\]
EBI Protein Data Bank in Europe - The Electron Microscopy Data Bank
(EMDB) is a public \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/EBI-Protein-Data-Bank-in-Europe.yml)\]
ENCODE project - The Encyclopedia of DNA Elements (ENCODE)
Consortium is an ongoing \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/ENCODE-project.yml)\]
Electron Microscopy Pilot Image Archive (EMPIAR) - EMPIAR, the
Electron Microscopy Public
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/Electron-Microscopy-Pilot-Image-Archive-EMPIAR.yml)\]
Ensembl Genomes
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/Ensembl-Genomes.yml)\]
Gene Expression Omnibus (GEO) - GEO is a public functional genomics
data repository \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/Gene-Expression-Omnibus-GEO.yml)\]
Gene Ontology (GO) - GO annotation
files
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/Gene-Ontology-GO.yml)\]
Global Biotic Interactions
(GloBI)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/Global-Biotic-Interactions-GloBI.yml)\]
Harvard Medical School (HMS) LINCS Project - The Harvard Medical
School (HMS) LINCS Center is \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/Harvard-Medical-School-LINCS-Project.yml)\]
Human Genome Diversity Project - A group of scientists at Stanford
University have \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/Human-Genome-Diversity-Project.yml)\]
Human Microbiome Project (HMP) - The HMP sequenced over 2000
reference genomes isolated from
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/Human-Microbiome-Project-HMP.yml)\]
ICOS PSP Benchmark - The ICOS PSP benchmarks repository contains an
adjustable real-world
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/ICOS-PSP-Benchmark.yml)\]
International HapMap
Project
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/International-HapMap-Project.yml)\]
Journal of Cell Biology DataViewer - All JCB data was moved to
Biostudies
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/Journal-of-Cell-Biology-DataViewer.yml)\]
KEGG - KEGG is a database resource for understanding high-level
functions and utilities of \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/KEGG.yml)\]
NCBI
Proteins
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/NCBI-Proteins.yml)\]
NCBI Taxonomy - The NCBI Taxonomy database is a curated set of
names and classifications for
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/NCBI-Taxonomy.yml)\]
NCI Genomic Data Commons - The GDC Data Portal is a robust
data-driven platform that allows
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/NCI-Genomic-Data-Commons.yml)\]
NIH Microarray
data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/NIH-Microarray-data.yml)\]
OpenSNP genotypes data - openSNP allows customers of
direct-to-customer genetic tests to \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/OpenSNP-genotypes-data.yml)\]
Palmer Penguins - The goal of palmerpenguins is to provide a great
dataset for data
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/Palmer-Penguins.yml)\]
Pathguid - Protein-Protein Interactions
Catalog
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/Pathguid.yml)\]
Protein Data Bank - This resource is powered by the Protein Data
Bank archive-information \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/Protein-Data-Bank.yml)\]
Psychiatric Genomics Consortium - The purpose of the Psychiatric
Genomics Consortium (PGC) is
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/Psychiatric-Genomics-Consortium.yml)\]
PubChem Project - PubChem is the world's largest collection of
freely accessible chemical
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/PubChem-Project.yml)\]
PubGene (now Coremine Medical) - COREMINE™ is a family of tools
developed by the Norwegian \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/PubGene-now-Coremine-Medical.yml)\]
Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) - COSMIC,
the Catalogue Of Somatic
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/Sanger-Catalogue-of-Somatic-Mutations-in-Cancer-COSMIC.yml)\]
Sanger Genomics of Drug Sensitivity in Cancer Project
(GDSC)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/Sanger-Genomics-of-Drug-Sensitivity-in-Cancer-Project-GDSC.yml)\]
Sequence Read Archive(SRA) - The Sequence Read Archive (SRA) stores
raw sequence data from
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/Sequence-Read-ArchiveSRA.yml)\]
Serratus - Analysis of 7.1 million RNA/DNA sequencing datasets to
discover the total
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/Serratus-Open-Virome.yml)\]
Stanford Microarray Data (Retired NOW)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/Stanford-Microarray-Data.yml)\]
Stowers Institute Original Data
Repository
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/Stowers-Institute-Original-Data-Repository.yml)\]
Systems Science of Biological Dynamics (SSBD) Database - Systems
Science of Biological \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/Systems-Science-of-Biological-Dynamics-SSBD-Database.yml)\]
The Cancer Genome Atlas (TCGA), available via Broad
GDAC
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/The-Cancer-Genome-Atlas-TCGA-available-via-Broad-GDAC.yml)\]
The Catalogue of Life - The Catalogue of Life is a quality-assured
checklist of more than 1.8
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/The-Catalogue-of-Life.yml)\]
The Personal Genome Project - The Personal Genome Project,
initiated in 2005, is a vision and
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/The-Personal-Genome-Project.yml)\]
UCSC Public Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/UCSC-Public-Data.yml)\]
UniGene
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/UniGene.yml)\]
Universal Protein Resource (UnitProt) - The Universal Protein
Resource (UniProt) is a \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/Universal-Protein-Resource.yml)\]
Rfam - The Rfam database is a collection of RNA families, each
represented by multiple
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Biology/rfam.yml)\]
Chemistry
Ionic Liquids Database -
ILThermo
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Chemistry/ionicliquids.yml)\]
Climate+Weather
Actuaries Climate Index
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Climate+Weather/Actuaries-Climate-Index.yml)\]
Australian Weather - Updated webpage for Australian Weather
data.
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Climate+Weather/Australian-Weather.yml)\]
Aviation Weather Center - Consistent, timely and accurate weather
information for the world
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Climate+Weather/Aviation-Weather-Center.yml)\]
Brazilian Weather - Historical data (In Portuguese) - Data related
to climate and weather
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Climate+Weather/Brazilian-Weather.yml)\]
Several Climate Datasets - The C3S Climate Data Store (CDS) is a
one-stop shop for
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Climate+Weather/CDS.yml)\]
Canadian Meteorological
Centre
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Climate+Weather/Canadian-Meteorological-Centre.yml)\]
Caravan - a dataset for large-sample hydrology - Caravan is an open
community dataset of
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Climate+Weather/Caravan.yml)\]
Climate Data from UEA (updated
monthly)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Climate+Weather/Climate-Data-from-UEA-updated-monthly.yml)\]
Dutch Weather - The KNMI Data Center (KDC) portal provides access
to KNMI data on weather, \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Climate+Weather/Dutch-Weather.yml)\]
European Climate Assessment & Dataset
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Climate+Weather/European-Climate-Assessment-&-Dataset.yml)\]
German Climate Data Center
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Climate+Weather/German-Meteorological-Service-CDC.yml)\]
Global Climate Data Since 1929
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Climate+Weather/Global-Climate-Data-Since-1929.yml)\]
Charting The Global Climate Change News Narrative 2009-2020 - These
four datasets represent
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Climate+Weather/GlobalClimateChangeNewsNarrative2009-2020.yml)\]
NASA Global Imagery Browse
Services
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Climate+Weather/NASA-Global-Imagery-Browse-Services.yml)\]
NOAA Bering Sea Climate
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Climate+Weather/NOAA-Bering-Sea-Climate.yml)\]
NOAA Climate
Datasets
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Climate+Weather/NOAA-Climate-Datasets.yml)\]
NOAA Realtime Weather
Models
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Climate+Weather/NOAA-Realtime-Weather-Models.yml)\]
NOAA SURFRAD Meteorology and Radiation
Datasets
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Climate+Weather/NOAA-SURFRAD-Meteorology-and-Radiation-Datasets.yml)\]
Open-Meteo - Open-Source Weather API - Open-source weather API with
free access for non- \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Climate+Weather/Open-Meteo.yml)\]
The World Bank Open Data Resources for Climate
Change
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Climate+Weather/The-World-Bank-Open-Data-Resources-for-Climate-Change.yml)\]
UEA Climatic Research Unit
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Climate+Weather/UEA-Climatic-Research-Unit.yml)\]
WU Historical Weather
Worldwide
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Climate+Weather/WU-Historical-Weather-Worldwide.yml)\]
Wahington Post Climate Change - To analyze warming temperatures in
the United States, The
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Climate+Weather/Washington%20Post%20Climate%20Change.yml)\]
WorldClim - Global Climate Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Climate+Weather/WorldClim.yml)\]
ComplexNetworks
AMiner Citation Network Dataset
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComplexNetworks/AMiner-Citation-Network-Dataset.yml)\]
CrossRef DOI URLs
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComplexNetworks/CrossRef-DOI-URLs.yml)\]
DBLP Citation
dataset
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComplexNetworks/DBLP-Citation-dataset.yml)\]
DIMACS Road Networks
Collection
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComplexNetworks/DIMACS-Road-Networks-Collection.yml)\]
NBER Patent Citations
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComplexNetworks/NBER-Patent-Citations.yml)\]
NIST complex networks data
collection
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComplexNetworks/NIST-complex-networks-data-collection.yml)\]
Network Repository with Interactive Exploratory Analysis
Tools
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComplexNetworks/Network-Repository-with-Interactive-Exploratory-Analysis-Tools.yml)\]
Protein-protein interaction
network
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComplexNetworks/Protein.yml)\]
PyPI and Maven Dependency
Network
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComplexNetworks/PyPI-and-Maven-Dependency-Network.yml)\]
Scopus Citation
Database
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComplexNetworks/Scopus-Citation-Database.yml)\]
Small Network Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComplexNetworks/Small-Network-Data.yml)\]
Stanford GraphBase
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComplexNetworks/Stanford-GraphBase-Steven-Skiena.yml)\]
Stanford Large Network Dataset
Collection
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComplexNetworks/Stanford-Large-Network-Dataset-Collection.yml)\]
Stanford Longitudinal Network Data
Sources
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComplexNetworks/Stanford-Longitudinal-Network-Data-Sources.yml)\]
The Koblenz Network Collection
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComplexNetworks/The-Koblenz-Network-Collection.yml)\]
The Laboratory for Web Algorithmics
(UNIMI)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComplexNetworks/The-Laboratory-for-Web-Algorithmics-UNIMI.yml)\]
UCI Network Data
Repository
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComplexNetworks/UCI-Network-Data-Repository.yml)\]
UFL sparse matrix
collection
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComplexNetworks/UFL-sparse-matrix-collection.yml)\]
WSU Graph Database
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComplexNetworks/WSU-Graph-Database.yml)\]
Community Resource for Archiving Wireless Data At Dartmouth -
Contains datasets of pcap files \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComplexNetworks/crawdad.yml)\]
ComputerNetworks
3.5B Web Pages from CommonCrawl
2012
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComputerNetworks/3.5B-Web-Pages-from-CommonCrawl-2012.yml)\]
53.5B Web clicks of 100K users in Indiana
Univ.
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComputerNetworks/53.5B-Web-clicks-of-100K-users-in-Indiana-Univ..yml)\]
CAIDA Internet Datasets
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComputerNetworks/CAIDA-Internet-Datasets.yml)\]
CRAWDAD Wireless datasets from Dartmouth
Univ.
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComputerNetworks/CRAWDAD-Wireless-datasets-from-Dartmouth-Univ..yml)\]
ClueWeb09 - 1B web pages
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComputerNetworks/ClueWeb09.yml)\]
ClueWeb12 - 733M web pages
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComputerNetworks/ClueWeb12.yml)\]
CommonCrawl Web Data over 7
years
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComputerNetworks/CommonCrawl-Web-Data-over-7-years.yml)\]
Shopper Intent Prediction from Clickstream E‑Commerce Data with
Minimal Browsing
Information
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComputerNetworks/Coveo-Shopper-Intent-Prediction.yaml)\]
Criteo click-through
data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComputerNetworks/Criteo-click-through-data.yml)\]
Internet-Wide Scan Data Repository
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComputerNetworks/Internet-Wide-Scan-Data-Repository.yml)\]
MIRAGE-2019 - MIRAGE-2019 is a human-generated dataset for mobile
traffic analysis with
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComputerNetworks/MIRAGE-2019.yml)\]
Merklemap DNS records Dataset - Contains 4B+ DNS records accross
700 million unique
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComputerNetworks/Merklemap-DNS-Records-dataset.yml)\]
OONI: Open Observatory of Network Interference - Internet
censorship data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComputerNetworks/OONI-Open-Observatory-of-Network-Interference.yml)\]
Open Mobile Data by
MobiPerf
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComputerNetworks/Open-Mobile-Data-by-MobiPerf.yml)\]
The Peer-to-Peer Trace Archive - Real-world measurements play a key
role in studying the \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComputerNetworks/P2P-Trace-Archive.yml)\]
Rapid7 Sonar Internet Scans
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComputerNetworks/Rapid7-Sonar-Internet-Scans.yml)\]
UCSD Network Telescope, IPv4 /8
net
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ComputerNetworks/UCSD-Network-Telescope-IPv4-slash8-net.yml)\]
CyberSecurity
CCCS-CIC-AndMal-2020 - The dataset includes 200K benign and 200K
malware samples totalling to
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//CyberSecurity/CCCS-CIC-AndMal-2020.yml)\]
Traffic and Log Data Captured During a Cyber Defense Exercise -
This dataset was acquired
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//CyberSecurity/Traffic-and-Log-Data-Captured-During-a-Cyber-Defense-Exercise.yml)\]
DataChallenges
AIcrowd Competitions
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//DataChallenges/AIcrowd-Competitions.yml)\]
Bruteforce
Database
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//DataChallenges/Bruteforce-Database.yml)\]
Challenges in Machine Learning
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//DataChallenges/Challenges-in-Machine-Learning.yml)\]
CrowdANALYTIX dataX
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//DataChallenges/CrowdANALYTIX-dataX.yml)\]
D4D Challenge of Orange
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//DataChallenges/D4D-Challenge-of-Orange.yml)\]
DrivenData Competitions for Social
Good
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//DataChallenges/DrivenData-Competitions-for-Social-Good.yml)\]
ICWSM Data Challenge (since
2009)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//DataChallenges/ICWSM-Data-Challenge-since-2009.yml)\]
KDD Cup by Tencent 2012
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//DataChallenges/KDD-Cup-by-Tencent-2012.yml)\]
Kaggle Competition Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//DataChallenges/Kaggle-Competition-Data.yml)\]
Localytics Data Visualization
Challenge
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//DataChallenges/Localytics-Data-Visualization-Challenge.yml)\]
Netflix
Prize
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//DataChallenges/Netflix-Prize.yml)\]
Space Apps Challenge
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//DataChallenges/Space-Apps-Challenge.yml)\]
Telecom Italia Big Data
Challenge
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//DataChallenges/Telecom-Italia-Big-Data-Challenge.yml)\]
TravisTorrent Dataset - MSR'2017 Mining
Challenge
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//DataChallenges/TravisTorrent-Dataset.yml)\]
TunedIT - Data mining & machine learning data sets, algorithms,
challenges
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//DataChallenges/TunedIT.yml)\]
Yelp Dataset Challenge - The Yelp dataset is a subset of our
businesses, reviews, and user \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//DataChallenges/Yelp-Dataset-Challenge.yml)\]
EarthScience
38-Cloud (Cloud Detection) - Contains 38 Landsat 8 scene images and
their manually extracted
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//EarthScience/38-Cloud.yml)\]
AQUASTAT - Global water resources and
uses
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//EarthScience/AQUASTAT.yml)\]
BODC - marine data of ~22K vars
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//EarthScience/BODC.yml)\]
EOSDIS - NASA's earth observing system
data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//EarthScience/EOSDIS.yml)\]
Earth Models
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//EarthScience/Earth-Models.yml)\]
Global Wind Atlas - The Global Wind Atlas is a free, web-based
application developed to help
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//EarthScience/Global-Wind-Atlas.yml)\]
Integrated Marine Observing System (IMOS) - roughly 30TB of ocean
measurements
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//EarthScience/Integrated-Marine-Observing-System-IMOS.yml)\]
Marinexplore - Open Oceanographic Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//EarthScience/Marinexplore.yml)\]
Alabama Real-Time Coastal Observing System
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//EarthScience/MyMobileBay.yml)\]
National Estuarine Research Reserves System-Wide Monitoring
Program - long-term estuarine \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//EarthScience/NERRS-SWMP.yml)\]
Oil and Gas Authority Open Data - The dataset covers 12,500
offshore wellbores, 5,000 seismic
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//EarthScience/Oil-and-Gas-Authority-UK.yml)\]
Radiance GeoJSON — Global Light Pollution - Global nighttime
light pollution dataset derived
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//EarthScience/Radiance-GeoJSON-Light-Pollution.yml)\]
Smithsonian Institution Global Volcano and Eruption
Database
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//EarthScience/Smithsonian-Institution-Global-Volcano-and-Eruption-Database.yml)\]
USGS Earthquake
Archives
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//EarthScience/USGS-Earthquake-Archives.yml)\]
Wellhead Protection Area (protection zone) prediction using
breakthrough curves - This
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//EarthScience/WHPA.yml)\]
Economics
Asian Productivity Organization (APO) - The AEPM provides a graphic
dashboard view of
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/APO.yml)\]
ASEAN Stats - The ASEANstatsDataPortal was first launched in
June 2018. The Portal is \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/ASEAN%20Stats.yml)\]
American Economic Association
(AEA)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/American-Economic-Association-AEA.yml)\]
Asian KLEMS - Asia KLEMS is an Asian regional research consortium
to promote building
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/Asian%20KLEMS.yml)\]
Harvard Atlas of Economic Complexity - A database for people to
explore global trade flows
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/Atlas%20Economic%20Complexity.yml)\]
BIS Financial Database - The files contain the same data as in the
BIS Statistics Explorer
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/BIS%20Financial%20Database.yml)\]
Barro-Lee Education Attainment - Barro-Lee Educational Attainment
Data from 1950 to 2010. \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/Barro%20Lee.yml)\]
CEPII Database - A database of the world economy, through its
country and region profiles, in
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/CEPII%20Database.yml)\]
EUKLEMS - EU KLEMS is an industry level, growth and productivity
research project. EU KLEMS \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/EUKLEMS.yml)\]
Economic Freedom of the World
Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/Economic-Freedom-of-the-World-Data.yml)\]
Historical National Accounts - The datahub on Comparative
Historical National Accounts
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/Historical%20National%20Accounts.yml)\]
Historical MacroEconomic
Statistics
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/Historical-MacroEconomic-Statistics.yml)\]
INFORUM - Interindustry Forecasting at the University of
Maryland
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/INFORUM.yml)\]
DBnomics – the world's economic database - Aggregates hundreds of
millions of time series \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/International-Economics-Database.yml)\]
International Trade Statistics - The new link contains trade based
on filtered search on the
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/International-Trade-Statistics.yml)\]
Internet Product Code Database
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/Internet-Product-Code-Database.yml)\]
Joint External Debt Data Hub
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/Joint-External-Debt-Data-Hub.yml)\]
Jon Haveman International Trade Data
Links
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/Jon-Haveman-International-Trade-Data-Links.yml)\]
Latin America KLEMS - LAKLEMS is a technical cooperation project
financed by the Inter- \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/LA%20KLEMS.yml)\]
Long-Term Productivity Database - The Long-Term Productivity
database was created as a
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/Long-Term-Productivity-Database.yml)\]
Maddison Project Database - The Maddison Project Database provides
information on comparative
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/Maddison%20Project.yml)\]
National Transfer Accounts - The goal of the National Transfer
Accounts (NTA) project is to
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/NTA.yml)\]
OpenCorporates Database of Companies in the
World
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/OpenCorporates-Database-of-Companies-in-the-World.yml)\]
Our World in Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/Our-World-in-Data.yml)\]
Penn World Table - PWT version 10.0 is a database with information
on relative levels of
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/Penn%20World%20Table.yml)\]
SciencesPo World Trade Gravity
Datasets
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/SciencesPo-World-Trade-Gravity-Datasets.yml)\]
The Atlas of Economic Complexity
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/The-Atlas-of-Economic-Complexity.yml)\]
The Center for International Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/The-Center-for-International-Data.yml)\]
The Observatory of Economic
Complexity
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/The-Observatory-of-Economic-Complexity.yml)\]
UN Commodity Trade Statistics
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/UN-Commodity-Trade-Statistics.yml)\]
UN Human Development Reports
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/UN-Human-Development-Reports.yml)\]
World Input-Output Database - World Input-Output Tables and
underlying data, covering 43
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/World%20Input-Output%20Database.yml)\]
World KLEMS - Analytical KLEMS-type data sets for a broad set of
countries around the world.
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Economics/World%20KLEMS.yml)\]
Education
College Scorecard Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Education/College-Scorecard-Data.yml)\]
New York State Education Department Data - The New York State
Education Department (NYSED) is
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Education/New-York-State-Education-Department.yml)\]
Program for International Student Assessement (PISA) - Contains
15-year-old students' \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Education/PISA.yml)\]
Student Data from Free Code
Camp
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Education/Student-Data-from-Free-Code-Camp.yml)\]
Energy
AMPds - The Almanac of Minutely Power dataset
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Energy/AMPds.yml)\]
BLUEd - Building-Level fUlly labeled Electricity Disaggregation
dataset
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Energy/BLUEd.yml)\]
COMBED
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Energy/COMBED.yml)\]
DBFC - Direct Borohydride Fuel Cell (DBFC)
Dataset
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Energy/DBFC.yml)\]
DEL - Domestic Electrical Load study datsets for South Africa
(1994 -
2014)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Energy/DEL.yml)\]
ECO - The ECO data set is a comprehensive data set for
non-intrusive load monitoring and
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Energy/ECO.yml)\]
EIA
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Energy/EIA.yml)\]
Global Power Plant Database - The Global Power Plant Database is a
comprehensive, open source
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Energy/Global%20Power%20Plant%20Database.yml)\]
HES - Household Electricity Study,
UK
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Energy/HES.yml)\]
HFED
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Energy/HFED.yml)\]
MORED: a Moroccan Buildings’ Electricity Consumption Dataset -
Since spring of 2019, a data
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Energy/MORED.yml)\]
Marktstammdatenregister - The German Marktstammdatenregister
(MaStR) is a database of all
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Energy/MaStR.yml)\]
PEM1 - Proton Exchange Membrane (PEM) Fuel Cell
Dataset
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Energy/PEM1.yml)\]
PLAID - The Plug Load Appliance Identification
Dataset
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Energy/PLAID.yml)\]
The Public Utility Data Liberation Project (PUDL) - PUDL makes US
energy data easier to
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Energy/PUDL.yml)\]
REDD
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Energy/REDD.yml)\]
SYND - A synthetic energy dataset for non-intrusive load
monitoring - With SynD, we present a
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Energy/SYND.yml)\]
Smart Meter Data Portal - The Smart Meter Data Portal is part of
the National Science
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Energy/Smart%20Meter%20Data%20Portal.yml)\]
Tracebase
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Energy/Tracebase.yml)\]
Ukraine Energy Centre
Datasets
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Energy/UDEC.yml)\]
UK-DALE - UK Domestic Appliance-Level
Electricity
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Energy/UK-DALE.yml)\]
WHITED
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Energy/WHITED.yml)\]
iAWE
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Energy/iAWE.yml)\]
Entertainment
Top Streamers on Twitch - This contains data of Top 1000 Streamers
from past year.
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Entertainment/TwitchStreamersData.yml)\]
Finance
BIS Statistics - BIS statistics, compiled in cooperation with
central banks and other
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Finance/BIS%20Statistics.yml)\]
Blockmodo Coin Registry - A registry of JSON formatted information
files that is primarily
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Finance/Blockmodo-Coin-Registry)\]
CBOE Futures Exchange
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Finance/CBOE-Futures-Exchange.yml)\]
Complete FAANG Stock data - This data set contains all the stock
data of FAANG companies from
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Finance/FAANG-StockData.yml)\]
Google Finance
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Finance/Google-Finance.yml)\]
Google
Trends
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Finance/Google-Trends.yml)\]
NASDAQ
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Finance/NASDAQ.yml)\]
NYSE Market Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Finance/NYSE-Market-Data.yml)\]
OANDA
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Finance/OANDA.yml)\]
OSU Financial data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Finance/OSU-Financial-data.yml)\]
Quandl
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Finance/Quandl.yml)\]
SEC EDGAR - EDGAR, the Electronic Data Gathering, Analysis, and
Retrieval system, is the \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Finance/SEC-EDGAR.yml)\]
St Louis Federal
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Finance/St-Louis-Federal.yml)\]
Yahoo Finance
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Finance/Yahoo-Finance.yml)\]
GIS
Awesome 3D Semantic City Models - Collection of open 3D semantic
city and region models.
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/3D-Semantic-City-Models.yml)\]
ArcGIS Open Data portal
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/ArcGIS-Open-Data-portal.yml)\]
Cambridge, MA, US, GIS data on
GitHub
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/Cambridge-MA-US-GIS-data-on-GitHub.yml)\]
Database of all continents, countries,
States/Subdivisions/Provinces and Cities - Database
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/Database-of-Continents-Coutries-States-Cities.yml)\]
Factual Global Location
Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/Factual-Global-Location-Data.yml)\]
IEEE Geoscience and Remote Sensing Society DASE
Website
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/GRSS-DASE-Website.yml)\]
Geo Maps - High Quality GeoJSON maps programmatically
generated
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/Geo-Maps.yml)\]
Geo Spatial Data from ASU
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/Geo-Spatial-Data-from-ASU.yml)\]
Geo Wiki Project - Citizen-driven Environmental
Monitoring
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/Geo-Wiki-Project.yml)\]
GeoFabrik - OSM data extracted to a variety of formats and
areas
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/GeoFabrik.yml)\]
GeoNames Worldwide
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/GeoNames-Worldwide.yml)\]
Global Administrative Areas Database (GADM) - Geospatial data
organized by country. Includes \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/Global-Administrative-Areas-Database-GADM.yml)\]
Homeland Infrastructure Foundation-Level
Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/Homeland-Infrastructure-Foundation.yml)\]
Landsat 8 on AWS
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/Landsat-8-on-AWS.yml)\]
List of all countries in all
languages
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/List-of-all-countries-in-all-languages.yml)\]
National Weather Service GIS Data
Portal
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/National-Weather-Service-GIS-Data-Portal.yml)\]
Natural Earth - vectors and rasters of the
world
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/Natural-Earth.yml)\]
OpenAddresses
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/OpenAddresses.yml)\]
OpenStreetMap
(OSM)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/OpenStreetMap-OSM.yml)\]
Pleiades - Gazetteer and graph of ancient
places
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/Pleiades.yml)\]
Reverse Geocoder using OSM
data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/Reverse-Geocoder-using-OSM-data.yml)\]
Robin Wilson - Free GIS Datasets
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/Robin-Wilson-Free-GIS-Datasets.yml)\]
Shadow Accrual Maps - The repository contains the accumulated
shadow information for New York
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/Shadow-Accrual-Maps.yml)\]
TIGER/Line - U.S. boundaries and
roads
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/TIGER-Line.yml)\]
TZ Timezones shapefile
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/TZ-Timezones-shapfiles.yml)\]
TwoFishes - Foursquare's coarse
geocoder
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/TwoFishes.yml)\]
UN Environmental Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/UN-Environmental-Data.yml)\]
World boundaries from the U.S. Department of
State
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/World-boundaries-from--the-U.S.-Department-of-State.yml)\]
World countries in multiple
formats
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/World-countries-in-multiple-formats.yml)\]
MAP-VERSE - MAP usability - Validated Empirical Research by
Systematic Evaluation - A curated
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//GIS/map-verse.yml)\]
Government
Alberta, Province of Canada
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Alberta-Province-of-Canada.yml)\]
Antwerp, Belgium
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Antwerp-Belgium.yml)\]
Argentina (non official)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Argentina-non-official.yml)\]
Datos Argentina - Portal de datos abiertos de la República
Argentina. Encontrá datos públicos \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Argentina.yml)\]
Austin, TX, US
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Austin-TX-US.yml)\]
Australia
(abs.gov.au)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Australia-abs.gov.au.yml)\]
Australia (data.gov.au)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Australia-data.gov.au.yml)\]
Austria (data.gv.at)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Austria-data.gv.at.yml)\]
Baton Rouge, LA, US
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Baton-Rouge-LA-US.yml)\]
Beersheba, Israel - Open Data Portal (Smart7
OpenData)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Beersheba-Israel.yml)\]
Belgium
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Belgium.yml)\]
City of Berkeley Open Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Berkeley-CA-Open-Data.yml)\]
Brazil
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Brazil.yml)\]
Buenos Aires, Argentina
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Buenos-Aires-Argentina.yml)\]
Calgary, AB, Canada
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Calgary-AB-Canada.yml)\]
Cambridge, MA, US
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Cambridge-MA-US.yml)\]
Canada
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Canada.yml)\]
Chicago
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Chicago.yml)\]
Chile
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Chile.yml)\]
China
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/China)\]
Dallas Open Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Dallas-Open-Data.yml)\]
DataBC - data from the Province of British
Columbia
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/DataBC.yml)\]
Debt to the Penny - The Debt to the Penny dataset provides
information about the total
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Debt-to-penny.yml)\]
Denver Open Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Denver-Open-Data.yml)\]
Durham, NC Open Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Durham-NC-Open-Data.yml)\]
Edmonton, AB, Canada
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Edmonton-AB-Canada.yml)\]
England LGInform
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/England-LGInform.yml)\]
EuroStat
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/EuroStat.yml)\]
EveryPolitician - Ongoing project collating and sharing data on
every politician.
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/EveryPolitician.yml)\]
Federal Committee on Statistical Methodology (FCSM) (formerly
FedStats)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/FedStats.yml)\]
Finland
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Finland.yml)\]
France
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/France.yml)\]
Fredericton, NB,
Canada
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Fredericton-NB-Canada.yml)\]
Gatineau, QC,
Canada
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Gatineau-QC-Canada.yml)\]
Germany
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Germany.yml)\]
Ghent, Belgium
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Ghent-Belgium.yml)\]
Glasgow, Scotland, UK
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Glasgow-Scotland-UK.yml)\]
Greece
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Greece.yml)\]
Guardian world
governments
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Guardian-world-governments.yml)\]
Halifax, NS, Canada
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Halifax-NS-Canada.yml)\]
Helsinki Region, Finland
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Helsinki-Region-Finland.yml)\]
Hong Kong, China
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Hong-Kong-China.yml)\]
Houston, TX, US
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Houston-TX-US.yml)\]
Indian Government Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Indian-Government-Data.yml)\]
Indonesian Data Portal
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Indonesian-Data-Portal.yml)\]
Iowa - Welcome to the State of Iowa's data portal. Please explore
data about Iowa and your \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Iowa.yml)\]
Ireland's Open Data Portal
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Irelands-Open-Data-Portal.yml)\]
Israel's Open Data Portal
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Israel.yml)\]
Istanbul Municipality Open Data Portal
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Istanbul-Municipality-Open-Data.yml)\]
Italy - Il Portale dati.gov.it è il catalogo nazionale dei metadati
relativi ai dati \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Italy.yml)\]
Jail deaths in America - The U.S. government does not release jail
by jail mortality data,
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Jail-deaths-in-America.yml)\]
Japan
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Japan.yml)\]
Laval, QC,
Canada
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Laval-QC-Canada.yml)\]
Lexington, KY
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Lexington-KY.yml)\]
London Datastore, UK
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/London-Datastore-UK.yml)\]
London, ON,
Canada
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/London-ON-Canada.yml)\]
Los Angeles Open Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Los-Angeles-Open-Data.yml)\]
Luxembourg - Luxembourgish Open Data
Portal
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Luxembourg.yml)\]
Malaysia
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Malaysia.yml)\]
MassGIS, Massachusetts,
U.S.
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/MassGIS-Massachusetts-U.S..yml)\]
Metropolitan Transportation Commission (MTC), California,
US
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Metropolitain-Transportation-Commission-MTC-California-US.yml)\]
Mexico
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Mexico.yml)\]
Mississauga, ON,
Canada
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Missisauga-ON-Canada.yml)\]
Moldova
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Moldova.yml)\]
Moncton, NB, Canada
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Moncton-NB-Canada.yml)\]
Montreal, QC, Canada
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Montreal-QC-Canada.yml)\]
Mountain View, California, US
(GIS)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Mountain-View-California-US-GIS.yml)\]
NYC Open Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/NYC-Open-Data.yml)\]
NYC betanyc
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/NYC-betanyc.yml)\]
Netherlands
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Netherlands.yml)\]
New York Department of Sanitation Monthly Tonnage - DSNY Monthly
Tonnage Data provides
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/New-York-Department-of-Sanitation.yml)\]
New Zealand
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/New-Zealand.yml)\]
OECD
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/OECD.yml)\]
Oakland, California, US
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Oakland-California-US.yml)\]
Oklahoma
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Oklahoma.yml)\]
Open Data for Africa
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Open-Data-for-Africa.yml)\]
Open Government Data (OGD) Platform India
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Open-Government-Data-OGD-Platform-India.yml)\]
OpenDataSoft's list of 1,600 open
data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/OpenDataSofts-list-of-1600-open-data.yml)\]
Oregon
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Oregon.yml)\]
Ottawa, ON, Canada
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Ottawa-ON-Canada.yml)\]
Palo Alto, California, US
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Palo-Alto-California-US.yml)\]
OpenDataPhilly - OpenDataPhilly is a catalog of open data in the
Philadelphia region. In \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Philadelphia-Open-Data.yml)\]
Portland, Oregon
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Portland-Oregon.yml)\]
Portugal - Pordata organization
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Portugal.yml)\]
Puerto Rico Government
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Puerto-Rico-Government.yml)\]
Quebec City, QC, Canada
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Quebec-City-QC-Canada.yml)\]
Quebec Province of Canada
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Quebec-Province-of-Canada.yml)\]
Regina SK, Canada
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Regina-SK-Canada.yml)\]
Rio de Janeiro, Brazil
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Rio-de-Janeiro-Brazil.yml)\]
Romania
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Romania.yml)\]
Russia
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Russia.yml)\]
San Diego, CA
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/San%20Diego,%20CA.yml)\]
San Antonio, TX - Community Information Now - CI:Now is a nonprofit
serving Bexar (San \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/San-Antonio-TX-US-Community-Information-Now.yml)\]
San Francisco Data sets
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/San-Francisco-Data-sets.yml)\]
San Jose, California, US
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/San-Jose-California-US.yml)\]
San Mateo County, California, US
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/San-Mateo-County-California-US.yml)\]
Saskatchewan, Province of Canada
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Saskatchewan-Province-of-Canada.yml)\]
Seattle
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Seattle.yml)\]
Singapore Government Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Singapore-Government-Data.yml)\]
South Africa Trade Statistics
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/South-Africa-Trade-Statistics.yml)\]
South Africa
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/South-Africa.yml)\]
State of Utah, US
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/State-of-Utah-US.yml)\]
Switzerland
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Switzerland.yml)\]
Taiwan gov
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Taiwan-g0v.yml)\]
Taiwan
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Taiwan.yml)\]
Tel-Aviv Open
Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Tel-Aviv.yml)\]
Texas Open Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Texas-Open-Data.yml)\]
The World
Bank
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/The-World-Bank.yml)\]
Toronto, ON, Canada
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Toronto-ON-Canada.yml)\]
Tunisia
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Tunisia.yml)\]
U.K. Government Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/U.K.-Government-Data.yml)\]
U.S. American Community
Survey
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/U.S.-American-Community-Survey.yml)\]
U.S. CDC Public Health
datasets
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/U.S.-CDC-Public-Health-datasets.yml)\]
U.S. Census Bureau
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/U.S.-Census-Bureau.yml)\]
U.S. Department of Housing and Urban Development
(HUD)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/U.S.-Department-of-Housing-and-Urban-Development-HUD.yml)\]
U.S. Federal Government Agencies
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/U.S.-Federal-Government-Agencies.yml)\]
U.S. Federal Government Data
Catalog
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/U.S.-Federal-Government-Data-Catalog.yml)\]
U.S. Food and Drug Administration
(FDA)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/U.S.-Food-and-Drug-Administration-FDA.yml)\]
U.S. National Center for Education Statistics
(NCES)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/U.S.-National-Center-for-Education-Statistics-NCES.yml)\]
U.S. Open Government
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/U.S.-Open-Government.yml)\]
UK 2011 Census Open Atlas
Project
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/UK-2011-Census-Open-Atlas-Project.yml)\]
UNESCO Data Hub - UNESCO's official data catalog providing
authoritative global statistics \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/UNESCO-Data-Hub.yml)\]
US Counties - This is a repository of various data, broken down by
US county. While most of
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/US-Counties.yml)\]
U.S. Patent and Trademark Office (USPTO) Bulk Data
Products
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/USPTO-Bulk-Data-Products.yml)\]
Uganda Bureau of
Statistics
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Uganda-Bureau-of-Statistics.yml)\]
Ukraine
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Ukraine.yml)\]
United Nations
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/United-Nations.yml)\]
Uruguay
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Uruguay.yml)\]
Valley Transportation Authority (VTA), California,
US
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Valley-Transportation-Authority-VTA-California-US.yml)\]
Vancouver, BC Open Data
Catalog
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Vancouver-BC-Open-Data-Catalog.yml)\]
Victoria, BC, Canada
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Victoria-BC-Canada.yml)\]
Vienna, Austria
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Vienna-Austria.yml)\]
Statistics from the General Statistics Office of Vietnam - Data in
different categories are
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/Vietnam.yml)\]
U.S. Congressional Research Service (CRS)
Reports
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Government/everycrsreport.yml)\]
Healthcare
AWS COVID-19 Datasets - We're working with organizations who make
COVID-19-related data
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Healthcare/Aws-COVID-19.yml)\]
COVID-19 Case Surveillance Public Use Data - The COVID-19 case
surveillance system database
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Healthcare/COVID-19-Case-Surveillance-Public-Use-Data.yml)\]
Covid-19 non-processed data of Ecuador - It's a project which
provides non-processed datasets
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Healthcare/COVID-19-Ecuador-Data.yml)\]
2019 Novel Coronavirus COVID-19 Data Repository by Johns Hopkins
CSSE - This is the data
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Healthcare/COVID-19-Johns-Hopkins.yml)\]
Coronavirus (Covid-19) Data in the United States - The New York
Times is releasing a series
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Healthcare/COVID-19-New-York-Times.yml)\]
COVID-19 Reported Patient Impact and Hospital Capacity by
Facility - The following dataset
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Healthcare/COVID-19-Reported-Patient-Impact-and-Hospital-Capacity-by-Facility.yml)\]
Composition of Foods Raw, Processed, Prepared USDA National
Nutrient Database for Standard
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Healthcare/Composition-of-Foods-Raw-Processed-Prepared-USDA-National-Nutrient-Database-for-Standard-Reference.yml)\]
The COVID Tracking Project - The COVID Tracking Project collects
and publishes the most \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Healthcare/Covid-Tracking-Project.yml)\]
EHDP Large Health Data
Sets
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Healthcare/EHDP-Large-Health-Data-Sets.yml)\]
GDC - GDC supports several cancer genome programs for CCG, TCGA,
TARGET etc.
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Healthcare/GDC.yml)\]
Gapminder World demographic
databases
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Healthcare/Gapminder-World-demographic-databases.yml)\]
MeSH, the vocabulary thesaurus used for indexing articles for
PubMed
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Healthcare/MeSH-the-vocabulary-thesaurus-used-for-indexing-articles-for-PubMed.yml)\]
MeDAL - A large medical text dataset curated for abbreviation
disambiguation - Medical
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Healthcare/Medal-medical-abbreviations.yml)\]
Medicare Coverage Database (MCD),
U.S.
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Healthcare/Medicare-Coverage-Database-MCD-U.S..yml)\]
Medicare Data Engine of medicare.gov
Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Healthcare/Medicare-Data-Engine-of-medicare.gov-Data.yml)\]
Medicare Data File
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Healthcare/Medicare-Data-File.yml)\]
Nightingale Open Science
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Healthcare/Nightingale.yml)\]
Number of Ebola Cases and Deaths in Affected Countries
(2014)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Healthcare/Number-of-Ebola-Cases-and-Deaths-in-Affected-Countries-2014.yml)\]
Open-ODS (structure of the UK NHS)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Healthcare/Open-ODS.yml)\]
OpenPaymentsData, Healthcare financial relationship
data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Healthcare/OpenPaymentsData-Healthcare-financial-relationship-data.yml)\]
PhysioBank Databases - A large and growing archive of physiological
data.
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Healthcare/PhysioBank-Databases.yml)\]
Spanish Flu Dataset - Historical dataset about the 1918–1920
Spanish Flu pandemic, including
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Healthcare/Spanish-Flu.yml)\]
The Cancer Imaging Archive
(TCIA)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Healthcare/TCIA.yml)\]
The Cancer Genome Atlas project
(TCGA)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Healthcare/The-Cancer-Genome-Atlas-project-TCGA.yml)\]
World Health Organization Global Health
Observatory
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Healthcare/World-Health-Organization-Global-Health-Observatory.yml)\]
Yahoo Knowledge Graph COVID-19 Datasets - The Yahoo Knowledge Graph
team at Verizon Media is
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Healthcare/Yahoo-COVID-19.yml)\]
Informatics for Integrating Biology and the
Bedside
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Healthcare/i2b2.yml)\]
ImageProcessing
10k US Adult Faces
Database
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/10k-US-Adult-Faces-Database.yml)\]
2GB of Photos of
Cats
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/2GB-of-Photos-of-Cats.yml)\]
Audience Unfiltered faces for gender and age
classification
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/Adience-Unfiltered-faces-for-gender-and-age-classification.yml)\]
Affective Image Classification
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/Affective-Image-Classification.yml)\]
Airborne Object Detection and Tracking - The Airborne Object
Tracking (AOT) dataset is a
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/Airborne-Object-Detection-and-Tracking.yml)\]
Animals with attributes
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/Animals-with-attributes.yml)\]
CADDY Underwater Stereo-Vision Dataset of divers' hand gestures -
Contains 10K stereo pair
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/CADDY-Underwater-Stereo-Vision-Dataset-of-hand-gestures.yml)\]
Cytology Dataset – CCAgT: Images of Cervical Cells with AgNOR
Stain Technique - Contains 9339
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/CCAgT.yml)\]
Caltech Pedestrian Detection
Benchmark
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/Caltech-Pedestrian-Detection-Benchmark.yml)\]
Chars74K dataset - Character Recognition in Natural Images (both
English and Kannada are
available)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/Chars74K-dataset.yml)\]
Cube++ - 4890 raw 18-megapixel images, each containing a SpyderCube
color target in their
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/Cube-Plus-Plus.yml)\]
Densely Annotated Video Driving Data Set - This data set consists
of 28 video sequences of
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/DAVID.yml)\]
Danbooru Tagged Anime Illustration Dataset - A large-scale anime
image database with 3.33m+ \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/Danbooru-Tagged-Anime-Illustration-Dataset.yml)\]
DukeMTMC Data Set - DukeMTMC aims to accelerate advances in
multi-target multi-camera
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/DukeMTMC-Data-Set.yml)\]
ETH Entomological Collection (ETHEC) Fine Grained Butterfly
(Lepidoptra) Images
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/ETH_Entomological_Collection_Fine_Grained_Butterfly_Images.yml)\]
Face Recognition Benchmark
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/Face-Recognition-Benchmark.yml)\]
Flickr: 32 Class Brand
Logos
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/Flickr-32-Class-Brand-Logos.yml)\]
GDXray - X-ray images for X-ray testing and Computer
Vision
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/GDXray.yml)\]
HumanEva Dataset - The HumanEva-I dataset contains 7 calibrated
video sequences (4 grayscale
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/HumanEva-Dataset.yml)\]
ImageNet (in WordNet hierarchy)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/ImageNet.yml)\]
Indoor Scene
Recognition
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/Indoor-Scene-Recognition.yml)\]
International Affective Picture System,
UFL
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/International-Affective-Picture-System-UFL.yml)\]
KITTI Vision Benchmark
Suite
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/KITTI-Vision-Benchmark-Suite.yml)\]
Labeled Information Library of Alexandria - Biology and
Conservation - Contains over 10 \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/LILA-BC.yml)\]
Long duration stitched and unstitched 8K/30 fps stereoscopic 360°
videos - This 360° video
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/Long-duration-stitched-and-unstitched-8K-30-fps-stereoscopic-360deg-videos.yml)\]
MNIST database of handwritten digits, near 1 million
examples
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/MNIST-database-of-handwritten-digits-near-1-million-examples.yml)\]
Multi-View Region of Interest Prediction Dataset for Autonomous
Driving - Contains 16 driving
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/MV-ROI.yml)\]
Massive Visual Memory Stimuli,
MIT
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/Massive-Visual-Memory-Stimuli-MIT.yml)\]
Newspaper Navigator - This dataset consists of extracted visual
content for 16,358,041
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/Newspaper-Navigator.yml)\]
Open Images From Google - Pictures with segmentation masks for 2.8
million object instances
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/OpenImagesByGoogle.yml)\]
RuFa - Contains images of text written in one of two Arabic fonts
(Ruqaa and Nastaliq
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/RuFa-Arabic-font-dataset.yml)\]
SUN database,
MIT
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/SUN-database-MIT.yml)\]
SVIRO Synthetic Vehicle Interior Rear Seat Occupancy - 25.000
synthetic scenery's across ten \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/SVIRO.yml)\]
Several Shape-from-Silhouette
Datasets
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/Several-Shape-from-Silhouette-Datasets.yml)\]
Stanford Dogs
Dataset
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/Stanford-Dogs-Dataset.yml)\]
The Action Similarity Labeling (ASLAN)
Challenge
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/The-Action-Similarity-Labeling-ASLAN-Challenge.yml)\]
The Oxford-IIIT Pet
Dataset
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/The-Oxford-IIIT-Pet-Dataset.yml)\]
Violent-Flows - Crowd Violence / Non-violence Database and
benchmark
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/Violent-Flows.yml)\]
Visual genome
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/Visual-genome.yml)\]
YouTube Faces Database
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ImageProcessing/YouTube-Faces-Database.yml)\]
MachineLearning
All-Age-Faces Dataset - Contains 13'322 Asian face images
distributed across all ages (from 2
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//MachineLearning/All-Age-Faces-Dataset.yml)\]
Audi Autonomous Driving Dataset - We have published the Audi
Autonomous Driving Dataset
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//MachineLearning/Audi-Autonomous-Driving-Dataset.yml)\]
B3FD - Facial age (and gender) estimation dataset with 375k
images - The B3FD dataset is a
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//MachineLearning/Biometrically-Filtered-Famous-Figure-Dataset-for-Age-Estimation.yml)\]
Context-aware data sets from five
domains
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//MachineLearning/Context-aware-datasets-from-five-domains.yml)\]
Delve Datasets for classification and
regression
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//MachineLearning/Delve-Datasets-for-classification-and-regression.yml)\]
Discogs Monthly Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//MachineLearning/Discogs-Monthly-Data.yml)\]
Fluorescent Neuronal Cells - By releasing this dataset, we aim at
providing a new testbed for
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//MachineLearning/Fluorescent-Neuronal-Cells.yml)\]
Free Music Archive
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//MachineLearning/Free-Music-Archive.yml)\]
IMDb Database
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//MachineLearning/IMDb-Database.yml)\]
Iranis - A Large-scale Dataset of Farsi/Arabic License Plate
Characters
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//MachineLearning/Iranis.yml)\]
Keel Repository for classification, regression and time
series
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//MachineLearning/Keel-Repository-for-classification-regression-and-time-series.yml)\]
LLVIP - This dataset contains 30976 images, or 15488 pairs, most of
which were taken at very
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//MachineLearning/LLVIP.yml)\]
Labeled Faces in the Wild (LFW)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//MachineLearning/Labeled-Faces-in-the-Wild-LFW.yml)\]
Lending Club Loan
Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//MachineLearning/Lending-Club-Loan-Data.yml)\]
Machine Learning Data Set Repository
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//MachineLearning/Machine-Learning-Data-Set-Repository.yml)\]
Million Song Dataset
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//MachineLearning/Million-Song-Dataset.yml)\]
More Song
Datasets
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//MachineLearning/More-Song-Datasets.yml)\]
MovieLens Data Sets
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//MachineLearning/MovieLens-Data-Sets.yml)\]
New Yorker caption contest
ratings
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//MachineLearning/New-Yorker-caption-contest-ratings.yml)\]
RDataMining - "R and Data Mining" ebook
data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//MachineLearning/RDataMining.yml)\]
Registered Meteorites on
Earth
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//MachineLearning/Registered-Meteorites-on-Earth.yml)\]
Restaurants Health Score Data in San
Francisco
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//MachineLearning/Restaurants-Health-Score-Data-in-San-Francisco.yml)\]
TikTok Dataset - More than 300 dance videos that capture a single
person performing dance
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//MachineLearning/Tik-Tok-Dataset.yml)\]
UCI Machine Learning Repository
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//MachineLearning/UCI-Machine-Learning-Repository.yml)\]
Yambda-5B — A Large-Scale Multi-modal Dataset for Ranking And
Retrieval - Industrial-scale
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//MachineLearning/YaMBDa-5B-Music-Interaction-Dataset.yml)\]
Yahoo! Ratings and Classification
Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//MachineLearning/Yahoo-Ratings-and-Classification-Data.yml)\]
YouTube-BoundingBoxes
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//MachineLearning/YouTube-BoundingBoxes.yml)\]
Youtube 8m
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//MachineLearning/Youtube-8m.yml)\]
eBay Online Auctions
(2012)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//MachineLearning/eBay-Online-Auctions-2012.yml)\]
Museums
Canada Science and Technology Museums Corporation's Open
Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Museums/Canada-Science-and-Technology-Museums-Corporations-Open-Data.yml)\]
Cooper-Hewitt's Collection
Database
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Museums/Cooper-Hewitt-Collection-Database.yml)\]
Metropolitan Museum of Art Collection
API
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Museums/Metropolitan-Museum-of-Art-Collection-API.yml)\]
Minneapolis Institute of Arts
metadata
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Museums/Minneapolis-Institute-of-Arts-metadata.yml)\]
Natural History Museum (London) Data
Portal
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Museums/Natural-History-Museum-London-Data-Portal.yml)\]
Rijksmuseum Historical Art
Collection
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Museums/Rijksmuseum-Historical-Art-Collection.yml)\]
Tate Collection
metadata
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Museums/Tate-Collection-metadata.yml)\]
The Getty vocabularies
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Museums/The-Getty-vocabularies.yml)\]
NaturalLanguage
Automatic Keyphrase
Extraction
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/Automatic-Keyphrase-Extraction.yml)\]
The Big Bad NLP Database
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/BigBadNLPDatabase.yml)\]
Blizzard Challenge Speech - The speech + text data comes from
professional audiobooks
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/Blizzard-Speech.yml)\]
Blogger Corpus
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/Blogger-Corpus.yml)\]
CLiPS Stylometry Investigation
Corpus
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/CLiPS-Stylometry-Investigation-Corpus.yml)\]
ClueWeb09 FACC
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/ClueWeb09-FACC.yml)\]
ClueWeb12 FACC
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/ClueWeb12-FACC.yml)\]
DBpedia - Structured data from
Wikipedia
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/DBpedia.yml)\]
Dirty Words - With millions of images in our library and billions
of user-submitted keywords,
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/Dirty-Words.yml)\]
Flickr Personal
Taxonomies
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/Flickr-Personal-Taxonomies.yml)\]
Freebase of people, places, and things
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/Freebase-of-people-places-and-things.yml)\]
German Political Speeches Corpus - Collection of political speeches
from the German
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/German-Political-Speeches-Corpus.yml)\]
Google Books Ngrams
(2.2TB)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/Google-Books-Ngrams-2.2TB.yml)\]
Google MC-AFP - Generated based on the public available Gigaword
dataset using Paragraph Vectors
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/Google-MC-AFP.yml)\]
Google Web 5gram (1TB,
2006)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/Google-Web-5gram-1TB-2006.yml)\]
Gutenberg eBooks
List
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/Gutenberg-eBooks-List.yml)\]
Hansards text chunks of Canadian
Parliament
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/Hansards-text-chunks-of-Canadian-Parliament.yml)\]
LJ Speech - Speech dataset consisting of 13,100 short audio clips
of a single speaker reading
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/LJ-Speech.yml)\]
M-AILabs Speech - The M-AILABS Speech Dataset is the first large
dataset that we are
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/M-AILABS-Speech.yml)\]
Microsoft MAchine Reading COmprehension Dataset (or MS
MARCO)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/MS-MARCO.yml)\]
Machine Comprehension Test (MCTest) of text from Microsoft
Research
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/Machine-Comprehension-Test-MCTest-of-text-from-Microsoft-Research.yml)\]
Machine Translation of European
languages
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/Machine-Translation-of-European-languages.yml)\]
Making Sense of Microposts 2013 - Concept
Extraction
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/Making-Sense-of-Microposts-2013.yml)\]
Making Sense of Microposts 2016 - Named Entity rEcognition and
Linking
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/Making-Sense-of-Microposts-2016.yml)\]
Multi-Domain Sentiment Dataset (version
2.0)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/Multi-Domain-Sentiment-Dataset-version-2.0.yml)\]
No Language Left Behind (NLLB - 200vo) - Dataset based on Meta's
metadata for mined bitext.
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/NoLanguageLeftBehindNLLB200vo.yml)\]
Noisy speech database for training speech enhancement algorithms
and TTS models - Clean and
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/Noisy-Speech.yml)\]
Open Multilingual Wordnet
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/Open-Multilingual-Wordnet.yml)\]
POS/NER/Chunk annotated
data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/POS-NER-Chunk-annotated-data.yml)\]
Personae
Corpus
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/Personae-Corpus.yml)\]
SMS Spam Collection in
English
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/SMS-Spam-Collection-in-English.yml)\]
SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K
articles)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/SaudiNewsNet-Collection-of-Saudi-Newspaper-Articles-Arabic-30K-articles.yml)\]
Stanford Question Answering Dataset
(SQuAD)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/Stanford-Question-Answering-Dataset-SQuAD.yml)\]
USENET postings corpus of
2005~2011
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/USENET-postings-corpus-of-2005~2011.yml)\]
Universal Dependencies
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/Universal-Dependencies.yml)\]
Webhose - News/Blogs in multiple
languages
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/Webhose.yml)\]
Wikidata - Wikipedia
databases
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/Wikidata.yml)\]
Wikipedia Links data - 40 Million Entities in
Context
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/Wikipedia-Links-data.yml)\]
WordNet databases and
tools
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/WordNet-databases-and-tools.yml)\]
Wordbank - Open, de-identified database of vocabulary development
from 84,138 children and \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/Wordbank.yml)\]
WorldTree Corpus of Explanation Graphs for Elementary Science
Questions - a corpus of
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//NaturalLanguage/Worldtree-Explanation-Corpus.yml)\]
Neuroscience
Allen Institute Datasets
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Neuroscience/Allen-Institute-Datasets.yml)\]
Brain Catalogue
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Neuroscience/Brain-Catalogue.yml)\]
Brainomics
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Neuroscience/Brainomics.yml)\]
CodeNeuro Datasets
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Neuroscience/CodeNeuro-Datasets.yml)\]
Collaborative Research in Computational Neuroscience
(CRCNS)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Neuroscience/Collaborative-Research-in-Computational-Neuroscience-CRCNS.yml)\]
FCP-INDI
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Neuroscience/FCP-INDI.yml)\]
Human Connectome Project
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Neuroscience/Human-Connectome-Project.yml)\]
NDAR
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Neuroscience/NDAR.yml)\]
NIMH Data Archive
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Neuroscience/NIMH-Data-Archive.yml)\]
NeuroData
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Neuroscience/NeuroData.yml)\]
NeuroMorpho - NeuroMorpho.Org is a centrally curated inventory of
digitally reconstructed \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Neuroscience/NeuroMorpho.yml)\]
Neuroelectro
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Neuroscience/Neuroelectro.yml)\]
OASIS
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Neuroscience/OASIS.yml)\]
OpenNEURO
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Neuroscience/OpenNEURO)\]
OpenfMRI
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Neuroscience/OpenfMRI.yml)\]
Study Forrest
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Neuroscience/Study-Forrest.yml)\]
The Nencki-Symfonia EEG/ERP dataset - A high-density
electroencephalography (EEG) dataset
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Neuroscience/The_Nencki-Symfonia_EEG_ERP_dataset.yml)\]
Physics
CERN Open Data Portal
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Physics/CERN-Open-Data-Portal.yml)\]
Crystallography Open Database
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Physics/Crystallography-Open-Database.yml)\]
IceCube - South Pole Neutrino
Observatory
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Physics/IceCube.yml)\]
Ligo Open Science Center (LOSC) - Gravitational wave data from the
LIGO Hanford and \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Physics/LIGO-Open-Science-Center.yml)\]
NASA Exoplanet Archive
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Physics/NASA-Exoplanet-Archive.yml)\]
NSSDC (NASA) data of 550 space
spacecraft
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Physics/NSSDC-NASA-data-of-550-space-spacecraft.yml)\]
Quantum simulations of an electron in a two dimensional potential
well - The data was
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Physics/Quantum.yml)\]
Sloan Digital Sky Survey (SDSS) - Mapping the
Universe
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Physics/Sloan-Digital-Sky-Survey-SDSS.yml)\]
ProstateCancer
EOPC-DE-Early-Onset-Prostate-Cancer-Germany - Early Onset Prostate
Cancer - Germany. \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/EOPC-DE-Early-Onset-Prostate-Cancer-Germany.yml)\]
GENIE - Data from the Genomics Evidence Neoplasia Information
Exchange (GENIE) project of the
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/GENIE.yml)\]
Genomic-Hallmarks-Prostate-Adenocarcinoma-CPC-GENE - Comprehensive
genomic profiling of 477
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/Genomic-Hallmarks-Prostate-Adenocarcinoma-CPC-GENE.yml)\]
MSK-IMPACT-Clinical-Sequencing-Cohort-MSKCC-Prostate-Cancer -
Targeted sequencing of clinical
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/MSK-IMPACT-Clinical-Sequencing-Cohort-MSKCC-Prostate-Cancer.yml)\]
Metastatic-Prostate-Adenocarcinoma-MCTP - Comprehensive profiling
of 61 prostate cancer
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/Metastatic-Prostate-Adenocarcinoma-MCTP.yml)\]
Metastatic-Prostate-Cancer-SU2CPCF-Dream-Team - Comprehensive
analysis of 150 metastatic
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/Metastatic-Prostate-Cancer-SU2CPCF-Dream-Team.yml)\]
NPCR-2001-2015 - Database from CDC's National Program of Cancer
Registries (NPCR). The
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/NPCR-2001-2015.yml)\]
NPCR-2005-2015 - Database from CDC's National Program of Cancer
Registries (NPCR). The
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/NPCR-2005-2015.yml)\]
NaF-Prostate - NaF Prostate is a collection of F-18 NaF positron
emission tomography/computed
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/NaF-Prostate.yml)\]
Neuroendocrine-Prostate-Cancer - Whole exome and RNA Seq data of
castration resistant
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/Neuroendocrine-Prostate-Cancer.yml)\]
PLCO-Prostate-Diagnostic-Procedures - The Prostate Diagnostic
Procedures dataset (95,837
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/PLCO-Prostate-Diagnostic-Procedures.yml)\]
PLCO-Prostate-Medical-Complications - The Prostate Medical
Complications dataset (3,350
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/PLCO-Prostate-Medical-Complications.yml)\]
PLCO-Prostate-Screening-Abnormalities - The Prostate Screening
Abnormalities dataset (10,527
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/PLCO-Prostate-Screening-Abnormalities.yml)\]
PLCO-Prostate-Screening - The Prostate Screening dataset (177,315
records, 35,875 subjects,
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/PLCO-Prostate-Screening.yml)\]
PLCO-Prostate-Treatments - The Prostate Treatments dataset (13,409
records, 7,614 subjects,
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/PLCO-Prostate-Treatments.yml)\]
PLCO-Prostate - The Prostate dataset is a comprehensive dataset
that contains nearly all the
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/PLCO-Prostate.yml)\]
PRAD-CA-Prostate-Adenocarcinoma-Canada - Prostate Adenocarcinoma -
Canada. Collected by the
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/PRAD-CA-Prostate-Adenocarcinoma-Canada.yml)\]
PRAD-FR-Prostate-Adenocarcinoma-France - Prostate Adenocarcinoma -
France. Collected by ten
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/PRAD-FR-Prostate-Adenocarcinoma-France.yml)\]
PRAD-UK-Prostate-Adenocarcinoma-United-Kingdom - Prostate
Adenocarcinoma - United Kingdom.
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/PRAD-UK-Prostate-Adenocarcinoma-United-Kingdom.yml)\]
PROSTATEx-Challenge - Retrospective set of prostate MR studies. All
studies included
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/PROSTATEx-Challenge.yml)\]
Prostate-3T - The Prostate-3T project provided imaging data to TCIA
as part of an ISBI
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/Prostate-3T.yml)\]
Prostate-Adenocarcinoma-Broad-Cornell-2012 - Comprehensive
profiling of 112 prostate cancer
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/Prostate-Adenocarcinoma-Broad-Cornell-2012.yml)\]
Prostate-Adenocarcinoma-Broad-Cornell-2013 - Comprehensive
profiling of 57 prostate cancer
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/Prostate-Adenocarcinoma-Broad-Cornell-2013.yml)\]
Prostate-Adenocarcinoma-CNA-study-MSKCC - Copy-number profiling of
103 primary prostate
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/Prostate-Adenocarcinoma-CNA-study-MSKCC.yml)\]
Prostate-Adenocarcinoma-Fred-Hutchinson-CRC - Comprehensive
profiling of prostate cancer
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/Prostate-Adenocarcinoma-Fred-Hutchinson-CRC.yml)\]
Prostate Adenocarcinoma (MSKCC/DFCI) - Whole Exome Sequencing of
1013 prostate cancer
samples.
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/Prostate-Adenocarcinoma-MSKCC-DFCI.yml)\]
Prostate-Adenocarcinoma-MSKCC - MSKCC Prostate Oncogenome Project.
181 primary, 37 metastatic
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/Prostate-Adenocarcinoma-MSKCC.yml)\]
Prostate-Adenocarcinoma-Organoids-MSKCC - Exome profiling of
prostate cancer samples and
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/Prostate-Adenocarcinoma-Organoids-MSKCC.yml)\]
Prostate-Adenocarcinoma-Sun-Lab - Whole-genome and Transcriptome
Sequencing of 65 Prostate
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/Prostate-Adenocarcinoma-Sun-Lab.yml)\]
Prostate-Adenocarcinoma-TCGA-PanCancer-Atlas - Comprehensive TCGA
PanCanAtlas data from 11k
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/Prostate-Adenocarcinoma-TCGA-PanCancer-Atlas.yml)\]
Prostate-Adenocarcinoma-TCGA - Integrated profiling of 333 primary
prostate adenocarcinoma
samples.
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/Prostate-Adenocarcinoma-TCGA.yml)\]
Prostate-Diagnosis - PCa T1- and T2-weighted magnetic resonance
images (MRIs) were acquired
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/Prostate-Diagnosis.yml)\]
Prostate-Fused-MRI-Pathology - The Prostate Fused-MRI-Pathology
collection is a combination
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/Prostate-Fused-MRI-Pathology.yml)\]
Prostate-MRI - The Prostate-MRI collection of prostate Magnetic
Resonance Images (MRIs) was
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/Prostate-MRI.yml)\]
Prostate-R - The R package 'ElemStatLearn' contains a prostate
cancer dataset from Stamey et
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/Prostate-R.yml)\]
QIN-PROSTATE-Repeatability - The QIN-PROSTATE-Repeatability dataset
is a dataset with
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/QIN-PROSTATE-Repeatability.yml)\]
QIN-PROSTATE - The QIN PROSTATE collection of the Quantitative
Imaging Network (QIN) contains
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/QIN-PROSTATE.yml)\]
SEER-YR1973_2015.SEER9 - The SEER November 2017 Research Data files
from nine SEER registries
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/SEER-YR1973_2015.SEER9.yml)\]
SEER-YR1992_2015.SJ_LA_RG_AK - The SEER November 2017 Research Data
files from the San Jose-
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/SEER-YR1992_2015.SJ_LA_RG_AK.yml)\]
SEER-YR2000_2015.CA_KY_LO_NJ_GA - The SEER November 2017 Research
Data files from the Greater
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/SEER-YR2000_2015.CA_KY_LO_NJ_GA.yml)\]
SEER-YR2000_2015.CA_KY_LO_NJ_GA - The July - December 2005
diagnoses for Louisiana from their
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/SEER-YR2005.LO_2ND_HALF.yml)\]
TCGA-PRAD-US - TCGA Prostate Adenocarcinoma (499
samples).
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//ProstateCancer/TCGA-PRAD-US.yml)\]
Psychology+Cognition
OSU Cognitive Modeling Repository
Datasets
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Psychology+Cognition/OSU-Cognitive-Modeling-Repository-Datasets.yml)\]
Open Cognitive Science Data - Publicly available behavioral
datasets from across cognitive
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Psychology+Cognition/Open-Cognitive-Science-Data-Repository.yml)\]
PublicDomains
Ably Open Realtime Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//PublicDomains/Ably.yml)\]
Amazon
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//PublicDomains/Amazon.yml)\]
Archive.org Datasets
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//PublicDomains/Archive.org-Datasets.yml)\]
Archive-it from Internet
Archive
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//PublicDomains/Archive.yml)\]
CMU JASA data archive
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//PublicDomains/CMU-JASA-data-archive.yml)\]
CMU StatLab collections
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//PublicDomains/CMU-StatLab-collections.yml)\]
Data.World
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//PublicDomains/Data.World.yml)\]
Data360
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//PublicDomains/Data360.yml)\]
Enigma Public
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//PublicDomains/Enigma-Public.yml)\]
Google
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//PublicDomains/Google.yml)\]
Grand Comics Database - The Grand Comics Database (GCD) is a
nonprofit, internet-based \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//PublicDomains/GrandComics.yml)\]
Infochimps
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//PublicDomains/Infochimps.yml)\]
KDNuggets Data
Collections
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//PublicDomains/KDNuggets-Data-Collections.yml)\]
Microsoft Azure Data Market Free
DataSets
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//PublicDomains/Microsoft-Azure-Data-Market-Free-DataSets.yml)\]
Microsoft Data Science for Research
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//PublicDomains/Microsoft-Data-Science-for-Research.yml)\]
Microsoft Research Open Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//PublicDomains/Microsoft-Research-Open-Data)\]
Open Library Data Dumps
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//PublicDomains/Open-Library-Data-Dumps.yml)\]
Reddit Datasets
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//PublicDomains/Reddit-Datasets.yml)\]
RevolutionAnalytics
Collection
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//PublicDomains/RevolutionAnalytics-Collection.yml)\]
Sample R data
sets
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//PublicDomains/Sample-R-data-sets.yml)\]
Stack Overflow Annual Developer Survey - Annual developer surverys
full data sets from 2011
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//PublicDomains/Stack-Overflow-Annual-Developer-Survey.yml)\]
StatSci.org
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//PublicDomains/StatSci.org.yml)\]
Stats4Stem R data sets
(archived)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//PublicDomains/Stats4Stem-R-data-sets.yml)\]
The Washington Post
List
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//PublicDomains/The-Washington-Post-List.yml)\]
UCLA SOCR data
collection
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//PublicDomains/UCLA-SOCR-data-collection.yml)\]
UFO Reports
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//PublicDomains/UFO-Reports.yml)\]
Wikileaks 911 pager
intercepts
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//PublicDomains/Wikileaks-911-pager-intercepts.yml)\]
Yahoo Webscope
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//PublicDomains/Yahoo-Webscope.yml)\]
SearchEngines
Academic Torrents of data sharing from
UMB
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SearchEngines/Academic-Torrents-of-data-sharing-from-UMB.yml)\]
Base dos Dados - Data Basis: Open Data Repository for
Brazil
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SearchEngines/BaseDosDados.yml)\]
Datahub.io
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SearchEngines/Datahub.io.yml)\]
Domains Project - Sorted list of Internet
domains
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SearchEngines/DomainsProject.yml)\]
Harvard Dataverse Network of scientific
data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SearchEngines/Harvard-Dataverse-Network-of-scientific-data.yml)\]
ICPSR
(UMICH)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SearchEngines/ICPSR-UMICH.yml)\]
Institute of Education Sciences
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SearchEngines/Institute-of-Education-Sciences.yml)\]
National Technical Reports Library
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SearchEngines/National-Technical-Reports-Library.yml)\]
Open Data Certificates
(beta)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SearchEngines/Open-Data-Certificates-beta.yml)\]
OpenDataNetwork - A search engine of all Socrata powered data
portals
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SearchEngines/OpenDataNetwork.yml)\]
Statista.com - statistics and Studies
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SearchEngines/Statista.com.yml)\]
Zenodo - An open dependable home for the long-tail of
science
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SearchEngines/Zenodo.yml)\]
SocialNetworks
2021 Portuguese Elections Twitter Dataset - 57M+ tweets, 1M+
users - This dataset contains
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/2021_Portuguese_Elections_Twitter_Dataset_57M_tweets_1M_users.yml)\]
72 hours #gamergate Twitter
Scrape
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/72-hours-gamergate-Twitter-Scrape.yml)\]
CMU Enron Email of 150 users
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/CMU-Enron-Email-of-150-users.yml)\]
Cheng-Caverlee-Lee September 2009 - January 2010 Twitter
Scrape
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/Cheng-Caverlee-Lee-Twitter-Scrape-September-2009~January-2010.yml)\]
China Biographical Database - The China Biographical Database is a
freely accessible \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/China-Biographical-Database.yml)\]
Clubhouse
Dataset
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/Clubhouse-Dataset.yml)\]
A Twitter Dataset of 40+ million tweets related to COVID-19 - Due
to the relevance of the \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/Covid19-40-Million-Tweets.yml)\]
43k+ Donald Trump Twitter Screenshots - This archive contains
screenshots of 43,475 Donald
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/Donald-Trump-Twitter-Screenshots.yml)\]
EDRM Enron EMail of 151 users, hosted on
S3
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/EDRM-Enron-EMail-of-151-users-hosted-on-S3.yml)\]
Facebook Data Scrape
(2005)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/Facebook-Data-Scrape-2005.yml)\]
Facebook Social Connectedness Index - We use an anonymized snapshot
of all active Facebook
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/Facebook-Social-Connectedness-Index.yml)\]
Facebook Social Networks from LAW (since
2007)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/Facebook-Social-Networks-from-LAW-since-2007.yml)\]
Foursquare from UMN/Sarwat
(2013)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/Foursquare-from-UMN-Sarwat-2013.yml)\]
GitHub Collaboration Archive
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/GitHub-Collaboration-Archive.yml)\]
Google Scholar citation
relations
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/Google-Scholar-citation-relations.yml)\]
High-Resolution Contact Networks from Wearable
Sensors
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/High-Resolution-Contact-Networks-from-Wearable-Sensors.yml)\]
Indie Map: social graph and crawl of top IndieWeb
sites
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/Indie-Map.yml)\]
Mobile Social Networks from
UMASS
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/Mobile-Social-Networks-from-UMASS.yml)\]
Network Twitter
Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/Network-Twitter-Data.yml)\]
Reddit Comments
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/Reddit-Comments.yml)\]
Skytrax' Air Travel Reviews
Dataset
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/Skytrax-Air-Travel-Reviews-Dataset.yml)\]
Social Twitter
Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/Social-Twitter-Data.yml)\]
SourceForge.net Research
Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/SourceForge.net-Research-Data.yml)\]
The Reddit COVID dataset - This dataset attempts to capture the
full extent of COVID-19
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/The-Reddit-COVID-Dataset.yml)\]
Twitch Top Streamer's
Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/TwitchTopStreamers.yml)\]
Twitter Data for Online Reputation
Management
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/Twitter-Data-for-Online-Reputation-Management.yml)\]
Twitter Data for Sentiment
Analysis
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/Twitter-Data-for-Sentiment-Analysis.yml)\]
Twitter Graph of entire Twitter
site
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/Twitter-Graph-of-entire-Twitter-site.yml)\]
Twitter Scrape Calufa May
2011
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/Twitter-Scrape-Calufa-May-2011.yml)\]
UNIMI/LAW Social Network
Datasets
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/UNIMILAW-Social-Network-Datasets.yml)\]
United States Congress Twitter Data - Daily datasets with tweets of
1100+ accounts associated
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/United-States-Congressional-Twitter-Data.yml)\]
Yahoo! Graph and Social
Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/Yahoo-Graph-and-Social-Data.yml)\]
Youtube Video Social Graph in
2007,2008
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialNetworks/Youtube-Video-Social-Graph-in-2007~2008.yml)\]
SocialSciences
ACLED (Armed Conflict Location & Event Data
Project)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/ACLED.yml)\]
Authoritarian Ruling Elites Database - The Authoritarian Ruling
Elites Database (ARED) is a
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/Authoritarian-Ruling-Elites.yml)\]
Canadian Legal Information
Institute
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/Canadian-Legal-Information-Institute.yml)\]
Center for Systemic Peace Datasets - Conflict Trends, Polities,
State Fragility, etc
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/Center-for-Systemic-Peace-Datasets.yml)\]
Correlates of War Project
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/Correlates-of-War-Project.yml)\]
Cryptome Conspiracy Theory Items
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/Cryptome-Conspiracy-Theory-Items.yml)\]
Datacards
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/Datacards.yml)\]
European Social Survey
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/European-Social-Survey.yml)\]
FBI Hate Crime 2013 - aggregated
data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/FBI-Hate-Crime-2013.yml)\]
Fragile States Index
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/Fragile-States-Index.yml)\]
GDELT Global Events Database
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/GDELT-Global-Events-Database.yml)\]
General Social Survey (GSS) since 1972
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/General-Social-Survey-GSS-since-1972.yml)\]
German Social Survey
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/German-Social-Survey.yml)\]
Global Religious Futures
Project
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/Global-Religious-Futures-Project.yml)\]
Gun Violence Data - A comprehensive, accessible database that
contains records of over 260k
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/Gun-Violence-Data.yml)\]
Humanitarian Data Exchange
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/Humanitarian-Data-Exchange.yml)\]
INFORM Index for Risk
Management
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/INFORM-Index-for-Risk-Management.yml)\]
Institute for Demographic Studies
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/Institute-for-Demographic-Studies.yml)\]
Inter-American Development Bank Open Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/Inter-American-Development-Bank-Open-Data.yml)\]
International Networks Archive
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/International-Networks-Archive.yml)\]
International Social Survey Program ISSP
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/International-Social-Survey-Program-ISSP.yml)\]
International Studies Compendium
Project
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/International-Studies-Compendium-Project.yml)\]
James McGuire Cross National
Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/James-McGuire-Cross-National-Data.yml)\]
MIT Reality Mining
Dataset
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/MIT-Reality-Mining-Dataset.yml)\]
MacroData Guide by Norsk samfunnsvitenskapelig
datatjeneste
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/MacroData-Guide-by-Norsk-samfunnsvitenskapelig-datatjeneste.yml)\]
Mass Mobilization Data Project - The Mass Mobilization (MM) data
are an effort to understand
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/Mass-Mobilization-Data-Project.yml)\]
Microsoft Academic Knowledge Graph - The Microsoft Academic
Knowledge Graph is a large RDF \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/Microsoft-Academic-Knowledge-Graph.yml)\]
Minnesota Population Center
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/Minnesota-Population-Center.yml)\]
Notre Dame Global Adaptation Index
(ND-GAIN)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/Notre-Dame-Global-Adaptation-Index-NG-DAIN.yml)\]
Open Crime and Policing Data in England, Wales and Northern
Ireland
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/Open-Crime-and-Policing-Data-in-England-Wales-and-Northern-Ireland.yml)\]
OpenSanctions - A global database of persons and companies of
political, criminal, or
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/OpenSanctions.yml)\]
Paul Hensel General International Data
Page
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/Paul-Hensel-General-International-Data-Page.yml)\]
PewResearch Internet Survey
Project
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/PewResearch-Internet-Survey-Project.yml)\]
PewResearch Society Data
Collection
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/PewResearch-Society-Data-Collection.yml)\]
Political Polarity
Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/Political-Polarity-Data.yml)\]
StackExchange Data Explorer
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/StackExchange-Data-Explorer.yml)\]
Terrorism Research and Analysis
Consortium
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/Terrorism-Research-and-Analysis-Consortium.yml)\]
Texas Inmates Executed Since
1984
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/Texas-Inmates-Executed-Since-1984.yml)\]
Titanic Survival Data Set
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/Titanic-Survival-Data-Set.yml)\]
UCB's Archive of Social Science Data
(D-Lab)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/UCBs-Archive-of-Social-Science-Data-D-Lab.yml)\]
UCLA Social Sciences Data
Archive
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/UCLA-Social-Sciences-Data-Archive.yml)\]
UN Civil Society Database
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/UN-Civil-Society-Database.yml)\]
UPJOHN for Labor Employment
Research
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/UPJOHN-for-Labor-Employment-Research.yml)\]
Universities Worldwide
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/Universities-Worldwide.yml)\]
Uppsala Conflict Data Program
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/Uppsala-Conflict-Data-Program.yml)\]
World Bank Open Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/World-Bank-Open-Data.yml)\]
World Inequality Database - The World Inequality Database
(WID.world) aims to provide open \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/World-Inequality-Database.yml)\]
WorldPop project - Worldwide human population
distributions
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//SocialSciences/WorldPop-project.yml)\]
Software
FLOSSmole data about free, libre, and open source software
development
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Software/FLOSSmole-data-about-free-libre-and-open-source-software-development.yml)\]
GHTorrent - Scalable, queryable, offline mirror of data offered
through the GitHub REST API.
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Software/GHTorrent.yml)\]
Libraries.io Open Source Repository and Dependency
Metadata
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Software/Libraries.io-Open-Source-Repository-and-Dependency-Metadata.yml)\]
Public Git Archive - a Big Code dataset for all – dataset of
182,014 top-bookmarked Git
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Software/source%7Bd%7D-Public-Git-Archive.yml)\]
Code duplicates - 2k Java file and 600 Java function pairs labeled
as similar or different by
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Software/source%7Bd%7D-code-duplicates.yml)\]
Commit messages - 1.3 billion GitHub commit messages till March
2019
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Software/source%7Bd%7D-commit-messages.yml)\]
Pull Request review comments - 25.3 million GitHub PR review
comments since January 2015 till
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Software/source%7Bd%7D-pull-request-review-comments.yml)\]
Source Code Identifiers - 41.7 million distinct splittable
identifiers collected from 182,014
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Software/source%7Bd%7D-source-code-identifiers.yml)\]
Sports
American Ninja Warrior Obstacles - Contains every obstacle in the
history of American Ninja
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Sports/American-Ninja-Warrior-Obstacles.yml)\]
Betfair Historical Exchange Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Sports/Betfair-Historical-Exchange-Data.yml)\]
Cricsheet Matches (cricket)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Sports/Cricsheet-Matches-cricket.yml)\]
Equity in Athletics - The Equity in Athletics Data Analysis Cutting
Tool is brought to you by \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Sports/Equity-in-Athletics.yml)\]
Ergast Formula 1, from 1950 up to date
(API)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Sports/Ergast-Formula-1-from-1950-up-to-date-API.yml)\]
Football/Soccer resources (data and
APIs)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Sports/FootballSoccer-resources-data-and-APIs.yml)\]
Lahman's Baseball Database
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Sports/Lahmans-Baseball-Database.yml)\]
NFL play-by-play data - NFL play-by-play data sourced from:
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Sports/NFL-play-by-play.yml)\]
Pinhooker: Thoroughbred Bloodstock Sale
Data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Sports/Pinhooker.yml)\]
Pro Kabadi season 1 to 7 - Pro Kabadi League is a
professional-level Kabaddi league in India.
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Sports/Pro_Kabadi_season1_7.yml)\]
Retrosheet Baseball Statistics
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Sports/Retrosheet-Baseball-Statistics.yml)\]
Tennis database of rankings, results, and stats for
ATP
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Sports/Tennis-database-of-rankings-results-and-stats-for-ATP.yml)\]
Tennis database of rankings, results, and stats for
WTA
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Sports/Tennis-database-of-rankings-results-and-stats-for-WTA.yml)\]
Transfermarkt Datasets - Clean, structured and automatically
updated football (soccer) data
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Sports/Transfermarkt-Datasets.yml)\]
USA Soccer Teams and Locations - USA soccer teams and locations.
MLS, NWSL, and USL \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Sports/USA-Soccer.yml)\]
TimeSeries
3W dataset - To the best of its authors' knowledge, this is the
first realistic and public
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//TimeSeries/3W-dataset-rare-undesirable-real-events-in-oil-wells.yml)\]
Databanks International Cross National Time Series Data
Archive
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//TimeSeries/Databanks-International-Cross-National-Time-Series-Data-Archive.yml)\]
Hard Drive Failure
Rates
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//TimeSeries/Hard-Drive-Failure-Rates.yml)\]
Heart Rate Time Series from MIT
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//TimeSeries/Heart-Rate-Time-Series-from-MIT.yml)\]
Time Series Data Library (TSDL) from
MU
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//TimeSeries/Time-Series-Data-Library-TSDL-from-MU.yml)\]
Turing Change Point Dataset - Contains 42 annotated time series
collected for the development
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//TimeSeries/Turing-Change-Point-Dataset.yml)\]
UC Riverside Time Series
Dataset
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//TimeSeries/UC-Riverside-Time-Series-Dataset.yml)\]
Transportation
Airlines OD Data 1987-2008
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Transportation/Airlines-OD-Data-1987~2008.yml)\]
Ford GoBike Data (formerly Bay Area Bike Share
Data)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Transportation/Bay-Area-Bike-Share-Data.yml)\]
Bike Share Systems (BSS)
collection
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Transportation/Bike-Share-Systems-BSS-collection.yml)\]
Dutch Traffic Information
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Transportation/Dutch-Traffic-Information.yml)\]
GeoLife GPS Trajectory from Microsoft
Research
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Transportation/GeoLife-GPS-Trajectory-from-Microsoft-Research.yml)\]
German train system by Deutsche
Bahn
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Transportation/German-train-system-by-Deutsche-Bahn.yml)\]
Hubway Million Rides in
MA
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Transportation/Hubway-Million-Rides-in-MA.yml)\]
Melbourne Pedestrian Counting - This dataset contains hourly
pedestrian counts since 2009
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Transportation/Melbourne-pedestrian-counting.yml)\]
Montreal BIXI Bike Share
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Transportation/Montreal-BIXI-Bike-Share.yml)\]
NYC Taxi Trip Data
2009-
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Transportation/NYC-Taxi-Trip-Data-2009.yml)\]
NYC Taxi Trip Data 2013
(FOIA/FOILed)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Transportation/NYC-Taxi-Trip-Data-2013-FOIA-FOILed.yml)\]
NYC Uber trip data April 2014 to September
2014
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Transportation/NYC-Uber-trip-data-April-2014-to-September-2014.yml)\]
Open Traffic
collection
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Transportation/Open-Traffic-collection.yml)\]
OpenFlights - airport, airline and route
data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Transportation/OpenFlights.yml)\]
Philadelphia Bike Share Stations
(JSON)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Transportation/Philadelphia-Bike-Share-Stations-JSON.yml)\]
Plane Crash Database, since
1920
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Transportation/Plane-Crash-Database-since-1920.yml)\]
RITA Airline On-Time Performance
data
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Transportation/RITA-Airline-On.yml)\]
RITA/BTS transport data collection
(TranStat)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Transportation/RITA-BTS-transport-data-collection-TranStat.yml)\]
Renfe (Spanish National Railway Network)
dataset
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Transportation/Spanish-train-system-by-Renfe.yml)\]
Toronto Bike Share Stations (JSON and GBFS
files)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Transportation/Toronto-Bike-Share-Stations-XML-file.yml)\]
Transport for London
(TFL)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Transportation/Transport-for-London-TFL.yml)\]
Travel Tracker Survey (TTS) for
Chicago
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Transportation/Travel-Tracker-Survey-TTS-for-Chicago.yml)\]
U.S. Bureau of Transportation Statistics
(BTS)
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Transportation/U.S.-Bureau-of-Transportation-Statistics-BTS.yml)\]
U.S. Domestic Flights 1990 to
2009
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Transportation/U.S.-Domestic-Flights-1990-to-2009.yml)\]
U.S. Freight Analysis Framework since
2007
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Transportation/U.S.-Freight-Analysis-Framework-since-2007.yml)\]
U.S. National Highway Traffic Safety Administration - Fatalities
since 1975 - Contains CSV \[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//Transportation/U.S.-National-Highway-Traffic-Safety-Administation-Fatalities-since-1975.yml)\]
eSports
CS:GO Competitive Matchmaking Data - In this data set we have data
about the CSGO matchmaking
\[\...\]
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//eSports/csgo.yml)\]
FIFA-2021 Complete Player
Dataset
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//eSports/fifa2021.yml)\]
OpenDota data
dump
\[[Meta](https://github.com/awesomedata/apd-core/tree/master/core//eSports/opendota-dump.yml)\]
Complementary Collections
- Data Packaged Core Datasets
- OpenDataMonitor: An overview of available open data resources in Europe
- Quora: Where can I find large datasets open to the public?
- RS.io: 100+ Interesting Data Sets for Statistics
- CVonline: Image Databases
- InnoTrek: Leveraging open data to understand urban lives
- CV Papers: CV Datasets on the web