Dataset Directory
Connecting machine learning practitioners with meaningful datasets
Well-annotated data is required to develop effective machine learning tools for the clinical environment. With the Dataset Directory, we connect machine learning practitioners with accessible and meaningful datasets for their projects.
Here are organizations to contact about their datasets or with datasets ready to be pulled directly from their websites.Â
Cancer Genome Atlas Cervical Kidney Renal Papillary Cell Carcinoma
This data collection is part of a larger effort to build a research community focused on connecting cancer phenotypes to genotypes by providing clinical images matched to subjects from The Cancer Genome Atlas (TCGA).
Cancer Genome Atlas Breast Invasive Carcinoma
This data collection is part of a larger effort to build a research community focused on connecting cancer phenotypes to genotypes by providing clinical images matched to subjects from The Cancer Genome Atlas (TCGA).
Cancer Genome Atlas Cervical Squamous Cell Carcinoma and Endocervical Adenocarcinoma
This data collection is part of a larger effort to build a research community focused on connecting cancer phenotypes to genotypes by providing clinical images matched to subjects from The Cancer Genome Atlas (TCGA).
Cancer Genome Atlas Colon Adenocarcinoma
This data collection is part of a larger effort to build a research community focused on connecting cancer phenotypes to genotypes by providing clinical images matched to subjects from The Cancer Genome Atlas (TCGA).
Cancer Genome Atlas Esophageal Carcinoma
This data collection is part of a larger effort to build a research community focused on connecting cancer phenotypes to genotypes by providing clinical images matched to subjects from The Cancer Genome Atlas (TCGA).
Cancer Genome Atlas Kidney Chromophobe
This data collection is part of a larger effort to build a research community focused on connecting cancer phenotypes to genotypes by providing clinical images matched to subjects from The Cancer Genome Atlas (TCGA).
Cancer Genome Atlas Liver Hepatocellular Carcinoma
This data collection is part of a larger effort to build a research community focused on connecting cancer phenotypes to genotypes by providing clinical images matched to subjects from The Cancer Genome Atlas (TCGA).
Cancer Genome Atlas Low Grade Glioma
This data collection is part of a larger effort to build a research community focused on connecting cancer phenotypes to genotypes by providing clinical images matched to subjects from The Cancer Genome Atlas (TCGA).
Cancer Genome Atlas Lung Adenocarcinoma
This data collection is part of a larger effort to build a research community focused on connecting cancer phenotypes to genotypes by providing clinical images matched to subjects from The Cancer Genome Atlas (TCGA).
Cancer Genome Atlas Ovarian Cancer
This data collection is part of a larger effort to build a research community focused on connecting cancer phenotypes to genotypes by providing clinical images matched to subjects from The Cancer Genome Atlas (TCGA).
Cancer Genome Atlas Prostate Adenocarcinoma
This data collection is part of a larger effort to build a research community focused on connecting cancer phenotypes to genotypes by providing clinical images matched to subjects from The Cancer Genome Atlas (TCGA).
Cancer Genome Atlas Rectum Adenocarcinoma
This data collection is part of a larger effort to build a research community focused on connecting cancer phenotypes to genotypes by providing clinical images matched to subjects from The Cancer Genome Atlas (TCGA).
Cancer Genome Atlas Sarcoma
This data collection is part of a larger effort to build a research community focused on connecting cancer phenotypes to genotypes by providing clinical images matched to subjects from The Cancer Genome Atlas (TCGA).
Cancer Genome Atlas Stomach Adenocarcinoma
This data collection is part of a larger effort to build a research community focused on connecting cancer phenotypes to genotypes by providing clinical images matched to subjects from The Cancer Genome Atlas (TCGA).
Cancer Genome Atlas Thyroid Cancer
This data collection is part of a larger effort to build a research community focused on connecting cancer phenotypes to genotypes by providing clinical images matched to subjects from The Cancer Genome Atlas (TCGA).
Cancer Genome Atlas Urothelial Bladder Carcinoma
This data collection is part of a larger effort to build a research community focused on connecting cancer phenotypes to genotypes by providing clinical images matched to subjects from The Cancer Genome Atlas (TCGA).
Cancer Genome Atlas Uterine Corpus Endometrial Carcinoma
This data collection is part of a larger effort to build a research community focused on connecting cancer phenotypes to genotypes by providing clinical images matched to subjects from The Cancer Genome Atlas (TCGA).
CHESTXRAY14
Frontal view chest X-ray images labeled considering 14 common thorax disease conditions.
MURA
MURA (musculoskeletal radiographs) is a large dataset of bone X-rays. Algorithms are tasked with determining whether an X-ray study is normal or abnormal.
Osteoarthritis Initiative
This is a multi-center, longitudinal, prospective observational study of knee osteoarthritis (OA). The overall aim of the OAI is to develop a public domain research resource to facilitate the scientific evaluation of biomarkers for osteoarthritis as potential surrogate endpoints for disease onset and progression.