{"_id":"5b0e13ffc4664e0003c75aba","category":{"_id":"5b0e13ffc4664e0003c75aaa","project":"5b0e13ffc4664e0003c75a66","version":"5b0e13ffc4664e0003c75a67","__v":0,"sync":{"url":"","isSync":false},"reference":false,"createdAt":"2016-12-05T15:44:15.650Z","from_sync":false,"order":6,"slug":"datasets-hub","title":"DATASETS HUB"},"project":"5b0e13ffc4664e0003c75a66","parentDoc":null,"__v":0,"user":"5613e4f8fdd08f2b00437620","version":{"_id":"5b0e13ffc4664e0003c75a67","project":"5b0e13ffc4664e0003c75a66","__v":4,"createdAt":"2015-09-17T16:58:03.490Z","releaseDate":"2015-09-17T16:58:03.490Z","categories":["5b0e13ffc4664e0003c75a68","5b0e13ffc4664e0003c75a69","5b0e13ffc4664e0003c75a6a","5b0e13ffc4664e0003c75a6b","5b0e13ffc4664e0003c75a6c","5b0e13ffc4664e0003c75a6d","5b0e13ffc4664e0003c75a6e","5b0e13ffc4664e0003c75a6f","5b0e13ffc4664e0003c75a70","5b0e13ffc4664e0003c75a71","5b0e13ffc4664e0003c75a72","5b0e13ffc4664e0003c75a73","5b0e13ffc4664e0003c75a74","5b0e13ffc4664e0003c75a75","5b0e13ffc4664e0003c75a76","5b0e13ffc4664e0003c75a77","5b0e13ffc4664e0003c75a89","5b0e13ffc4664e0003c75a8a","5b0e13ffc4664e0003c75a9d","5b0e13ffc4664e0003c75a9e","5b0e13ffc4664e0003c75a9f","5b0e13ffc4664e0003c75aa0","5b0e13ffc4664e0003c75aa1","5b0e13ffc4664e0003c75aa2","5b0e13ffc4664e0003c75aa3","5b0e13ffc4664e0003c75aa4","5b0e13ffc4664e0003c75aa5","5b0e13ffc4664e0003c75aa6","5b0e13ffc4664e0003c75aa7","5b0e13ffc4664e0003c75aa8","5b0e13ffc4664e0003c75aa9","5b0e13ffc4664e0003c75aaa","5b0e13ffc4664e0003c75aab","5b0e13ffc4664e0003c75aac","5b0e13ffc4664e0003c75aad","5b0e13ffc4664e0003c75aae","5b0e13ffc4664e0003c75aaf","5b0e13ffc4664e0003c75ab2","5bb3374f4306ad0003eb18e7","5bbf3c5373e72a000318362b","5bc065567d1cb0000384c649","5cbf19a5f9181f0033fbb968"],"is_deprecated":false,"is_hidden":false,"is_beta":true,"is_stable":true,"codename":"","version_clean":"1.0.0","version":"1.0"},"githubsync":"","metadata":{"title":"","description":"","image":[]},"updates":["5888bf6752d5b70f004e33fb","5a398eb7467a790034961bec","5a4642b03f866700300d97b3","5a6f83bd9b29600012a75988","5a92daa420cacd00127d563c"],"next":{"pages":[],"description":""},"createdAt":"2016-12-05T15:48:28.014Z","link_external":false,"link_url":"","sync_unique":"","hidden":false,"api":{"results":{"codes":[]},"settings":"","auth":"required","params":[],"url":""},"isReference":false,"order":0,"body":"## Overview\n\nDataSTAGE powered by Seven Bridges hosts 20 TOPMed studies (datasets)  which you can use in your genomics analyses and are organized into the following categories:\n\n  * [Heart disease](doc:about-datasets#section-heart-disease)\n  * [Lung disease](doc:about-datasets#section-lung-disease)\n  * [Medical Genetics and Human Variation](doc:about-datasets#section-medical-genetics-and-human-variation)\n\n\n### Heart disease\n\n  The following hart disease related studies are available:\n\n* [The Jackson Heart Study](doc:jackson-heart-study)\n*  [Genomic Activities such as Whole Genome Sequencing and Related Phenotypes in the Framingham Heart Study](doc:genomic-activities-such-as-whole-genome-sequencing-and-related-phenotypes-in-the-framingham-heart-study)\n* [Genetics of Cardiometabolic Health in the Amish](doc:genetics-of-cardiometabolic-health-in-the-amish)\n* [MESA and MESA Family AA-CAC](doc:mesa-and-mesa-family-aa-cac)\n* [Partners HealthCare Biobank](doc:partners-healthcare-biobank) \n* [Trans-Omics for Precision Medicine (TOPMed) Whole Genome Sequencing Project: ARIC](doc:trans-omics-for-precision-medicine-topmed-whole-genome-sequencing-project-aric)\n* [GOLDN Epigenetic Determinants of Lipid Response to Dietary Fat and Fenofibrate](doc:goldn-epigenetic-determinants-of-lipid-response-to-dietary-fat-and-fenofibrate)\n* [Diabetes Heart Study (DHS) African American Coronary Artery Calcification (AA CAC)](doc:diabetes-heart-study-dhs-african-american-coronary-artery-calcification-aa-cac)\n* [Women's Health Initiative](doc:womens-health-initiative)\n* [San Antonio Family Heart Study](doc:san-antonio-family-heart-study)\n* [GeneSTAR (Genetic Study of Atherosclerosis Risk)](doc:nhlbi-top-med-gene-star)  \n* [Genetic Epidemiology Network of Arteriopathy (GENOA)](doc:genetic-epidemiology-network-of-arteriopathy)\n* [Massachusetts General Hospital (MGH) Atrial Fibrillation Study](doc:massachusetts-general-hospital-mgh-atrial-fibrillation-study)\n* [Heart and Vascular Health Study (HVH)](doc:heart-and-vascular-health-study-hvh)\n* [The Vanderbilt Atrial Fibrillation Registry](doc:the-vanderbilt-atrial-fibrillation-registry)\n* [Cleveland Clinic Atrial Fibrillation (CCAF) Study](doc:cleveland-clinic-atrial-fibrillation-ccaf-study)\n* [The Vanderbilt Atrial Fibrillation Registry) Study](doc:the-vanderbilt-atrial-fibrillation-registry)\n* [Novel Risk Factors for the Development of Atrial Fibrillation in Women](doc:phs001040-novel-risk-factors-for-the-development-of-atrial-fibrillation-in-women)\n* [Trans-Omics for Precision Medicine (TOPMed) Whole Genome Sequencing Project: Cardiovascular Health Study](doc:phs001368-trans-omics-for-precision-medicine-topmed-whole-genome-sequencing-project-cardiovascular-health-study)\n* [Whole Genome Sequencing of Venous Thromboembolism (WGS of VTE)](doc:/phs001402-whole-genome-sequencing-of-venous-thromboembolism)\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n### Lung disease\n\nThe following lung disease related studies are available:\n\n* [Study of African Americans, Asthma, Genes and Environment (SAGE) Study](doc:study-of-african-americans-asthma-genes-and-environment-sage)\n* [Boston Early-Onset COPD Study](doc:boston-early-onset-copd-study)\n* [Genetic Epidemiology of COPD (COPDGene)](doc:genetic-epidemiology-of-copd)\n* [Genetics and Epidemiology of Asthma in Barbados](doc:genetics-and-epidemiology-of-asthma-in-barbados)\n\n### Medical Genetics and Human Variation\n\n* [International HapMap Project](doc:international-hapmap-project)\n\n \n## Metadata for datasets on DataSTAGE powered by Seven Bridges\n\nMetadata is data about the genomic information carried by files. It is data about the time, place, and manner in which the genomic data was obtained as well as the genomic data's source and type. You can use metadata on the Platform to browse and query datasets. Metadata describing datasets on the Platform consist of properties which describe the entities of each dataset.\n\nEntities are particular resources with UUIDs, such as files, cases, samples, and cell lines. These can be the subject of your query.\n\nProperties can either describe an entity or relate that entity to another entity. For instance, properties include an entity's vital status, gender, data format, or experimental strategy.\n\nView the metadata schema, which includes a list of entities and their related properties \n\nBelow, learn how to start working with datasets via their metadata on the visual interface.\n\n## Explore datasets using the visual interface\n\nThe Data Browser allows you to explore datasets using an interactive graphical interface. Start by building queries to filter data using various metadata attributes. Then, access these files for further analysis.\n\nTo access the Data Browser, click **Data** on the top navigation bar and select **Data Browser**. You'll see the screen below. Here, you can select the dataset to query.\n[block:image]\n{\n  \"images\": [\n    {\n      \"image\": [\n        \"https://files.readme.io/c856a27-about-datasets-1.png\",\n        \"about-datasets-1.png\",\n        605,\n        449,\n        \"#f4f5f5\"\n      ]\n    }\n  ]\n}\n[/block]\nTake advantage of pre-built example queries or build your own from scratch using metadata entities and properties. Learn more about queries in the [Data Browser](doc:about-the-data-browser).\n\nOnce you've located specific files using a Data Browser query, you can access this data for further analysis.","excerpt":"","slug":"about-datasets","type":"basic","title":"ABOUT DATASETS"}
## Overview DataSTAGE powered by Seven Bridges hosts 20 TOPMed studies (datasets) which you can use in your genomics analyses and are organized into the following categories: * [Heart disease](doc:about-datasets#section-heart-disease) * [Lung disease](doc:about-datasets#section-lung-disease) * [Medical Genetics and Human Variation](doc:about-datasets#section-medical-genetics-and-human-variation) ### Heart disease The following hart disease related studies are available: * [The Jackson Heart Study](doc:jackson-heart-study) * [Genomic Activities such as Whole Genome Sequencing and Related Phenotypes in the Framingham Heart Study](doc:genomic-activities-such-as-whole-genome-sequencing-and-related-phenotypes-in-the-framingham-heart-study) * [Genetics of Cardiometabolic Health in the Amish](doc:genetics-of-cardiometabolic-health-in-the-amish) * [MESA and MESA Family AA-CAC](doc:mesa-and-mesa-family-aa-cac) * [Partners HealthCare Biobank](doc:partners-healthcare-biobank) * [Trans-Omics for Precision Medicine (TOPMed) Whole Genome Sequencing Project: ARIC](doc:trans-omics-for-precision-medicine-topmed-whole-genome-sequencing-project-aric) * [GOLDN Epigenetic Determinants of Lipid Response to Dietary Fat and Fenofibrate](doc:goldn-epigenetic-determinants-of-lipid-response-to-dietary-fat-and-fenofibrate) * [Diabetes Heart Study (DHS) African American Coronary Artery Calcification (AA CAC)](doc:diabetes-heart-study-dhs-african-american-coronary-artery-calcification-aa-cac) * [Women's Health Initiative](doc:womens-health-initiative) * [San Antonio Family Heart Study](doc:san-antonio-family-heart-study) * [GeneSTAR (Genetic Study of Atherosclerosis Risk)](doc:nhlbi-top-med-gene-star) * [Genetic Epidemiology Network of Arteriopathy (GENOA)](doc:genetic-epidemiology-network-of-arteriopathy) * [Massachusetts General Hospital (MGH) Atrial Fibrillation Study](doc:massachusetts-general-hospital-mgh-atrial-fibrillation-study) * [Heart and Vascular Health Study (HVH)](doc:heart-and-vascular-health-study-hvh) * [The Vanderbilt Atrial Fibrillation Registry](doc:the-vanderbilt-atrial-fibrillation-registry) * [Cleveland Clinic Atrial Fibrillation (CCAF) Study](doc:cleveland-clinic-atrial-fibrillation-ccaf-study) * [The Vanderbilt Atrial Fibrillation Registry) Study](doc:the-vanderbilt-atrial-fibrillation-registry) * [Novel Risk Factors for the Development of Atrial Fibrillation in Women](doc:phs001040-novel-risk-factors-for-the-development-of-atrial-fibrillation-in-women) * [Trans-Omics for Precision Medicine (TOPMed) Whole Genome Sequencing Project: Cardiovascular Health Study](doc:phs001368-trans-omics-for-precision-medicine-topmed-whole-genome-sequencing-project-cardiovascular-health-study) * [Whole Genome Sequencing of Venous Thromboembolism (WGS of VTE)](doc:/phs001402-whole-genome-sequencing-of-venous-thromboembolism) ### Lung disease The following lung disease related studies are available: * [Study of African Americans, Asthma, Genes and Environment (SAGE) Study](doc:study-of-african-americans-asthma-genes-and-environment-sage) * [Boston Early-Onset COPD Study](doc:boston-early-onset-copd-study) * [Genetic Epidemiology of COPD (COPDGene)](doc:genetic-epidemiology-of-copd) * [Genetics and Epidemiology of Asthma in Barbados](doc:genetics-and-epidemiology-of-asthma-in-barbados) ### Medical Genetics and Human Variation * [International HapMap Project](doc:international-hapmap-project) ## Metadata for datasets on DataSTAGE powered by Seven Bridges Metadata is data about the genomic information carried by files. It is data about the time, place, and manner in which the genomic data was obtained as well as the genomic data's source and type. You can use metadata on the Platform to browse and query datasets. Metadata describing datasets on the Platform consist of properties which describe the entities of each dataset. Entities are particular resources with UUIDs, such as files, cases, samples, and cell lines. These can be the subject of your query. Properties can either describe an entity or relate that entity to another entity. For instance, properties include an entity's vital status, gender, data format, or experimental strategy. View the metadata schema, which includes a list of entities and their related properties Below, learn how to start working with datasets via their metadata on the visual interface. ## Explore datasets using the visual interface The Data Browser allows you to explore datasets using an interactive graphical interface. Start by building queries to filter data using various metadata attributes. Then, access these files for further analysis. To access the Data Browser, click **Data** on the top navigation bar and select **Data Browser**. You'll see the screen below. Here, you can select the dataset to query. [block:image] { "images": [ { "image": [ "https://files.readme.io/c856a27-about-datasets-1.png", "about-datasets-1.png", 605, 449, "#f4f5f5" ] } ] } [/block] Take advantage of pre-built example queries or build your own from scratch using metadata entities and properties. Learn more about queries in the [Data Browser](doc:about-the-data-browser). Once you've located specific files using a Data Browser query, you can access this data for further analysis.