Summary
"Task 5.2 : Unified infectious disease-related cohort data portal
We will implement a unified data portal that gives researchers a single point of access to comprehensive infectious disease cohorts, from the Consortium and beyond. The portal will combine detailed descriptions with direct access to datasets held in the underlying data repositories, the European Nucleotide Archive (ENA) and the European Genome-phenome Archive (EGA), and in the connected cohort data hubs built upon these repositories. Following an initial back-fill of the portal, new datasets from infectious disease-related cohorts will be automatically identified in each archive or data hub, catalogued in the portal's back-end database and rapidly displayed through both the intuitive web portal interface and a well-documented programmatic API. This integration of data from multiple archive locations has been successfully implemented in previous large projects such as the European Virus Archive (http://www.european-virus-archive.com/evag-portal) and, more recently, HipSci (http://www.hipsci.org/lines/#/lines). These portals provide extensive metadata, clear visualisation of the available datasets from a range of archive locations and direct access to the underlying data in each archive, including support for batch processing and information on applying for access to managed datasets. For this task we will specifically reuse technical components from the HipSci data portal, developed at EMBL-EBI."
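The cataloguing step described above (back-fill first, then automatic identification of new datasets) could be sketched roughly as follows. This is only an illustrative sketch, not the portal's actual implementation: the `DatasetRecord` fields, the table layout, the function names and the example accessions are all assumptions, and an in-memory SQLite database stands in for the portal's back-end database. No real ENA or EGA API is called; records are assumed to have been fetched already.

```python
import sqlite3
from typing import Iterable, List, NamedTuple


class DatasetRecord(NamedTuple):
    """Hypothetical minimal description of one dataset in an archive."""
    archive: str    # e.g. "ENA" or "EGA"
    accession: str  # identifier assigned by the archive
    cohort: str     # cohort the dataset belongs to
    managed: bool   # True if access has to be applied for


def open_catalogue() -> sqlite3.Connection:
    """Create an in-memory stand-in for the portal's back-end database."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE datasets ("
        " archive TEXT NOT NULL,"
        " accession TEXT NOT NULL,"
        " cohort TEXT NOT NULL,"
        " managed INTEGER NOT NULL,"
        " PRIMARY KEY (archive, accession))"
    )
    return conn


def catalogue_new(conn: sqlite3.Connection,
                  records: Iterable[DatasetRecord]) -> List[DatasetRecord]:
    """Insert records not yet catalogued and return only those newly added.

    Records already present (same archive and accession) are skipped, so the
    same function serves both the initial back-fill and later update runs.
    """
    added = []
    for rec in records:
        cur = conn.execute(
            "INSERT OR IGNORE INTO datasets VALUES (?, ?, ?, ?)",
            (rec.archive, rec.accession, rec.cohort, int(rec.managed)),
        )
        if cur.rowcount == 1:  # 0 means the row already existed
            added.append(rec)
    conn.commit()
    return added


# Usage: a back-fill followed by an update run that contains one new dataset.
# The accessions below are made-up placeholders, not real archive identifiers.
conn = open_catalogue()
backfill = [
    DatasetRecord("ENA", "EXAMPLE-1", "cohort-a", False),
    DatasetRecord("EGA", "EXAMPLE-2", "cohort-a", True),
]
catalogue_new(conn, backfill)
update = backfill + [DatasetRecord("ENA", "EXAMPLE-3", "cohort-b", False)]
newly_added = catalogue_new(conn, update)
```

Keying the catalogue on the archive and accession pair makes repeated runs idempotent, so a periodic job could re-submit everything it finds in each archive or data hub and only the genuinely new datasets would be returned for display.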
More information & hyperlinks