Wellcome Trust partners with Sage Bionetworks

Data re-use can generate new insights that in turn lead to vital health benefits. To stimulate and celebrate the innovative re-use of data, the Wellcome Trust launched the Wellcome Data Re-use Prizes. With the support of Sage Bionetworks and its Synapse platform, the Wellcome Data Re-use Prizes presented two awards, each based on a topic of strategic importance to Wellcome: AMR surveillance and malaria.

Winners of this year’s prizes (source: Wellcome Trust)

AMR surveillance

Antimicrobial resistance is one of the major public health concerns of this century. Entrants focused on the AMR Register, an open data resource launched by Wellcome’s Drug-resistant Infections programme and led by the Open Data Institute. The register has collected information from AMR surveillance programmes generated by the pharmaceutical industry.

The winning entry is Antibiotic Resistance: Interdisciplinary Action (AR:IA) (opens in a new tab). The team developed an interactive web app that lets users quickly visualise resistance rates to antibiotics for common infections and countries of interest. The data on the platform will help doctors to prescribe more appropriately in the face of local drug-resistance.

The winner and runners up were announced at the European Congress of Clinical Microbiology & Infectious Diseases (opens in a new tab).

See all the entries in this category (opens in a new tab).


The World Health Organization (WHO) considers nearly half the world’s population is at risk from malaria. In 2016 there were an estimated 216 million new cases and 445,000 deaths, mostly in sub-Saharan Africa.

Entrants focused on the Malaria Atlas Project, a Repository of Open Access Data (ROAD-MAP), launched with support from a Wellcome Biomedical Resources grant, and then funding from the Bill & Melinda Gates Foundation.

The repository contains a wealth of data on malaria risk and intervention coverage – all of which is free to be accessed, re-analysed and re-used by anyone.

The winning entry is Rethinking the Causal Relationship between Malaria and Anaemia for African Children (opens in a new tab). The team’s findings suggest anaemia is not enough to help us fully understand malaria prevalence in communities. They applied a novel approach to analysing the available data and their method could be used to help identify other factors at play.

The winners and runners up will be attending the London School of Hygiene & Tropical Medicine Malaria Centre’s ‘Eliminating Malaria: evidence, impact, and policy’ event (opens in a new tab).

See all the entries in this category (opens in a new tab).

Parkinson’s Disease Digital Biomarker DREAM Challenge

Recent advances in mobile health have demonstrated great potential to leverage sensor-based technologies for remote monitoring of health and disease – particularly for diseases affecting motor function such as Parkinson’s disease (PD). However, converting sensor-based data streams into digital biomarkers is complex and no methodological standards have yet evolved to guide this process.

The Parkinson’s Disease Digital Biomarker (PDDB) DREAM Challenge was first-of-its-kind, designed to benchmark methods for processing sensor data for development of digital signatures reflective of PD. Over 440 data experts from six continents participated in the PDDB DREAM Challenge.

The challenge was divided into two sub-challenges:

  • In the first sub-challenge, participants used accelerometry and gyroscope data from mPower – a large mobile health study where over 15,000 individuals with PD or controls used their iPhones to, among other things, perform short walk and balance tests – to extract features that were used to predict whether the user had PD.
  • In the second sub-challenge, participants extracted features for three different Parkinson’s symptoms from a study funded by The Michael J. Fox Foundation where patients performed tasks while wearing three wearable sensors. These features were used to predict clinician-assessed disease severity for tremor, dyskinesia and bradykinesia.

NCI-CPTAC DREAM Proteogenomics Challenge

Cancer is a disease of the genome: sequential accumulation of DNA mutations lead to both direct and indirect changes in the structure and abundance of the proteins that perform most of the functional activities within the cell. While methodologies for the analysis of cancer genomes and transcriptomes have undergone rapid benchmarking and standardization, our understanding of how best to analyze the cancer proteome remains less-developed. In particular, there are key questions remaining in how to infer the abundances of peptides not detected in a subset of samples, in optimizing database searches to detect cancer-specific peptides caused by point-mutations, alternative transcript isoforms or fusion genes, in understanding the association between DNA, mRNA and protein data, and in performing robust absolute quantitation.

To address these issues, we have set up the NCI-CPTAC DREAM Challenge as a community-based collaborative competition of researchers from across the world working together to answer key questions in cancer proteomics, focused around the integration of diverse data-types. The CPTAC effort includes the collection of  tumour/normal pairs with matched genomic, transcriptomic and proteomic characterization, which provide a unique data to run the proteogenomic challenge.

Digital Mammography DREAM Challenge

The goal of the Digital Mammography (DM) DREAM Challenge was to apply an open science, crowd-sourced approach to develop and assess algorithms for risk stratification of screening mammograms that can be used to improve breast cancer detection. These algorithms could potentially benefit interpretation of other tumor imaging, impacting a wide set of cancer patients. The DM Challenge encouraged teams to apply deep learning methods to a large set of mammography images of over 640k images from 80k women. Dozens of teams participated in this Challenge, resulting in the development of many novel approaches to cancer detection and the establishment of public standards and benchmarks in this field.

Read our our press release announcing the winners of the first phase of the DM Challenge.

Multiple Myeloma DREAM Challenge

Multiple myeloma (MM) is a cancer of the plasma cells in the bone marrow, and its clinical course depends on a complex interplay of clinical traits and molecular characteristics of the plasma cells. Since risk-adapted therapy is becoming standard of care, there is an urgent need for a precise risk stratification model to assist in therapeutic decision-making. While progress has been made, there remains a significant opportunity to improve patient stratification to optimize treatment and to develop new therapies for high-risk patients. The MM DREAM Challenge aims to accelerate the development and evaluation of such risk models in multiple myeloma.

ICGC-TCGA DREAM Somatic Mutation Calling – RNA Challenge

The ICGC-TCGA DREAM Somatic Mutation Calling – RNA (SMC-RNA) Challenge is a community-based collaborative competition of researchers from across the world charged to rigorously assess the accuracy of methods used in cancer-associated RNA-seq analysis, with focus in two areas: quantification of known isoforms and detection of novel fusion transcripts. The SMC-RNA Challenge will analyze dozens of samples created to have known alterations representing different tumor types, allowing confidence that the winning methods will be generalizable across the broad range of human cancers.