Sage Bionetworks is currently recruiting a research scientists with a strong background in statistical modeling, machine learning, and data analysis, particularly as applied to large-scale biological and/or cancer genomics data sets. We are active in a number of research areas including immuno-oncology, spatial-temporal tumor heterogeneity, and drug response which we study using state-of-the-art molecular, imaging, and clinical modalities. The research scientist will contribute to the DREAM Challenges – a community supporting data challenge – as well as large-scale consortiums including the Human Tumor Atlas Network, the Cancer Systems Biology Consortium, and the Physical Sciences Oncology Network.
• Conduct data QC, data harmonization, and data analysis on large-scale genomic data.
• Create workflows to facilitate collaboration across consortiums (e.g., Dockerize computational methods).
• Manage data access in support of consortiums (e.g., data transfer, meta-data management, and QC on Sage’s data sharing platform).
• Present results concisely and effectively to collaborators.
Example projects include:
- Implement a pipeline to compare existing image analysis algorithms across benchmark datasets.
- Develop or apply consensus clustering approaches to define expression signatures correlated with patient outcome.
- Model drug response using genomic, transcriptomic, and clinical features.
- Validate methylation markers of risk computationally.
• PhD in statistics, mathematics, physics, computational biology, computer science, bioinformatics, or related quantitative discipline.
• Masters degree in one of the above areas, 5+ years of significant relevant work experience, and a strong track record of statistical analysis and/or machine learning will also be considered.
• Proven expertise in state-of-the-art machine learning and statistical techniques, such as modeling (e.g., regularized regression, survival analysis, GLM), supervised learning (e.g., SVMs, neural networks), unsupervised learning (e.g., k-means), dimensionality reduction (e.g., PCA), and Bayesian analysis.
• Exceptional problem-solving skills, particularly the ability to address a defined problem or hypothesis (biological or otherwise) creatively and with limited supervision.
• Strong programming skills in R and/or Python.
• Experience working with high-dimensional biological data, such as gene expression, genomic, imaging, drug response, flow cytometry, or CyTOF. Immediate needs and emphasis are in RNA-seq, single-cell RNA-seq, single-cell imaging, video, and drug response data.
• Knowledge of biology, particularly cancer.
• Demonstrated excellence in research.
• Software development skills, including experience with version control software (e.g., github)
• Familiarity with cloud environments (especially AWS) and containerization approaches (principally Docker)
• A passion for open-access innovation.
• Strong collaboration, teamwork, presentation, and communication skills.
About Sage Bionetworks
Sage Bionetworks is a world-leading nonprofit biomedical research organization in Seattle, WA. We are dedicated to building and supporting open communities of collaborative research in human health and genomics. We are developing multiple initiatives designed to facilitate scientific collaborations and enable direct contributions of ideas and data from citizens to research projects. Sage embraces diversity and equity. We are based in Seattle, WA, and collaborate broadly throughout the world.