GWAS Bioinformatics Scientist

Location: Gaithersburg, MD, USA
Posted: Oct 24, 2018
Ref: JR79-21197
Hotbed: BioCapital
Required Education: Doctorate/PhD/MD
Position Type: Full time
The Cancer Genomics Research Laboratory (CGR) investigates the contribution of germline and somatic genetic variation to cancer susceptibility and outcomes in support of the Division of Cancer Epidemiology and Genetics (DCEG) at the National Cancer Institute (NCI). Working in concert with epidemiologists, biostatisticians, and basic research scientists in DCEG's intramural research program, CGR provides the capacity to conduct genome-wide discovery studies and targeted regional approaches to identify the heritable determinants of various forms of cancer. This includes the design and analysis of high-throughput studies using various types of “-omics” technologies, such as array- and sequence-based genome-wide association studies, studies of tumor characteristics using integrated genomic data analysis, and molecular epidemiologic studies based on novel metabolomic and microbiomic assays.

KEY ROLES/RESPONSIBILITIES

The Cancer Genomics Research Laboratory (CGR) in Gaithersburg, MD, is a fast-paced, high-throughput organization dedicated to supporting molecular, genetic, and epidemiologic studies for investigators at the National Cancer Institute's Division of Cancer Epidemiology and Genetics (DCEG). The Division includes over 70 principal investigators in epidemiology, genetics, and biostatistics who conduct multidisciplinary family- and population-based research to discover the genetic and environmental determinants of cancer and new approaches to cancer prevention.

We are seeking a highly motivated scientist to join the CGR bioinformatics team and provide analytical support to DCEG. Working with DCEG investigators, external collaborators, and CGR management and staff, the successful candidate will provide leadership and support for DCEG's extensive GWAS analytical efforts, specifically:
  • Accessing, extracting and preparing data for analysis, including combining data run on multiple platforms as well as externally generated data in support of meta-analyses
  • Harmonizing and maintaining diverse data with associated metadata
  • Executing state-of-the-art GWAS main-effect analyses, including data QC, imputation, population structure analysis, and association analyses
  • Organizing results into clear presentations (including QQ plots and Manhattan plots) and concise summaries of work, in formats useful for scientific interpretation
  • Developing and executing advanced analyses, including polygenic risk scores, pathway-based analyses, and integrative analyses
  • Working closely with DCEG PIs to support scientific manuscript development, submission, and revision, with significant coauthorship and potential lead-authorship opportunities

BASIC QUALIFICATIONS

To be considered for this position, you must minimally meet the knowledge, skills, and abilities listed below:
  • Possession of a doctoral degree from an accredited college/university in a field related to biomedical research as designated by the program requirements. Foreign degrees must be evaluated for U.S. equivalency
  • In addition to educational requirements, a minimum of two (2) years of progressively responsible scientific and/or complex system management/bioinformatics experience
  • Strong knowledge of genome-wide association analysis and interpretation, and of applied computational research on large multivariate datasets
  • Expertise in algorithmic implementation, statistical programming, and data manipulation using tools such as R/Bioconductor, MATLAB, and Python, along with a wide range of contemporary open-source bioinformatics tools and database structures (e.g. PLINK, SNPTEST, GLU, IMPUTE2, BEAGLE, UCSC Genome Browser)
  • Programming experience with Shell, Python, Perl, C/C++, and/or Java

PREFERRED QUALIFICATIONS

Candidates with these desired skills will be given preferential consideration:
  • A Ph.D. in bioinformatics, statistics, genetics, or computational biology
  • Pipeline development experience
  • Familiarity with publicly available data sources (such as dbGaP, TCGA, ENCODE, 1000 Genomes, TARGET, GTEx) and diverse genomic annotations
  • Experience managing large datasets and computational tasks, and working in a Linux environment (especially a compute cluster environment)
  • Cloud computing experience
  • Team-oriented, with excellent written and verbal communication skills and a demonstrated ability to self-educate in current bioinformatics techniques and resources
  • Excellent organizational skills and attention to detail

EXPECTED COMPETENCIES
  • Careful attention to detail. Must be analytical, flexible, innovative, and self-motivated
  • Ability to organize and execute multiple projects in parallel
  • Ability to carry out a wide range of high-throughput data analyses and to troubleshoot with minimal supervision
  • Excellent written and oral communication: ability to communicate clearly and effectively with staff and other groups within CGR and DCEG, to interpret data, and to write scientific reports as well as other written communications
  • Computer literacy (Microsoft Office products, general computer proficiency)


Equal Opportunity Employer (EOE) | Minority/Female/Disabled/Veteran (M/F/D/V) | Drug Free Workplace (DFW)