Drug Discovery Data Scientist (NCI)

San Francisco, California
Jan 11, 2018
Required Education
Masters Degree/MBA
Position Type
Full time


The mission of the Data Science and Information Technology Program (DSITP) is to leverage leading-edge data science and information technology skills and capabilities to provide the best tools and services to accelerate translation of biomedical data to scientific discoveries, medical treatments, and diagnostic and prevention tools for cancer and AIDS patients.

The DSITP Strategic and Data Science Initiatives Group has a focus to establish new collaborations in computing and data science to address cancer challenges and accelerate cancer research. The group’s efforts include developing the capacity within the NCI intramural research community to increasingly employ high-performance computing, as well as establishing a longer-term direction and path forward for broader NCI, academic, government, and private industry participation in predictive oncology.

The Accelerating Therapeutics for Opportunities in Medicine (ATOM) Consortium is a public-private partnership with the mission of transforming drug discovery by accelerating the development of more effective therapies for patients.  ATOM founding members are GSK, Lawrence Livermore National Laboratory, Frederick National Laboratory, and the University of California, San Francisco.  The ATOM consortium members will develop, test, and validate a multidisciplinary approach to drug discovery in which modern science, technology and engineering, supercomputing simulations, data science, and artificial intelligence are highly integrated into a single drug-discovery platform that can ultimately be shared with the drug development community at large. 

Frederick National Laboratory is hiring a Senior Drug Discovery Data Scientist to work in computationally-driven cancer drug discovery as part of the ATOM Consortium.   

This position is based in San Francisco, California.


  • Provide technical and strategic input to develop, implement and maintain a data and model lake to support the ATOM consortium
  • Acquire, augment, analyze and act on complex, high-dimensional data sets using high performance computing and advanced algorithms
  • Computational models R&D: includes building predictive computational models to accelerate cancer drug discovery and assessing the predictivity and uncertainty of computational models used by the ATOM consortium
  • Be able to work independently as part of a multi-disciplinary team to support the ATOM Consortium scientific and technical goals
  • Create and maintain strong professional relationships with all ATOM stakeholders
  • Foster and nurture collaborative work environment



To be considered for this position, you must minimally meet the knowledge, skills, and abilities listed below:

  • Possession of a Master’s degree in a related discipline from an accredited college or university according to the Council for Higher Education Accreditation. (Additional qualifying experience may be substituted for the required education). Foreign degrees must be evaluated for U.S. equivalency
  • In addition to the education requirements, a minimum of eight (8) years of progressively responsible job-related experience including three (3) years of experience in a leadership capacity.
  • Familiarity with large datasets, handling of biomedical datasets and understanding of data analysis workflows is required.
  • Proficiency with one or more programming language such as Java, Perl, Python, R with data analysis packages or modules is required.
  • Must be able to obtain and maintain a security clearance.


Candidates with these desired skills will be given preferential consideration:

  • Ph.D. in a relevant discipline
  • A strong background in any one or more of the following: cancer biology, pharmaceutical or medicinal chemistry, pharmacology, toxicology, drug discovery, computational biology, in silico biology, bioinformatics, biostatistics, computational science, systems biology, molecular dynamics, multiscale modeling, machine learning, adaptive learning or high-performance computing.

Expected Competencies:

  • Analytical approach to defining scientific questions/design of scientifically testable hypotheses
  • Must be detail-oriented and possess strong organizational and analytical skills with the ability to prioritize multiple tasks/projects
  • Exceptional written and oral communication skills in order to work closely with numerous stakeholders from diverse organizations
  • Proficiency with Microsoft Office products