Data Scientist (NCI)

Frederick, Maryland
Jan 31, 2018
Required Education
Bachelors Degree
Position Type
Full time


The Advanced Biomedical Computing Center (ABCC) provides technology development, scientific consultation, collaboration, data analysis and training to the NCI and NIH scientists and staff. The Core Infrastructure and Systems Biology (CISB) group in ABCC strives to streamline and provide innovative solutions for the NCI/NIH community to access and use biological information collected across different sources and formats. Integrating diverse data sources to enable disease agnostic access and analysis, variant impact annotation, identifier conversions across species, and merging clinical and research data enables translation from basic to the goal of precision medicine.


  • Develop custom computational pipelines to mine information from diverse data types and gain research insights
  • Integrate data from biological annotation databases, structured experimental and clinical data, and unstructured text to develop applications that are easy to use by the researchers
  • Implement machine learning and statistical modeling to predict patterns in the data
  • Document approaches and mechanisms clearly and comprehensively.
  • Research and proof cutting edge methods for data mining and data presentation


To be considered for this position, you must minimally meet the knowledge, skills, and abilities listed below:

  • Possession of a Bachelor’s degree in Applied Mathematics, Computer Science or a related field from an accredited college or university according to the Council for Higher Education Accreditation. (Additional qualifying education may be substituted for the required expertise). Foreign degrees must be evaluated for U.S. equivalency
  • In addition to the educational requirements, a minimum of two (2) years of experience is required
  • Expertise in mining scientific documents using NLP
  • Demonstrated expertise in data processing languages such as Python or R
  • Expertise in developing custom data mining algorithms
  • Strong drive and initiative to explore new technologies in data analytics and knowledge mining
  • Excellent written and verbal communication skills / ability to document and communicate complex technical concepts for a variety of audiences
  • Must be able to obtain and maintain a security clearance


Candidates with these desired skills will be given preferential consideration:

  • Master’s degree in Applied Mathematics or Computer Sciences or a related field
  • Experience with large scale data mining using Hadoop, Spark or other similar infrastructures
  • Experience in developing data integration and visualization tools
  • Knowledge of current bioinformatics experiments and analysis methods including microarray and NGS analysis, to understand the data provenance of how the data was generated, underlying biology and any associated complexities