Senior Data Scientist

San Diego, CA, US
Dec 05, 2018
Required Education
Bachelors Degree
Position Type
Full time
Celgene is a global biopharmaceutical company leading the way in medical innovation to help patients live longer, better lives. Our purpose as a company is to discover and develop therapies that will change the course of human health. We value our passion for patients, quest for innovation, spirit of independence and love of challenge. With a presence in more than 70 countries, and growing - we look for talented people to grow our business, advance our science and contribute to our unique culture.

Responsibilities include, but are not limited to, the following:

  • Design, build and manage internal data repositories that integrate vast amounts of genomic, phenotypic and screening data from public, internal and partner sources.
  • Implement data solutions to disseminate and visualize datasets using contemporary application platforms (Shiny, Spotfire, etc)
  • Enable knowledge collection across R/ED by creating knowledge bases from the analysis of public and internal data sets and integration with annotation resources
  • Use skills to enable on-going innovation of data management systems, processes and procedures to enhance R/ED productivity
  • Make data, including raw/interim data, available to R/ED department personnel as required
  • Acquire user feedback to inform business requirements for future data systems development.
  • Help develop, enhance, and automate processes for queuing and prioritizing data management and curation requests

Experience and Education

  • Bachelor's degree in a relevant discipline with at least 14 years' experience, Master's degree with at least 12 years' experience or PhD with at least 6 years' experience in biomedical data management, assay development, specimen data management or related discipline
  • Demonstrated proficiency with molecular biology assay concepts and ability to support, develop and deploy laboratory and other research data management processes and procedures as they apply to complex, high dimensional data sets
  • Extensive practical experience working with diverse but highly-connected scientific knowledge collections and their query interfaces to enable research hypotheses around compound targets, mechanisms of action, and patient response
  • Demonstrated proficiency with current software engineering methodologies, such as Agile, source control, project management and issue tracking
  • Working knowledge of cloud computing. Preference will be given to candidates with AWS experience
  • Working knowledge of Rest APIs and container strategies strongly preferred.
  • Knowledge of distributed database design and implementation
  • Excellent skills in R programming and experience in additional computer languages such as Perl, Python, or Java (or C/C++)
  • Experience producing visualization of data sets (eg., R/shiny, Spotfire, etc)
  • Working knowledge of both Windows and Linux operating systems is required
  • Along with programming proficiency must have creativity, and show a strong capacity for independent thinking and the ability to grasp underlying biological questions
  • Must thrive in a complex, dynamic environment while adapting to dynamically changing priorities
  • Must have excellent written and verbal communication and presentation skills
Must have excellent time management and organizational skills

Celgene is committed to equal opportunity in the terms and conditions of employment for all employees and job applicants without regard to race, color, religion, sex, sexual orientation, age, gender identity or gender expression, national origin, disability or veteran status. Celgene complies with all applicable national, state and local laws governing nondiscrimination in employment as well as employment eligibility verification requirements of the Immigration and Nationality Act. All applicants must have authorization to work for Celgene in the U.S.