Amgen

Principal Data Engineer

Employer
Amgen
Location
Thousand Oaks, CA
Posted
Mar 22, 2019
Ref
R-67111
Required Education
Doctorate/PHD/MD
Position Type
Full time
Amgen is seeking Data Engineers to help realize Amgen's Operations Data Strategy. This program will produce business insights through data science solutions. You will build upon our awarded Enterprise Data Lake to develop value added data products that span the Operations Domain (Process Development, Supply Chain, Quality, Engineering, Manufacturing). There is no more challenging data environment than Life Sciences due to the integration of scientific research, manufacturing, logistics of pharmaceutical products. Expect to make a difference in providing patients with products that meet their medical needs in a competitive landscape.

Successful candidates will have:
  • The requisite technical skills;
  • Ability to synthesize business and technical constraints and requirements
  • the requisite technical skills;
  • The ability to absorb the nuances of the Bio-Tech operations value chain, including supply chain, logistics, and manufacturing source systems;
  • High personal standards of productivity and quality;
  • The ability to contribute in a collaborative and fast paced environment;
  • Able to join-in with hands-on development tasks;
  • Able to function as scrum master for the Data Engineering Team;
  • Able to explain data architecture decisions and strategy to management.


Key Activities for the Data Engineer include:
  • Defines and approves complex data product architectures for product and projects
  • Decides when a new design pattern is needed to fulfill specific requirements
  • Owns budget responsibility within the context of project planning.
  • Collaborate with Data Architects, Business SME's, and Data Scientists to architect data products and services.
  • Provide architectural oversight for processes which perform data transformation, metadata extraction, workload management and error processing management.
  • Lead the design and planning to implement standardized, automated operational and quality control processes to deliver accurate and timely data and reporting to meet or exceed SLAs.
  • Drive the exploration and adoption of new tools, and techniques and propose improvements to the data pipeline
  • Integrate the operations data platform with the Data Scientist workbench, the Data Marketplace, and Analytic Tools such as Tableau, Spotfire, R, etc.
  • Act as a product manager for the operations data platform backlog
  • Act as a run manager, provide Run/DevOps support


Basic Qualifications:

Doctorate degree and 2 years of Information Systems experience

OR

Master's degree and 6 years of Information Systems experience

OR

Bachelor's degree and 8 years of Information Systems experience

OR

Associate degree and 10 years of Information Systems experience

OR

High school diploma / GED and 12 years of Information Systems experience

Preferred Qualifications:

  • BS/MS degree in Computer Science, Engineering or related field
  • 5 or more years of experience designing complex and inter - dependent data models for analytic , Machine learning use cases.
  • 8 or more years of experience architecting and building processes that extract, process and add value to data sets from multiple source systems.
  • 5 or more years of experience architecting and building processes that extract, process and add value to data sets from multiple source systems.
  • Experiencing with data modeling and tuning of relational as well as NoSQL datastores (Oracle, Red-shift, Impala, HDFS/Hive, Athena, etc.)
  • Experience working with distributed computing tools (Spark, Hive, etc.)
  • Experience with AWS cloud services: EC2, EMR, RDS, Redshift, S3, Lambda
  • Experience with data pipeline and workflow management tools: Airflow, etc.
  • 5 or more years of experience with one or more general purpose programming languages, including but not limited to: Java, Scala, C, C++, C#, Swift/Objective C, Python, or JavaScript.
  • 5 or more years experience working with leading agile development methodologies such as Sprint and Scrum
  • Experience with Software engineering best-practices, including but not limited to version control (Git, TFS, Subversion, etc.), CI/CD (Jenkins, Maven, Gradle, etc.), automated unit testing, Dev Ops.
  • Experience with Semantic technologies and approaches is a plus.
  • Biotech / Pharma experience is a plus
  • Full stack development using infrastructure cloud services (AWS preferred) and cloud-native tools and design patterns (Containers, Serverless, Docker, etc) is a plus.

Amgen is committed to unlocking the potential of biology for patients suffering from serious illnesses by discovering, developing, manufacturing and delivering innovative human therapeutics. This approach begins by using tools like advanced human genetics to unravel the complexities of disease and understand the fundamentals of human biology.

Amgen focuses on areas of high unmet medical need and leverages its expertise to strive for solutions that improve health outcomes and dramatically improve people's lives. A biotechnology pioneer since 1980, Amgen has grown to be one of the world's leading independent biotechnology companies, has reached millions of patients around the world and is developing a pipeline of medicines with breakaway potential.