Skip to main content

This job has expired

You will need to login before you can apply for a job.

R&D Data Engineer

Employer
Regeneron Pharmaceuticals, Inc.
Location
Tarrytown, New York
Start date
Nov 19, 2021

View more

Discipline
Science/R&D, Research
Required Education
Doctorate/PHD/MD
Position Type
Full time
Hotbed
Pharm Country

Job Details

The Regeneron Genetics Center’s Genome Informatics & Data Engineering R&D team is looking for a Data Engineer to support the design, development, and maintenance of our genetic data lake/lakehouse, as well as the adjacent database systems, ETL pipelines, analytical tools, and APIs. You will work heavily within our Databricks/Apache Spark ecosystem on AWS and help to expand and improve our technology stack and data pipelines that support one of the largest genomic data lakes in the world. As part of the R&D team, you will be responsible for supporting core data infrastructure but will also have the ability to contribute to innovative systems & methods development efforts.

In this role, a typical day might include the following:

  • Engineer & maintain production data pipelines to support our data lake within a lakehouse paradigm, including custom ETL/ELTs, pipeline CI/CD, and table optimization processes.
  • Design software and infrastructure for data consumers, including performant APIs for application backends, library APIs for analytic pipelines and data scientists, and interfaces with low-latency operational databases.
  • Contribute to pipeline development, orchestration, and productionalization for high-performance informatics & analytics tools.
  • Interact and collaborate with other scientific and technical teams to identify & unify new data assets, ensure efficient and interoperable systems, and identify new technologies and services to address bottlenecks.
  • Contribute to the design and implementation of a relational database/common data model for integrating core data assets across teams.
  • Support application development team with robust, performant backend systems
  • Contribute to bioinformatic & data analytics software development projects as needed, working with domain professionals in genome informatics, high-performance data analytics, genetics, and clinical informatics. Opportunities to contribute to open-source projects, such as Project Glow, are available.
  • Provide technical expertise and architectural leadership to improve the value, usability, and interoperability of core RGC data assets
  • Keep abreast of state-of-the-art technologies, promoting innovative solutions and ensuring the RGC has the technical resources to stay at the forefront of the industry.

This job might be for you if:
  • You have an interest in the health & life-sciences data domain.
  • You have a willingness to support and contribute to R&D efforts developing new methods & systems improving data value & time-to-discovery.
  • With your sleeves rolled up, you work on current problems while thinking of future solutions.
  • You have excellent written and verbal communication skills with expertise in documentation and visual presentations.

To be considered for this role you must have B.S. with 3+ years of relevant experience or M.S./Ph.D in computer science, data/software engineering, or a related field. Experience in engineering high-performance big data systems, with robust ETL pipelines and supporting broad consumer use-cases. Knowledge of cloud-based data technologies such as Apache Spark, Hadoop, Relational databases/SQL/NoSQL, Databricks Delta, and AWS services including Athena, Glue, RDS, Redshift, Kinesis. Deep technical foundation in data engineering, software engineering, backend/database design & management, machine learning operations and distributed systems. Domain knowledge in genomics, bioinformatics, or statistical genetics is a plus Experience in/willingness to work in Databricks Apache Spark, Python, SQL, Scala, Unix, AWS Cloud, Serverless/microservices

#LI-EG2

Does this sound like you? Apply now to take your first steps toward living the Regeneron Way! We have an inclusive and diverse culture that provides amazing benefits including health and wellness programs, fitness centers and stock for employees at all levels!

Regeneron is an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion or belief (or lack thereof), sex, nationality, national or ethnic origin, civil status, age, citizenship status, membership of the Traveler community, sexual orientation, disability, genetic information, familial status, marital or registered civil partnership status, pregnancy or maternity status, gender identity, gender reassignment, military or veteran status, or any other protected characteristic in accordance with applicable laws and regulations. We will ensure that individuals with disabilities are provided reasonable accommodations to participate in the job application process. Please contact us to discuss any accommodations you think you may need. role.

Company

Regeneron is a leading biotechnology company that invents life-transforming medicines for people with serious diseases. Founded and led for 30 years by physician-scientists, our unique ability to repeatedly and consistently translate science into medicine has led to seven FDA-approved treatments and numerous product candidates in development, all of which were homegrown in our laboratories. Our medicines and pipeline are designed to help patients with eye disease, allergic and inflammatory diseases, cancer, cardiovascular and metabolic diseases, infectious diseases, pain and rare diseases.
 
Regeneron is accelerating and improving the traditional drug development process through our proprietary VelociSuite® technologies, such as VelocImmune® which produces optimized fully-human antibodies, and ambitious research initiatives such as the Regeneron Genetics Center, which is conducting one of the largest genetics sequencing efforts in the world.

Stock Symbol: REGN

Stock Exchange: NASDAQ

FacebookTwitterInstagramYouTube Logo

Company info
Website
Phone
914-847-7000
Location
Corporate Headquarters
777 Old Saw Mill River Road
Tarrytown
New York
10591
United States

Get job alerts

Create a job alert and receive personalized job recommendations straight to your inbox.

Create alert