Software Developer (Genome Informatics)

Tarrytown, New York
May 14, 2022
Required Education
Bachelor's Degree
Position Type
Full time

The RGC’s Genome Informatics team is looking for a Software Engineer to assist with the design, implementation, testing and deployment of the codebases that support the production and R&D systems. Genome Informatics leads the primary and secondary analysis of more than 500,000 samples a year, including production pipelines, cloud computing infrastructure, and sequencing and variant quality control. We work closely with other RGC teams, and our extensive genomics production system supports multi-omics applications (RNA, long reads), unprecedented-scale variant calling, disease association studies, and locus-specific analyses that directly impact innovative drug development.

You will support our software engineering efforts to develop, implement and deploy at-speed and at-scale solutions for both production and R&D experiments.

In this role, a typical day might include the following:

  • Support software engineering efforts in GI to ensure 24/7 uptime and a rapid development cycle for production.

  • Develop, implement and troubleshoot software solutions in a cloud environment and distributed compute infrastructure (Spark).

  • Develop, maintain and document genomic pipelines to ensure at-speed and at-scale genomics production.

  • Develop applications for routine visualization of QC metrics, compute performance and resource usage for RGC production.

  • Work with the GI engineering lead to continuously optimize and improve the productionalization process.

This job might be for you if:

  • You have an interest in the health & life-sciences data domain.

  • You have a passion for bringing innovative technologies to the field.

  • You are thorough and focus on the quality of the final product.

  • You enjoy collaborating with multi-functional teams of scientists, analysts, and engineers.

  • With your sleeves rolled up, you work on current problems while thinking of future solutions.

To be considered for this role you must have a B.S. in Computer Science or a related field. You should also have at least 1-2 years' experience with Python and Bash as well as the Linux operating system (Ubuntu); a solid understanding of software engineering principles and standard methodologies, including SDLC, version control (Git) and CI/CD; and experience working with databases (SQL) and in a cloud environment (AWS or GCP). Experience with distributed systems (Hadoop, Spark, Kubernetes) is a plus. Experience with standard bioinformatics tools (e.g. Samtools, BCFtools, VCFtools, BWA, GATK, Picard, BEDtools, PLINK) is preferred.


Does this sound like you? Apply now to take your first steps toward living the Regeneron Way! We have an inclusive and diverse culture that provides amazing benefits, including health and wellness programs, fitness centers and stock for employees at all levels!

Regeneron is an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion or belief (or lack thereof), sex, nationality, national or ethnic origin, civil status, age, citizenship status, membership of the Traveler community, sexual orientation, disability, genetic information, familial status, marital or registered civil partnership status, pregnancy or maternity status, gender identity, gender reassignment, military or veteran status, or any other protected characteristic in accordance with applicable laws and regulations. We will ensure that individuals with disabilities are provided reasonable accommodations to participate in the job application process. Please contact us to discuss any accommodations you think you may need.