Lead Software Engineer, Clinical Informatics

Tarrytown, New York, United States of America
Jul 24, 2021
Required Education
Master's Degree/MBA
Position Type
Full time
The Clinical Informatics team is seeking an experienced Software Engineer/Informatics Developer who will be responsible for applications that optimize the Regeneron Genetics Center's (RGC's) clinical data infrastructure and who will contribute to the design and implementation of innovative applications. Key areas of focus are obtaining and transforming data from the UK Biobank, developing the backend infrastructure to support efficient queries and outward-facing analysis and visualization tools, and leveraging the RGC's distributed computing infrastructure. You will also build out a big data distributed systems architecture capable of efficiently processing terabytes of clinical and genomic data; collaborate with other team members to develop novel and scalable machine learning approaches for mining clinical and genetic data; and build automation around various components of our systems.

A typical day may include:

  • Maintaining, supporting, and refining the RGC's existing clinical data infrastructure
  • Leading all activities related to UK Biobank data
  • Leading technical aspects of the Clinical Informatics (CI) team's initiatives in data mining, advanced phenotyping, and data visualization
  • Creating clinical phenotype matrices and complex phenotypes from clinical data sources
  • Collaborating with the RGC-IT team to automate clinical informatics processes
  • Collaborating and coordinating with other development teams and efforts to integrate new and existing tools into both the RGC and wider REGN ecosystem
  • Interacting with key partners to clearly define and iterate on requirements
  • Keeping abreast of the latest advances in state-of-the-art software technologies

This role may be for you if:
  • You have strong analytical skills.
  • You can multitask and manage simultaneous projects to meet deadlines with strong attention to detail.
  • You can interpret and communicate analytical information in a clear, concise manner.

To be considered for this role, you must possess a Master's degree in Computer Science or a related field, with at least 1 year of experience in life sciences or other healthcare-related settings. You must also have:
  • Extensive experience with AWS and with programming in a modern object-oriented language, including Scala and Python, as well as client-side software development (HTML, JavaScript, jQuery, CSS, D3, Node.js, React)
  • Experience with SQL and NoSQL databases, including Hive, MongoDB, and MySQL
  • Experience with Python and Scala libraries used for data science and machine learning, including Pandas, scikit-learn, NumPy, SciPy, spaCy, Matplotlib, and NLTK
  • Linux command line and bash scripting experience
  • Experience analyzing the time complexities of key data structures and algorithms

Demonstrated experience working with UK Biobank data is preferred.


Does this sound like you? Apply now to take your first steps toward living the Regeneron Way! We have an inclusive and diverse culture that provides amazing benefits including health and wellness programs, fitness centers and stock for employees at all levels!

Regeneron is an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion or belief (or lack thereof), sex, nationality, national or ethnic origin, civil status, age, citizenship status, membership of the Traveler community, sexual orientation, disability, genetic information, familial status, marital or registered civil partnership status, pregnancy or maternity status, gender identity, gender reassignment, military or veteran status, or any other protected characteristic in accordance with applicable laws and regulations. We will ensure that individuals with disabilities are provided reasonable accommodations to participate in the job application process. Please contact us to discuss any accommodations you think you may need.