Data Engineer

Seattle, WA
Oct 07, 2021
Required Education
Bachelors Degree
Position Type
Full time

Just-Evotec Biologics is seeking a highly motivated data engineer that desires a significant opportunity to improve worldwide access to biotherapeutics. The position will support the development and application of models, visualizations, and processing systems for large scale data analysis across the entire biotherapeutic development pipeline from antibody discovery and design through to process development and GMP manufacturing. The successful candidate will have strong working knowledge of relational database management systems and experience building databases, data warehouses, and/or data pipelines. The successful candidate must also possess strong communication and collaboration skills, a desire to learn new scientific concepts and domains, and an ability to work directly with scientific staff across multiple disciplines.


  • In collaboration with data scientists, build a data warehouse (or related solution) to enable interactive analysis and visualization of scientific data stored in disparate databases

  • Work with data scientists to develop and manage web applications/dashboards to facilitate open access to lab data for historical and ongoing experiments

  • Identify data leaks, working with scientists throughout the company, and implement plans to capture missing information in centralized databases

  • Develop data pipelines to seamlessly connect large datasets to data science modeling environments

Qualifications and Education Requirements:

  • BS or MS in data science, engineering, physical sciences, mathematics, computer science, or related field, with a data science/engineering focus

  • Coding proficiency in Python, with experience collaborating on projects via GitHub or Bitbucket strongly preferred

  • Understanding of relational databases and experience with database design

Preferred Qualifications:

  • Demonstrated strength in SQL, Query performance tuning, Data modeling, ETL development, and Data warehousing

  • Experience building data pipelines to support model development and deployment in python deep learning libraries (esp. tensorflow)

  • Experience with Dash, Flask, or related dashboarding framework in Python 

  • Familiarity with python core data science libraries in python (e.g. pandas, numpy, sklearn)

  • Enthusiasm for mission-driven life sciences research and development

  • Experience working with scientific or manufacturing data, especially bioprocessing time series and sensor data

  • Excellent verbal and written communication skills with ability to organize, analyze/interpret, and effectively communicate results

  • Strong focus on quality and attention to detail with effective task/time management organizational skills

About Just Evotec Biologics

Just – Evotec Biologics, wholly-owned by Evotec SE, is a unique platform company that integrates the design, engineering, development, and manufacture of biologics. With deep experience in the fields of protein, process and manufacturing sciences, the Just team came together to solve the scientific and technical hurdles that block access to life-changing protein therapeutics; from the design of therapeutic molecules to the design of the manufacturing plants used to produce them. Just’s focus is to create access and value for a global market through scientific and technological innovation. Our state-of-the-art labs and cGMP clinical manufacturing plant are co-located in Seattle’s South Lake Union neighborhood – the center of Seattle’s medical, global health, and technology industries and a noted top emerging life science hub in the U.S. Our fast-growing team of 200+ employees is expanding Just’s innovative platform and footprint – building our first North American J.POD® commercial manufacturing facility in the Seattle area.