Automation Engineer - Scientific Data, AI/ML Pipelines & Integration Dev

SFVALLEY, LLC
Boston, United States of America
13 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Junior
Compensation
$140K

Job location

Boston, United States of America

Tech stack

API
Artificial Intelligence
Amazon Web Services (AWS)
Automation of Tests
Code Review
Data Cleansing
Data Integrity
ETL
Relational Databases
Database Queries
Software Debugging
Django
Amazon DynamoDB
GitHub
Identity and Access Management
Python
Laboratory Information Management Systems
Network Security
PostgreSQL
Machine Learning
MongoDB
MySQL
NoSQL
NumPy
Oracle Applications
Performance Tuning
Scrum
Systems Development Life Cycle
Mockito
TensorFlow
Software Deployment
SQL Databases
Systems Integration
Data Logging
Enterprise Software Applications
Test Driven Development
Feature Engineering
Data Ingestion
PyTorch
Flask
Delivery Pipeline
State Machines
Backend
FastAPI
Pandas
Pytest
Containerization
GitLab CI
scikit-learn
Information Technology
Atlassian Tools
Data Management
Machine Learning Operations
Functional Programming
REST
Data Pipelines
GxP
Docker
Jenkins
Microservices

Job description

Zifo is seeking a passionate Software Developer who can work at the intersection of science, data, and technology. The role requires strong expertise in Benchling, Python, SQL/NoSQL, AWS, and FastAPI, along with the ability to work directly with scientists performing assay-based experiments. The successful candidate will translate experimental workflows into robust data components, scientific system integrations, AI-enabled insights, and next-generation data pipelines.

  • Collaborate with scientists, assay teams, and lab operations to capture end-to-end assay and experimental workflows, from sample onboarding and execution through data ingestion, validation, and downstream analytics
  • Translate scientific and operational requirements into well-defined functional, technical, and data requirements for laboratory platforms, system integrations, and next-generation data pipelines
  • Design, develop, and maintain Python-based backend services, APIs, microservices, and data pipelines on AWS using FastAPI and supporting frameworks such as Flask or Django, including integrations with scientific systems such as Benchling, Signals, LIMS, ELN, CDS, and SDMS
  • Design and optimize SQL and NoSQL data models and build ETL/ELT and next-generation data pipelines to support structured, semi-structured, and high-volume scientific data, analytics, and AI/ML workloads, including dataset preparation, feature engineering, and model integration into pipelines and applications
  • Implement and maintain CI/CD pipelines for automated build, testing, and deployment
  • Ensure solutions meet performance, data integrity, security, and regulatory compliance requirements (e.g., GxP, 21 CFR Part 11)
  • Perform code reviews, debugging, and performance optimization
  • Coordinate across cross-functional and geographically distributed teams, managing dependencies and ensuring delivery alignment
  • Create ready-to-deliver technical documentation and track deliverables using Jira and Confluence

Requirements

  • Bachelor's or master's degree in Computer Science, Engineering, or Life Sciences, with 3-8 years of hands-on experience in Python development with FastAPI
  • Proficiency in SQL, including schema design, complex queries, and performance optimization
  • Experience with relational databases such as PostgreSQL, MySQL, Oracle, or AWS RDS/Aurora, and NoSQL databases such as DynamoDB, MongoDB, or equivalent
  • Experience with scientific data and laboratory informatics, including familiarity with Benchling or similar scientific data platforms (ELN, LIMS, SDMS, CDS), within the life sciences or pharmaceutical industry (preferred)
  • AWS experience, including S3, EC2, Lambda, Step Functions, RDS / Aurora, IAM, monitoring, and logging
  • Proficiency with Git-based collaborative development, including branch management, pull requests, code reviews, and integration with CI/CD pipelines (GitHub Actions, GitLab CI, Jenkins, AWS CodePipeline) to ensure reliable and traceable software delivery
  • Hands-on experience with Test-Driven Development and Python testing frameworks such as pytest, unittest, and mocking libraries
  • Working knowledge of AI/ML concepts, including data preparation, feature engineering, model integration, and inference workflows
  • Exposure to data and ML libraries such as pandas, NumPy, and scikit-learn (exposure to TensorFlow or PyTorch is a plus)
  • Ability to design data models aligned to scientific and assay workflows, integrate scientific or enterprise systems, and work directly with scientists or lab users
  • Knowledge of containerization (Docker) and modern deployment best practices
  • Familiarity with Agile/Scrum and SDLC development methodologies, and a solid understanding of REST APIs, microservices, and integration patterns
  • Strong communication, stakeholder engagement, and cross-team coordination skills

Additional Preferences

  • Willingness to travel or relocate based on project or business needs
  • Ability to work in a fast-paced, client-focused environment
  • Comfortable managing cross-team coordination and dependency management, particularly across globally distributed teams and user groups

Benefits & conditions

We look for science backgrounds: Biotechnology, Pharmaceutical Technology, Biomedical Engineering, Microbiology, etc. We possess scientific and technical knowledge and bear professional and personal goals. While we have a "no doors" policy to promote free access within, we do have a tough door to walk in. We search with a two-point agenda: technical competency and cultural adaptability.

We offer a competitive compensation package including accrued vacation, medical, dental, vision, 401k with company matching, life insurance, and flexible spending accounts. If you share these sentiments and are prepared for the atypical, then Zifo is your calling!

Zifo is an equal opportunity employer, and we value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

About the company

Zifo is a global specialist scientific and process informatics services company supporting life sciences, biotech, and pharmaceutical organizations. We enable digital transformation across R&D, manufacturing, and quality by delivering data-driven, scalable, and compliant software solutions.

CURIOSITY DRIVEN, SCIENCE FOCUSED, EMPLOYEE BUILT. Our culture is unlike any other, one where we debate, challenge ourselves, and interact with all alike. We are a curious bunch, characterized by our passion to learn and spirit of teamwork.

Zifo is a global R&D solutions provider focused on the industries of Pharma, Biotech, Manufacturing QC, Medical Devices, Specialty Chemicals, and other research-based organizations. Our team's knowledge of science and expertise in technology help Zifo better serve our customers around the globe, including 18 of the Top 20 Biopharma companies.
