Infrastructure/Data Science Architect - Open Position - CA
Requirements
- Advanced degree in Computer Science, Data Engineering, Biomedical Informatics, or a related field
- Minimum 7 years of experience building and operating data science and AI infrastructure, including open-source tooling
- Demonstrated experience leading platform projects that span multiple teams (IT + security/compliance + platform owners)
- Significant hands-on experience in a healthcare or research setting with:
  - Kubernetes and container-based platforms in production environments
  - Healthcare or research-focused ML and LLM applications
  - R and Python for advanced analytics
  - Distributed computing and scalable ML systems
  - Python and Linux/Unix scripting for automation and tooling
- Demonstrated experience implementing Continuous Integration/Continuous Delivery concepts in research or ML environments.
- Strong verbal and written communication skills and a track record of partnering with non-platform stakeholders (such as researchers, analysts, or clinical teams) to translate their needs into platform capabilities.
- Prior experience in clinical research, public health, or life sciences environments.
- Familiarity with regulated data environments (HIPAA, IRB-controlled research).
- Experience supporting collaborative, multi-institutional research projects.
- Experience deploying ML systems in on-premises or private cloud research computing environments.
- Fluency with: Python, Linux OS and scripting, CI/CD, container building (Docker-compatible), configuration syntaxes (YAML, TOML, XML), JSON, RESTful services, Airflow
- Working knowledge of: R programming, RStudio configuration, Spark/H2O, serverless functions, CI/CD, GitLab, Vault, Django, Rancher, Harvester