Software Engineer - Core Infrastructure
Role details
Job location
Tech stack
Job description
- Maintain and grow multi-cloud compute infrastructure to support large-scale ML model training and computational workloads.
- Build and optimize configuration and procedures for monitoring, resource allocation, and deployment automation.
- Scale autoscaling compute clusters to handle increasing workloads.
- Enhance orchestration and scheduling frameworks to improve execution throughput, reliability, and compute utilization across heterogeneous pipelines.
Requirements
- 5+ years of experience building and maintaining cloud infrastructure at scale (e.g., AWS or GCP).
- Proficiency in Python, Bash, Terraform, Ray, and Kubernetes.
- Experience with compute clusters running distributed ML training jobs with 1,000+ GPUs is highly desirable.
- Hands-on experience with physical hardware and datacenter management is a plus.
Benefits & conditions
- $150,000 - $170,000 per annum