AI Systems Engineer
Go Arrow
11 days ago
Role details
Contract type: Permanent contract
Employment type: Full-time (> 32 hours)
Working hours: Regular working hours
Languages: English
Compensation: £ 100K
Job location
Tech stack
Artificial Intelligence
Amazon Web Services (AWS)
Systems Engineering
Azure
Cloud Computing
Information Engineering
Distributed Systems
Fault Tolerance
Machine Learning
Performance Tuning
Software Systems
Systems Architecture
Graphics Processing Unit (GPU)
Google Cloud Platform
Deep Learning
Reliability of Systems
Kubernetes
Machine Learning Operations
Docker
Job description
We are seeking an experienced AI Systems Engineer to design, build, and optimize the infrastructure and systems that power our AI-driven applications. In this role, you will bridge the gap between AI research, data engineering, and software systems, ensuring that advanced machine learning and AI models run efficiently, securely, and at scale.
- System Architecture & Design
- Design scalable architectures for AI model training, inference, and deployment.
- Develop high-performance computing systems optimized for deep learning workloads.
- Integrate AI components into production-grade pipelines and distributed systems.
- Infrastructure & Deployment
- Build and maintain cloud-based (AWS, GCP, Azure) and on-premises infrastructure for AI workloads.
- Implement CI/CD pipelines for machine learning and AI services.
- Manage containerization and orchestration tools (Docker, Kubernetes) for AI workloads.
- Performance Optimization
- Optimize model serving for latency, throughput, and cost efficiency.
- Work with GPUs/TPUs and distributed training frameworks (Horovod, Ray, DeepSpeed).
- Implement caching, batching, and model compression strategies to improve performance.
- Monitoring & Reliability
- Develop monitoring tools for AI model health, performance, and data drift.
- Ensure system reliability, fault tolerance, and security for production AI systems.
- Collaborate with MLOps and Data Engineering teams to establish observability and traceability.
- Collaboration & Innovation
- Partner with AI researchers and data scientists to translate models into deployable systems.
- Provide technical guidance on system scalability, performance, and architecture.
- Stay up to date with emerging trends in AI infrastructure, distributed computing, and model deployment.
Job Types: Full-time, Permanent
Pay: £75,000.00-£100,000.00 per year
Requirements
Do you have experience in systems engineering?