Senior Software Engineer - Python/AI/ML Infrastructure
Role details
Job location
Tech stack
Job description
As a Senior Software Engineer on the AI/ML Infrastructure team, you will play a key role in designing, developing, and deploying the infrastructure that supports our AI/ML/LLM initiatives. You will collaborate with software engineers, machine learning engineers, and platform infrastructure engineers to deliver scalable and high-quality solutions.
You will be responsible for architecting and implementing robust technical components that drive the next generation of Process Intelligence. These components will leverage technologies such as computer vision, natural language processing, data mining, recommendation systems, knowledge bases, and business intelligence.
The work you'll do
- Understand and translate business and technical requirements into effective solutions.
- Design and develop highly scalable and performant systems.
- Build and maintain low-latency infrastructure for serving AI/ML/LLM models.
- Develop intelligent techniques for integrating and processing large-scale enterprise operational data.
- Create and maintain APIs, libraries, and infrastructure to support data serving at scale.
- Implement data quality and freshness mechanisms for enterprise customers.
Requirements
- 5+ years of experience in software engineering or a relevant technical role (preferably experienced in Machine Learning Ops)
- Expert-level proficiency in Python (preferably experienced with FastAPI, PyDantic, and SQLAlchemy)
- Familiarity with distributed systems and event driven architectures (e.g. RabbitMQ or Kafka)
- Strong understanding of DevOps principles and tools: Kubernetes, Terraform, DataDog, Docker, CI/CD, ArgoCD
- Extensive experience with cloud platforms (AWS, Azure, GCP)
- Proven ability to design, build, document, and distribute scalable services, libraries, and tools
- Passion for developing AI/ML/LLM solutions and staying up-to-date with industry trends.
About the company
We're Celonis. We help companies reveal and fix inefficiencies they can’t see in their software system, enabling them to perform at levels they never thought possible.