Search - Search Inference - Senior Site Reliability Engineer
Job description
Elastic, the Search AI Company, enables everyone to find the answers they need in real time, using all their data, at scale - unleashing the potential of businesses and people. The Elastic Search AI Platform, used by more than 50% of the Fortune 500, brings together the precision of search and the intelligence of AI to enable everyone to accelerate the results that matter. By taking advantage of all structured and unstructured data - securing and protecting private information more effectively - Elastic's complete, cloud-based solutions for search, security, and observability help organizations deliver on the promise of AI.
What Is The Role
The Search Inference team is responsible for bringing performant, ergonomic, and cost-effective machine learning (ML) model inference to Search workflows. ML inference has become a
Requirements
inference service so it may scale efficiently and reliably, hosting a growing number of models for semantic search, agentic workflows and foundation models.
* Ensuring proactive monitoring and SLO-based alerting using error budgets to prevent incidents before they happen.
* Enhancing the scalability and reliability of the service and partnering with the team to ensure knowledge is shared, clear documentation is produced, and best practices are followed.
* Growing our global infrastructure to meet increasing scaling demands by developing and maintaining software, tooling, and automations.
* Collaborating in an inclusive environment, focusing on operational excellence and uplifting each other with constructive feedback.
* Being part of an SRE on-call rotation responding to operational needs and incidents.
What You Bring
* 5+ years of experience in a site reliability engineer (or equivalent) role, operating services in production at scale.
* 3+ years of experience with Kubernetes, Helm & containerised services.
* Experience with Terraform/Pulumi/Crossplane or similar.
* Experience writing non-trivial code in a language like Python, Go, or equivalent.
* Strong Linux fundamentals, experience writing Bash scripts.
* Strong written communication.
Bonus points
* Experience working with Ray and KubeRay is a big plus!
* Experience working with the Elastic Observability Stack.
Benefits & conditions
Additional Information - We Take Care Of Our People
* Competitive pay based on the work you do here and not your previous salary.
* Health coverage for you and your family in many locations.
* Ability to craft your calendar with flexible locations and schedules for many roles.
* Generous number of vacation days each year.
* Increase your impact - We match up to $2000 (or local currency equivalent) for financial donations and service.
* Up to 40 hours each year to use toward volunteer projects you love.
* Embracing parenthood with