Lead Senior Backend Engineer
Role details
Job location
Tech stack
Job description
performant, secure and observable at scale. You'll collaborate with experts in AI infrastructure and data engineering to build robust, secure, and efficient systems that scale to millions of requests. Responsibilities and Duties Design and build core platform services: API gateway, authentication, authorization, key rotation, and multi-tenant isolation.Design, implement, and optimize APIs and backend systems using Python frameworks such as primarily FastAPI (or Flask, or Django). Architect and implement usage metering, billing integration, and rate limiting for inference endpoints. Build and maintain scalable, fault-tolerant microservices for data processing and AI integration. Build and operate a high-throughput proxy/routing layer for AI model serving traffic. Collaborate with cross-functional teams to design system architecture and ensure seamless system interoperability. Design telemetry and observability into the platform from the ground up - structured logging, distributed tracing
Requirements
metrics, and alerting. Implement robust CI/CD pipelines, monitoring, and observability for high-performance production systems. Drive technical decisions on architecture, data modeling, and technology choices. Identify performance bottlenecks and drive improvements in reliability, scalability, and latency. Establish engineering standards for the backend codebase: testing, code review, CI/CD, and deployment practices. Ensure best practices for security, compliance, and maintainability across the API lifecycle. Collaborate closely with the ML infrastructure team to integrate with model serving systems (e.g., NVIDIA Dynamo, vLLM, SGlang, TensorRT-LLM). Qualifications Degree in Computer Science, Software Engineering, or equivalent professional experience. 5+ years of experience building and operatingdeveloping backend systems in production, with a meaningful scale with a focus on API design and high-scale production environments. Strong proficiency in Python and at least one systems-oriented language (Go, Rust, Java, C++), willingness to work across the current stack which includes Python (FastAPI) - we value the right tool for the jobengineer with deep knowledge of asynchronous programming, modern Python tooling and profiling. Proven experience designing and maintaining REST APIs (REST, gRPC, WebSockets, SSEIncluding websockets, SSE, Webhooks etc.) in production environments and the tradeoffs between them. Solid understanding of distributed systems: consistency, fault tolerance, concurrency, performance and networking, data pipelines, and performance optimization. Experience designing authentication/authorization systems (OAuth2, JWT, API key management, RBAC/ABAC). Experience with cloud infrastructure (AWS, GCP, or Azure) and containers ized environments (Docker) and orchestration (, Kubernetes). Understanding of database design (SQL, ORMs etc.) and data modeling at scale. Fluent in English (written and verbal) and comfortable collaborating in international teams. Nice-t