AI Platform Engineer
Job description
developers to deploy reproducibly and safely.
Design APIs for AI inference, prompt management, and evaluation.
Implement MLOps pipelines: versioning, monitoring, logging, and experiment tracking.
Optimize performance: latency, cost, throughput, and reliability.
Collaborate with backend engineers to integrate AI capabilities.
Monitor model performance and drift, and set up logging and observability.
Build CI/CD pipelines for model deployment.
Document AI infrastructure and best practices.
Mentor AI developers on software practices.

Required Skills & Experience
7+ years of software engineering experience (Python preferred).
Experience with LLMs and AI/ML in production: OpenAI API, HuggingFace, LangChain, or similar.
Knowledge of vector databases: Pinecone, Chroma, Weaviate, FAISS.
Cloud infrastructure experience: GCP (Vertex AI preferred) or AWS (SageMaker).
API development: FastAPI, REST, async programming.
CI/CD and DevOps: Docker, Terraform, GitHub Actions.
Monitoring and observability.
Requirements
Problem-solving mindset; comfortable debugging complex distributed systems.
Experience deploying AI at enterprise level.

Nice-to-Have
Fine-tuning or training models.
Familiarity with LangChain, Pydantic AI, or similar frameworks.
Prompt engineering and evaluation techniques.
Real-time inference and streaming responses.
Background in data engineering or ML engineering.
Knowledge of RAG architectures.
Contributions to open-source AI/ML projects.

Tech Stack
Languages: Python, Bash.
AI/ML: OpenAI API, Anthropic, HuggingFace, LangChain, Pydantic AI.
Vector DBs: Pinecone, Chroma, Weaviate, FAISS.
Backend: FastAPI, SQLAlchemy, Pydantic.
Cloud: GCP (Vertex AI, Cloud Run), Terraform.
CI/CD: GitHub Actions.
Experiment Tracking: MLflow, Weights & Biases, or custom.
Containers: Docker; Kubernetes optional.

What We Offer
Competitive compensation, including stock options.
Access to state-of-the-art tools and collaboration with leading experts.
Flexible work arrangements with potential remote options.