Senior AI Engineer in Cleveland

Energy Jobline
Cleveland, United States of America
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Remote
Cleveland, United States of America

Tech stack

API
Artificial Intelligence
Architectural Patterns
Code Review
Databases
Data Validation
Python
Search Algorithms
PostgreSQL
Software Engineering
Systems Integration
TypeScript
Management of Software Versions
Google Cloud Platform
System Availability
Prompt Engineering
Backend
FastAPI
Deployment Automation
GraphQL
Machine Learning Operations
Api Design
GPT

Job description

  • Lead the implementation of rigorous evaluation frameworks to monitor model performance, drift, and cost in real-time.
  • Architect and develop high-performance backend services and APIs using Python (FastAPI) to serve large models at scale.
  • Design advanced Retrieval-Augmented (RAG) systems, selecting and managing vector databases and optimizing embedding strategies for accuracy and speed.
  • Establish comprehensive model observability and guardrail systems to monitor real-time performance, detect distribution drift, and implement automated safety filters that mitigate hallucinations, bias, and toxic outputs in production environments.
  • Build robust integration layers that connect AI agents securely to external enterprise systems, CRMs, and legacy databases.
  • Conduct code reviews, provide technical guidance, and foster a culture of continuous learning and innovation within the engineering team.
  • Collaborate with infrastructure teams to define deployment strategies, ensuring solutions scale dynamically under load.
  • Define the end-to-end architecture for AI products on cloud platforms (preferably Google Cloud Platform), ensuring high availability, security, and cost-effectiveness.

What you'll need to accomplish in your first year:

  • Develop reusable internal libraries and architectural patterns and standards to accelerate the delivery of AI solutions across multiple client engagements.
  • Mentor engineers on best practices for building deterministic software around probabilistic AI models.

Our total rewards program is designed for your protection, peace of mind, and overall well-being. In addition to our outstanding basics, we offer a net-zero cost medical option, company contributions to your HSA, fertility support, fully-paid parental leave, a monthly stipend for your lifestyle spending account, and much more.

Requirements

WE'RE HIRING! If you love data and are looking for unlimited growth opportunities, we want to talk with you about joining Further., * 6+ years of software engineering experience with at least 3 years dedicated to AI/ML application development.

  • Expert proficiency in Python AI application development and modern API architecture (REST, GraphQL, gRPC) using enterprise standards like static type checking and data validation.
  • Deep experience building production applications with LLM frameworks such as LangChain, LangGraph or LlamaIndex.
  • Hands-on expertise with vector databases (Pinecone, Weaviate, PostgreSQL) and search algorithms.
  • Strong understanding of LLMOps principles, including model registry, versioning, and serving infrastructure specifically in Google Cloud.
  • Experience in Typescript development for prototyping and integrations
  • Proficiency with git workflows and understanding of standard application development processes, * Knowledge of advanced prompt engineering and fine-tuning techniques (LoRA, PEFT).
  • Experience optimizing inference costs and latency for large-scale deployments.
  • Previous experience in a client-facing consulting role, managing diverse stakeholders and navigating complex organizational structures.
  • Any Google Cloud Professional Certification

Apply for this position