Staff Software Engineer

LLMS, LLC
Burlingame, United States of America
1 month ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate
Compensation
$ 275K

Job location

Burlingame, United States of America

Tech stack

Java
API
Artificial Intelligence
Amazon Web Services (AWS)
Azure
Software as a Service
Cloud Computing
Databases
Continuous Integration
Data Governance
Distributed Systems
Python
Service-Oriented Architecture
Software Engineering
TypeScript
Google Cloud Platform
Load Balancing
Fast Healthcare Interoperability Resources
Large Language Models
Caching
Backend
Kubernetes
HuggingFace
Health Level Seven International
Integration Frameworks
Machine Learning Operations
Terraform
Docker

Job description

We are seeking a Staff Software Engineer, Applied AI to build and scale the backend systems that power LLM applications in healthcare. This role is ideal for an engineer who thrives at the intersection of backend architecture and applied AI, designing APIs, pipelines, and infrastructure that make LLMs reliable, secure, and cost-efficient in production. If you want to push LLMs beyond demos into mission-critical healthcare workflows, we'd love to hear from you., * Backend for LLMs - Architect and implement scalable, low-latency APIs and services that wrap, orchestrate, and optimize LLMs for healthcare use cases.

  • Data & Retrieval Pipelines - Build ingestion, preprocessing, and retrieval-augmented generation (RAG) pipelines to ground LLMs in clinical and revenue-cycle data.
  • LLMOps & Observability - Design systems for model monitoring, evaluation, cost tracking, and guardrails, ensuring reliability and responsible use.
  • Performance & Optimization - Engineer solutions for caching, batching, load balancing, and scaling LLM workloads across cloud and containerized environments.
  • Security & Compliance - Implement HIPAA-ready infrastructure, data governance, and auditability for LLM-powered applications.
  • Cross-Functional Collaboration - Partner with product, ML engineers, and healthcare experts to translate business workflows into robust backend systems.
  • Technical Leadership - Drive end-to-end delivery of LLM backend projects, establish engineering best practices, and mentor peers in LLM system design.

Requirements

  • 5+ years of backend or full-stack software engineering experience, with 3+ years working on ML/LLM-enabled applications.
  • Strong coding skills in Python (and ideally one statically typed language such as Go, Java, or TypeScript).
  • Experience with LLM integration frameworks (Hugging Face, LangChain, LlamaIndex, OpenAI APIs, Anthropic, etc.).
  • Deep knowledge of distributed systems, service-oriented architecture, and building APIs at scale.
  • Cloud-native expertise: AWS/Google Cloud Platform/Azure, Kubernetes, Docker, Terraform, etc.
  • Familiarity with MLOps/LLMOps practices: CI/CD for models, evaluation harnesses, monitoring, and reproducibility.
  • Excellent system design skills and the ability to align technical architecture with product goals., * Experience applying LLMs in healthcare or other regulated industries (FHIR, HL7, HIPAA).
  • Hands-on experience with RAG pipelines, vector databases, and structured-output orchestration.
  • Background in enterprise SaaS or mission-critical platforms where uptime, latency, and scale matter.
  • Knowledge of responsible AI, safety, and privacy-preserving ML techniques.

Benefits & conditions

USD $200,000.00 - USD $275,000.00 /Yr.", "datePosted": "2026-03-28T00:02:42.000Z", "validThrough": "2026-04-28T00:02:42.000Z", "url": "https://www.dice.com/job-detail/38038c8d-c1c5-4ae8-8b9c-ca5eb0b0cc9a", "identifier": {"@type": "PropertyValue", "name": "Cayuse Holdings, LLC", "value": "38038c8d-c1c5-4ae8-8b9c-ca5eb0b0cc9a"}, "hiringOrganization": {"@type": "Organization", "name": "Cayuse Holdings, LLC", "sameAs": "https://www.dice.com/company-profile/a83e0e43-0406-5a54-bf28-070fb4f65aca", "logo": "https://d3qscgr6xsioh.cloudfront.net/05dwy1egSVWAMyd0Xp8H_d4bb8036b8e2fa6d2c2ac3c31c658dee.png?format=webp"}, "applicantLocationRequirements": {"@type": "Country", "name": "USA"}, "jobLocation": {"@type": "Place", "address": {"@type": "PostalAddress", "addressLocality": "Burlingame", "addressRegion": "CA", "postalCode": "94010", "addressCountry": "US"}}, "employmentType": "FULL_TIME", "baseSalary": {"@type": "MonetaryAmount", "currency": "USD", "value": "Depends on

Apply for this position