Principle Software Engineer, AI Observability & Evals Platform

Langchain Inc.
Cambridge, United States of America
11 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$ 270K

Job location

Cambridge, United States of America

Tech stack

Query Performance
API
Artificial Intelligence
Amazon Web Services (AWS)
Azure
Cloud Computing
Code Review
Computer Programming
Databases
Data Systems
Python
PostgreSQL
Software Architecture
Redis
TypeScript
Datadog
React
Large Language Models
Reliability of Systems
Backend
Front End Software Development
Vertica
Go

Job description

We're looking for a Principal/Lead level Software Engineer to join the LangSmith team and help drive the technical direction of the platform. You'll build across the full stack from backend services and APIs to frontend product surfaces, and you'll play a central role in shaping how we build: setting engineering standards, mentoring engineers across the team, and making architectural decisions that hold up as we scale. If you're energized by both hands-on engineering and the multiplier effect of leveling up those around you, this role is built for that., * Lead architectural decisions across our Go, Python, and TypeScript stack, ensuring systems are performant, maintainable, and built to scale

  • Work across the full stack, owning features end-to-end from backend services and APIs through to frontend product experiences
  • Drive tracing, monitoring, and evaluation workflows at scale, with a focus on reliability and query performance across high-volume data
  • Help shape the product roadmap by partnering closely with product and design - not just executing on it

Raise the Bar for the Team

  • Set engineering standards for the team: define patterns, lead code reviews, and establish the foundations others build on
  • Mentor and grow engineers at all levels through code review, design feedback, pairing, and ongoing technical guidance
  • Drive projects from ambiguity to delivery while maintaining high engineering standards and aggressive timelines, * Troubleshoot and resolve production issues with a root-cause mindset, and implement durable fixes
  • Ensure system reliability through strong testing, monitoring, and alerting practices
  • Create and maintain technical documentation, including system design docs and API references

Requirements

Do you have experience in Team development?, * 10+ years of professional experience in backend or fullstack engineering on highly complex, production systems

  • Strong programming skills across multiple parts of the stack: backend (Python and/or Go) and frontend (TypeScript, React, or similar)

  • Demonstrated experience making and owning architectural decisions, including tradeoffs around data systems, APIs, and service reliability

  • Experience with high-throughput or mission-critical systems, and a proven ability to optimize for performance and reliability

  • Depth in operationalizing technical work - you've taken systems from prototype to production and kept them running well at scale

  • Demonstrated track record of mentoring engineers and raising the technical quality of a team, not just the codebase

  • Strong communication skills and comfort operating cross-functionally with product, design, and engineering leadership

  • Customer centricity and an ownership mentality - you care how the product lands, not just how the code reads

  • You exemplify our operating principles, * Experience with database systems (Postgres, Redis, ClickHouse) and cloud platforms (AWS, GCP, or Azure)

  • Familiarity with observability tooling, evaluation frameworks, or AI/LLM infrastructure

Benefits & conditions

Pulled from the full job description

  • Health insurance
  • Vision insurance
  • Dental insurance, We offer competitive compensation that includes base salary, variable compensation for relevant roles, meaningful equity, benefits, and perks. Actual compensation and offerings will vary based on role, level, and location. Team members in the EU, UK, and APAC receive locally competitive benefits aligned with regional norms and regulations., Benefits include medical, dental, and vision coverage, flexible vacation, a 401(k) plan, meals on in-office days in the US and more.

About the company

At LangChain, our mission is to make intelligent agents ubiquitous. We build the foundation for agent engineering in the real world, helping developers move from prototypes to production-ready AI agents that teams can rely on. We began as widely adopted open-source tools and have grown to also offer a platform for building, evaluating, deploying, and operating agents at scale. With $125M raised at Series B from IVP, Sequoia, Benchmark, CapitalG, and Sapphire Ventures, we're at a stage where we're continuing to develop new products, growth is accelerating, and all team members have meaningful impact on what we build and how we work together. LangChain is a place where your contributions can shape how this technology shows up in the real world. Today, LangChain, LangGraph, LangSmith, and Fleet are used by teams shipping real AI products across startups and large enterprises. Millions of developers trust LangChain to power AI teams at companies like Replit, Clay, Coinbase, Workday, Lyft, Cloudflare, Harvey, Rippling, Vanta, and 35% of the Fortune 500.

Apply for this position