Senior Platform Engineer, Ingestion
Job description
This role sits at the core of LangSmith: you'll own the ingestion systems, query systems, and the API, SDK, and CLI surfaces that thousands of development teams use every day. You'll work at the intersection of distributed systems and developer experience, on infrastructure that teams across the industry depend on.
What you'll do:
- Build and scale critical systems: design and operate high-throughput, data-intensive ingestion and trace-query systems supporting LangSmith, built on SmithDB, our purpose-built database for agent observability. Build monitoring, alerting, and automated recovery so the pipeline stays resilient.
- Set API, SDK, and CLI standards: define and enforce the standards, tooling, and CI that power SDK generation across Python, TypeScript, Go, and Java; keep our developer surfaces consistent, high-quality, and self-serve across feature teams.
- Own integrations: build new integrations and maintain existing ones so it's easy to use LangSmith with any AI framework, agent, or tool, keeping us framework-agnostic
- Solve complex problems: debug performance bottlenecks, optimize database queries, and architect solutions for distributed-system challenges
- Respond to incidents: participate in an on-call rotation focused on post-incident learning, automation, and prevention
Requirements
- Platform engineering: hands-on experience designing and running data-intensive systems at scale
- Developer experience: a track record of building high-quality, widely-adopted CLIs, SDKs, or API standards that developers actually enjoy using
- Database expertise: production experience with OSS datastores (PostgreSQL, Redis)
- Backend languages: strong backend software engineering skills with production-level experience in Go, Python, or TypeScript
- Infrastructure expertise: solid knowledge of cloud object storage, Kubernetes, containerized infrastructure, and cloud platforms (GCP, AWS)
- Observability mastery: hands-on experience with observability stacks (Datadog, Prometheus/Grafana, OpenTelemetry, or similar)
- Operational mindset and high agency: "you build it, you run it, you own it," with a focus on sustainable practices
Nice to Have:
- Experience: 5+ years building and operating production systems, developer-facing APIs, or both
- Strong experience with Java
- Knowledge of columnar file and in-memory formats and OLAP databases
- Background in high-growth startups
Benefits & conditions
We offer competitive compensation that includes base salary, meaningful equity, and benefits such as health and dental coverage, flexible vacation, a 401(k) plan, and life insurance. Actual compensation will vary based on role, level, and location. For team members in the EU and UK, we provide locally competitive benefits aligned with regional norms and regulations.
Compensation Philosophy:
We offer competitive compensation that includes base salary, variable compensation for relevant roles, meaningful equity, benefits, and perks. Actual compensation and offerings will vary based on role, level, and location. Team members in the EU, UK, and APAC receive locally competitive benefits aligned with regional norms and regulations. Benefits include medical, dental, and vision coverage, flexible vacation, a 401(k) plan, meals on in-office days in the US, and more.