Technical Program Manager IRC294357

GlobalLogic

Reading, PA, United States of America

2 months ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Compensation

$130K to $140K (base salary estimate)

Job location

Reading, PA, United States of America

Tech stack

Agile Methodologies

Artificial Intelligence

Amazon Web Services (AWS)

Confluence

DevOps

Identity and Access Management

Reliability Engineering

Cloud Services

Data Logging

Large Language Models

Grafana

Machine Learning Operations

Virtual Agents

CloudWatch

Terraform

Dynatrace

Job description

We are looking for a highly technical Lead Platform Engineer to architect the observability, cost governance, and security framework for our enterprise AI agent ecosystem. You will be responsible for ensuring that our agentic workflows, built on AWS Bedrock, AgentCore, and MCP servers, are scalable, observable, and cost-efficient. The ideal candidate bridges the gap between traditional DevOps and the emerging world of LLMOps, with a deep focus on distributed tracing for non-deterministic AI workloads.

Salary: "GlobalLogic estimates the starting pay range for this role to be performed hybrid in Reading, PA to be $130K to $140K. This reflects base salary only and does not include additional performance-linked variable compensation, benefits, etc. that may apply to the role. This pay range is provided as a good faith estimate and the amount offered may be higher or lower. GlobalLogic takes many factors into consideration in making an offer, including candidate qualifications, work experience, operational needs, travel and onsite requirements, internal peer equity, prevailing wage, responsibilities, and other market and business considerations."

Assess CloudWatch, X-Ray, Bedrock logging, and AgentCore traces vs. agentic workflow requirements; produce gap analysis
Set up observability in Dynatrace
Design post-deployment validation pipeline for agents & MCP servers (deployment health + tool registration checks)
Implement distributed tracing & structured logging: LLM decisions, tool selections, sub-agent calls, MCP interactions
Evaluate Langfuse / LiteLLM proxy vs. AWS-native; deliver target-state observability architecture recommendation

Cost Tracking & TCO

Extend tagging taxonomy to cover agent runtimes, MCP servers, vector DBs, Bedrock token consumption per namespace
Design cost visibility model: aggregate agent, MCP, vector DB, and Bedrock token costs per team/department
Build CloudWatch (or equivalent) dashboards for per-team spend; configure AWS Budgets with alerting thresholds
Automate cost reports delivered via email / Microsoft Teams; implement anomaly detection rules

Monitoring & Alerting

Define P1-P4 alerting rules: deployment failures, runtime errors, tool invocation failures, MCP connectivity issues
Integrate alert notifications to Microsoft Teams channels and email; route by resource ownership tags
Author runbooks linked to every alert; publish in Confluence for developer self-service resolution
Evaluate AWS-native vs. third-party monitoring stack; deliver recommendation aligned to observability architecture

Security & Access Control

Assess current IAM + tagging approach for multi-team isolation; identify scalability gaps and risks
Evaluate Cedar policy engine (AgentCore) for fine-grained tool access control; document enterprise-scale gaps
Design scalable ABAC-based identity model for multi-team isolation without IAM policy sprawl; deliver Terraform modules

Requirements

Experience: 8+ years in Platform Engineering, DevOps, or Site Reliability Engineering (SRE).
Cloud Expertise: Deep proficiency in AWS (IAM, CloudWatch, Bedrock, Lambda).
Observability Tools: Proven experience with Dynatrace, Jaeger, or Honeycomb, and distributed tracing standards.
AI/LLM Interest: Familiarity with the LLM lifecycle, including prompt execution, token usage, and frameworks like LangChain or AgentCore.
Automation: Advanced experience with Terraform and CI/CD pipeline design.
Collaboration: Experience working in an Agile environment with integrated tools like Microsoft Teams and Confluence.

About the company

Culture of caring. At GlobalLogic, we prioritize a culture of caring. Across every region and department, at every level, we consistently put people first. From day one, you'll experience an inclusive culture of acceptance and belonging, where you'll have the chance to build meaningful connections with collaborative teammates, supportive managers, and compassionate leaders.

Learning and development. We are committed to your continuous learning and development. You'll learn and grow daily in an environment with many opportunities to try new things, sharpen your skills, and advance your career at GlobalLogic. With our Career Navigator tool as just one example, GlobalLogic offers a rich array of programs, training curricula, and hands-on opportunities to grow personally and professionally.

Interesting & meaningful work. GlobalLogic is known for engineering impact for and with clients around the world. As part of our team, you'll have the chance to work on projects that matter. Each is a unique opportunity to engage your curiosity and creative problem-solving skills as you help clients reimagine what's possible and bring new solutions to market. In the process, you'll have the privilege of working on some of the most cutting-edge and impactful solutions shaping the world today.

Balance and flexibility. We believe in the importance of balance and flexibility. With many functional career areas, roles, and work arrangements, you can explore ways of achieving the perfect balance between your work and life. Your life extends beyond the office, and we always do our best to help you integrate and balance the best of work and life, having fun along the way!

High-trust organization. We are a high-trust organization where integrity is key. By joining GlobalLogic, you're placing your trust in a safe, reliable, and ethical global company. Integrity and trust are a cornerstone of our value proposition to our employees and clients. You will find truthfulness, candor, and integrity in everything we do.

About GlobalLogic. GlobalLogic, a Hitachi Group Company, is a trusted digital engineering partner to the world's largest and most forward-thinking companies. Since 2000, we've been at the forefront of the digital revolution, helping create some of the most innovative and widely used digital products and experiences. Today we continue to collaborate with clients in transforming businesses and redefining industries through intelligent products, platforms, and services.
