Sr. Software Engineer - Platform Performance & Resilience (AI-Enabled)

Toshiba Global Commerce Solutions
Durham, United States of America
27 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Durham, United States of America

Tech stack

Java
Artificial Intelligence
Cloud Computing
Profiling
Concurrency Controls
Continuous Integration
Software Debugging
Distributed Systems
Middleware
Fault Tolerance
Node.js
Reliability Engineering
Cloud Services
Software Engineering
Workflow Management Systems
Cloud Platform System
Kafka
Build Tools

Job description

Architect Reliability Across Edge-Store-Cloud

  • Design and implement platform mechanisms that ensure transaction integrity and availability across POS terminals, store middleware, and cloud services.
  • Define and validate failure-mode strategies for intermittent connectivity, tier isolation, data replay, and synchronization conflicts.
  • Engineer patterns that prevent cascading failures and support graceful degradation under real-world load.

Engineer Performance at Retail Scale

  • Define latency budgets and performance envelopes across all tiers.
  • Build systems that measure and validate throughput, concurrency limits, and resource saturation.
  • Collaborate with development teams to eliminate bottlenecks before production.

Build Automated Resilience Validation

  • Develop AI-enabled systems that automatically generate and execute performance and resilience validation scenarios.
  • Integrate non-functional quality gates into CI/CD workflows.
  • Continuously evaluate timeout, retry, circuit breaker, and backoff strategies under stress.

Elevate Observability & Signal Quality

  • Architect structured telemetry across edge, store, and cloud tiers.
  • Ensure end-to-end transaction traceability.
  • Improve root-cause detection by strengthening monitoring signal-to-noise ratio.

Own Engineering Outcomes End-to-End

  • Produce technical designs and failure-mode analyses.
  • Implement and deploy platform components in Node.js and companion services in Java.
  • Drive production-readiness improvements based on performance data.

Requirements

  • 4-6+ years of professional software engineering experience.
  • Strong proficiency in Node.js and Java.
  • Proven experience in performance engineering, reliability engineering, or distributed systems architecture.
  • Demonstrated experience designing systems with deterministic timeouts, retry/backoff strategies, circuit breakers, and concurrency controls.
  • Experience modeling multi-tier systems (edge, middleware, cloud).
  • Solid understanding of SLOs, SLIs, and non-functional validation.
  • Experience deploying services in Kubernetes-based cloud environments.
  • Strong debugging and profiling skills for distributed systems.
  • Experience building automated resilience or fault-injection systems.
  • Familiarity with event-driven architectures (Kafka, Pub/Sub, MQ).
  • Experience implementing structured observability frameworks.
  • Exposure to AI-enabled automation or workflow orchestration.
  • Experience optimizing systems in intermittently connected environments.

Benefits & conditions

Toshiba Global Commerce Solutions, Inc. offers a competitive salary and generous benefits package including the following:

  • Group health coverage (medical, dental, & vision)
  • Employee Assistance Programs
  • Pre-tax spending accounts
  • 401(k) plan (with company match)
  • Company provided life insurance
  • Pet Insurance
  • Employee discounts
  • Generous paid holiday schedule, paid vacation & sick/personal days

About the company

Toshiba Global Commerce Solutions is seeking a Senior Software Engineer - Platform Performance & Resilience to play a key role in engineering performance, resilience, and observability across a three-tier distributed architecture spanning edge devices, in-store servers, and cloud services. This role uses AI-enabled automation to validate and enforce production-grade reliability, with the ultimate goal of delivering measurable system stability at retail scale. The position operates at the intersection of distributed systems architecture, performance engineering, reliability validation, and intelligent automation.

Toshiba Global Commerce Solutions is a dynamic billion-dollar global company based in Research Triangle Park, NC, providing retail store solutions to your favorite brands. Have you ever been in a hurry and made use of the self-checkout at Lowe's Foods, earned fuel rewards at Kroger, or just paid for purchases at retailers such as Walmart, Michaels, Carrefour, The Gap, Calvin Klein, Boots, Cencosud, BJ's, or Costco? These are just a few examples of our in-store solutions and impressive customer base that made us the world's installed market share leader. The nature of retail is changing quickly, so if you share our 'Together Commerce' vision of a seamless two-way, participatory shopping experience, let's get together to drive the new economy.

We at Toshiba Global Commerce Solutions firmly believe that our people are integral to the success of our customers. Furthermore, we're committed to Diversity, Equity, and Inclusion for all our people, as highlighted by our 5 Core Principles (Create Outreach, Foster Belonging, Unleash Opportunity, Diverse Cultural Engagement, and Culture of Transparency). We're passionate about our customers, the retail industry, and becoming a more responsible company as we help create a brighter future.