AWS Database Engineer / Cloud DBA

OpenKyber LLC
9 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate

Job location

Remote

Tech stack

API
Artificial Intelligence
Amazon Web Services (AWS)
Automated Storage and Retrieval Systems
Azure
C++
Cloud Computing
Computer Programming
Databases
Data Centers
Distributed Systems
Python
Blockchain
Azure
Pega
Google Cloud Platform
Load Balancing
Large Language Models
Caching
Generative AI
AI Platforms
Kubernetes
Deployment Automation
Machine Learning Operations
Multiaccess Edge Computing
Api Gateway
ServiceNow
Go

Job description

Role Overview: We are seeking a Principal GenAI Architect to serve as a hands-on practitioner and core technical visionary. This is a rare, high-impact role requiring deep expertise in Generative AI, distributed systems, and agentic architectures. You will act as the central design authority for our GenAI capabilities within a matrixed organization, bridging internal platform development, third-party vendor reviews, and cutting-edge agentic workflows. Your primary mandate is to "push the thinking"-elevating our AI strategy while remaining deeply hands-on. You will oversee all GenAI use cases, driving architectural excellence across cloud, on-premise, and edge environments, with a specific focus on applications within the regional banking and financial services sector., * GenAI Architecture & Thought Leadership: Serve as the ultimate technical authority for GenAI architecture across the enterprise, reviewing and guiding all AI/ML use cases within a matrixed organization. Push the boundaries of our technical vision, acting as a forward-thinking catalyst for how GenAI is built and deployed.

  • Lead the architectural review process for all third-party AI integrations coming into the bank (e.g., ServiceNow, Five9, Pega), ensuring they meet strict security, performance, and integration standards.
  • Agentic Stack & AI Platform Engineering: Spearhead the growth and development of our agentic stack, designing agentic frameworks that incorporate robust workflow (WF) logic. Architect sophisticated retrieval systems and agent data stacks, utilizing vector databases, hybrid search, BM25, and graph-based reasoning.
  • Implement solutions for externalized long-term memory, contextual data freshness, and Model Context Protocol (MCP) servers. Lead prompt and context engineering strategies to maximize model accuracy and reliability.
  • Infrastructure, Inference & Edge Computing: Design, implement, and scale high-performance distributed systems and AI/ML platforms. Optimize LLM inference, implementing advanced batching, caching strategies, and load balancing techniques.
  • Evaluate and implement dynamic deployment strategies, weighing the trade-offs of deploying small/local LLMs at the edge versus leveraging hyperscaler inferencing via cloud APIs.
  • Architect and test distributed API gateways across hybrid (cloud and on-premise) environments. Oversee on-premise hardware strategy, including rigorous GPU management, utilization, and thermal/compute optimization.

Requirements

Do you have experience in Model deployment?, Required Qualifications Engineering Foundation: 12-15 years experience with strong proficiency in at least one core programming language (e.g., Python, Go, C++) and deep experience building large-scale distributed systems.

GenAI & LLM Expertise: 5-7 years hands-on, practitioner-level experience with LLM inference optimization, fine-tuning, and deployment strategies.

Agentic Architectures: 3-5 years experience with a proven track record of building complex agentic systems, evaluation frameworks, and advanced retrieval pipelines (RAG, Vector DBs, Graph reasoning).

Cloud & Infrastructure: 10-12 years extensive experience with Kubernetes, Cloud Infrastructure (AWS, Google Cloud Platform, or Azure), and managing high-availability platforms.

Hardware / On-Premise Knowledge: 8-10 years experience and understanding of GPU orchestration, resource management, and hardware optimization in on-premise or hybrid data centers.

Strategic Communication: 12-15 years experience and ability to navigate a matrixed organization, translate complex technical trade-offs to leadership, and rigorously evaluate third-party enterprise platforms.

Nice to Have Domain experience in the Banking or Financial Services industry. Interest or hands-on experience in integrating Blockchain technologies and decentralized frameworks.

About the company

About OpenKyber: OpenKyber is a $30 billion trusted global innovator of business and technology services. We serve 75% of the Fortune Global 100 and are committed to helping clients innovate, optimize and transform for long term success. As a Global Top Employer, we have diverse experts in more than 50 countries and a robust partner ecosystem of established and start-up companies. Our services include business and technology consulting, data and artificial intelligence, industry solutions, as well as the development, implementation and management of applications, infrastructure and connectivity. We are one of the leading providers of digital and AI infrastructure in the world. OpenKyber is a part of NTT Group, which invests over $3.6 billion each year in R&D to help organizations and society move confidently and sustainably into the digital future.

Apply for this position