Cloud AI Infrastructure Engineer

Apex Systems LLC
1 month ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate

Job location

Tech stack

Microsoft Active Directory
Agile Methodologies
Artificial Intelligence
Data analysis
User Authentication
Azure
Cloud Computing
Cloud Computing Security
Databases
Continuous Integration
DevOps
DNS
Github
Python
Role-Based Access Control
Reliability Engineering
Cloud Services
Azure
Software Engineering
Data Logging
Scripting (Bash/Python/Go/Ruby)
Google Cloud Platform
Load Balancing
Cloud Platform System
Large Language Models
Firewalls (Computer Science)
Infrastructure as Code (IaC)
Kubernetes
Terraform

Job description

Lead complex technology Cloud initiatives including those that are companywide with broad impact

Act as a key contributor in automating the provisioning of Cloud Infrastructure using Infrastructure as a Code (IaC)

Make decisions in developing standards and best practices for engineering and large-scale technology solutions

Design, optimize, and document the engineering aspects of the Cloud platform

Lead and share understanding of industry best practices and how new technologies influence the Cloud technology team to meet deliverables and drive new initiatives

Review and analyze complex, large-scale technology solutions in Cloud for strategic business objectives and solving technical challenges that require in-depth evaluation of multiple parameters, including intangibles or unprecedented technical factors

Collaborate and consult with key technical experts, senior technology team, and external industry groups to resolve complex technical issues and achieve goals

Build and enable Cloud infrastructure, automate the orchestration for Google Cloud Platforms (Google Cloud Platform) & Microsoft Azure

Working in a globally distributed team to provide innovative and robust Cloud centric solutions

Closely work with Product Team and vendors to develop and deploy Cloud services to meet customer expectations

Gather and analyze data to diagnose the root cause of Cloud issues, recommend, and implement solutions to resolve issues in timely manner

Requirements

In this contingent resource assignment, you may: Consult on complex initiatives with broad impact and large-scale planning for Specialty Software Engineering. Review and analyze complex multi-faceted, larger scale, or longer-term Specialty Software Engineering challenges that require in-depth evaluation of multiple factors, including intangibles or unprecedented factors. Contribute to the resolution of complex and multi-faceted situations requiring solid understanding of the function, policies, procedures, and compliance requirements that meet deliverables. Strategically collaborate and consult with client personnel. Required Qualifications: 5+ years of Specialty Software Engineering experience, or equivalent demonstrated through one or a combination of the following: work or consulting experience, training, military experience, education., 5+ years of Software Engineering experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education

3+ years working with Google Cloud Platform (Google Cloud Platform) and/or Microsoft Azure and a proven track record of building complex infrastructure programmatically with Infrastructure as Code (IaC) tools

2+ years of hands-on experience with IaC tools Terraform and GitHub

Knowledge and understanding of Cloud service offerings such as Compute and AI & ML on Google Cloud Platform and/or Azure

Demonstrated experience on at least three of the following key services: Azure AI Foundry, Azure Machine Learning, Document Intelligence, Azure AI Search, Vertex AI, Document AI, LLMs, etc.

Desired Qualifications:

Exposure to Cloud governance and logging/monitoring tools

Cloud certification on Google Cloud Platform and/or Azure

Experience with Agile, CI/CD, DevOps concepts, and Site Reliability Engineer (SRE) principles

Proficient on container-based solution services, have handled 2-3 large scale Kubernetes based infrastructure build out, provisioning of services in Azure or Google Cloud Platform

Experience in scripting (Shell, Python)

Understanding of Azure cloud service offerings such as AI Foundry

Understanding of Google Cloud Platform cloud service offerings such as Agentspace, NotebookLM, Agent Engine

Understanding of Agentic AI frameworks such as ADK, A2A, MCP, LangChain, etc.

Good understanding of networking, firewalls, load balancing concepts (IP, DNS, Guardrails, Vnets) and exposure to database, cloud security, AD, authentication methods, RBAC

Knowledge and understanding of Cloud service offerings on Security, Data Protection and Security policy implementations

Intermediate understanding of Cloud computing concepts like landing zone and Blueprints

Excellent verbal, written, and interpersonal communication skills

Ability to articulate technical solutions to both technical and business audiences

Ability to deliver & engage with partners effectively in a multi-cultural environment by demonstrating co-ownership & accountability in a matrix structure

Delivery focus and willingness to work in a fast-paced, enterprise environment

About the company

Apex Systems is a world-class IT services company that serves thousands of clients across the globe. When you join Apex, you become part of a team that values innovation, collaboration, and continuous learning. We offer quality career resources, training, certifications, development opportunities, and a comprehensive benefits package. Our commitment to excellence is reflected in many awards, including ClearlyRated's Best of Staffing in Talent Satisfaction in the United States and Great Place to Work in the United Kingdom and Mexico. Apex uses a virtual recruiter as part of the application process. Click for more details.

Apply for this position