Cloud Architect with expertise in Generative AI

Gyansys Inc.
Sunnyvale, United States of America
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate

Job location

Sunnyvale, United States of America

Tech stack

.NET
Microsoft Windows
API
Artificial Intelligence
Amazon Web Services (AWS)
Azure
C Sharp (Programming Language)
Cloud Computing
Cloud Computing Security
Cloud Engineering
Information Systems
Human-Computer Interaction
Identity and Access Management
Python
Node.js
Performance Tuning
Role-Based Access Control
Search Technologies
TypeScript
Google Cloud Platform
Enterprise Software Applications
Large Language Models
Multi-Agent Systems
Prompt Engineering
Generative AI
Amazon Web Services (AWS)
FastAPI
Containerization
Kubernetes
Information Technology
Low Latency
Machine Learning Operations
Key Vault

Job description

We are seeking a highly skilled Cloud Architect with expertise in Generative AI, Copilot Studio, and multi?cloud platforms spanning Azure (including Azure AI Foundry), AWS, and Google Cloud., * Architect end?to?end Generative AI solutions, including model serving (vLLM, TGI), API integration, and user interaction layers.

  • Design and implement RAG architecture using vector stores, embeddings, hybrid search, and re?ranking to embed enterprise knowledge into LLMs.
  • Create agentic systems, enabling multi?agent collaboration for complex, stateful workflows and reasoning?driven automation.
  • Develop and govern Copilots in Copilot Studio, including connectors, actions, plugins, DLP rules, environment strategy, and integration with Microsoft 365 and enterprise systems.
  • Leverage Azure AI Foundry (prompt flow, evaluators, safety, model orchestration) to operationalize LLM applications at scale.
  • Evaluate and optimize AI system performance, balancing quality, latency, throughput, cost efficiency, and safety compliance.
  • Implement Responsible AI, security, and HITL (HumanintheLoop) controls, ensuring compliance in regulated environments.?in?the?Loop) controls, ensuring compliance in regulated environments.
  • Produce clear, maintainable documentation for architecture, patterns, and operational processes.

Requirements

  • 10+ years of experience in cloud architecture or enterprise software engineering.
  • 3+ years of hands?on experience designing or delivering Generative AI or LLM applications.
  • Proven experience with Azure AI Foundry, Azure OpenAI, and Copilot Studio (actions, connectors, governance, M365 integration).
  • Experience deploying AI solutions on AWS (Bedrock, SageMaker) and/or GCP (Vertex AI).
  • Hands?on experience with RAG, vector databases (Azure AI Search, Pinecone, OpenSearch, Vertex Matching Engine), embeddings, and hybrid search.
  • Deep understanding of cloud security (IAM/RBAC, Key Vault/KMS, VPC/PrivateLink, token safety).
  • Experience with Kubernetes (AKS/EKS/GKE), containerization, API frameworks (FastAPI, Node.js, .NET), Python, TypeScript, or C#/.NET.
  • Working knowledge of transformer architectures and model adaptation techniques (fine?tuning, LoRA, prompt engineering).
  • Familiarity with AI Ops / MLOps tools such as Prompt Flow, MLflow, SageMaker Pipelines, or Vertex Pipelines.
  • Bachelor's/ Masters in Computer Science, Engineering, Information Systems, Data Science, or related field (required).

About the company

GyanSys is a leading global system integrator company supporting enterprise customers worldwide. We specialize in solutions implementations, managed services, and data analytics spanning SAP, Salesforce, Microsoft, and other prime enterprise platforms. Using a mature blended delivery model with over 3,000 consultants, we support over 350 enterprise customers across the Americas, Europe, and APAC.

Apply for this position