Cloud Architect with expertise in Generative AI
Gyansys Inc.
Campbell, United States of America
2 days ago
Role details
Contract type
Permanent contract Employment type
Full-time (> 32 hours) Working hours
Regular working hours Languages
English Experience level
IntermediateJob location
Campbell, United States of America
Tech stack
.NET
Microsoft Windows
API
Artificial Intelligence
Amazon Web Services (AWS)
Azure
C Sharp (Programming Language)
Cloud Computing
Cloud Computing Security
Cloud Engineering
Information Systems
Human-Computer Interaction
Identity and Access Management
Python
Node.js
Performance Tuning
Role-Based Access Control
Search Technologies
TypeScript
Google Cloud Platform
Enterprise Software Applications
Large Language Models
Multi-Agent Systems
Prompt Engineering
Generative AI
Amazon Web Services (AWS)
FastAPI
Containerization
Kubernetes
Information Technology
Low Latency
Machine Learning Operations
Key Vault
Job description
We are seeking a highly skilled Cloud Architect with expertise in Generative AI, Copilot Studio, and multi?cloud platforms spanning Azure (including Azure AI Foundry), AWS, and Google Cloud., * Architect end?to?end Generative AI solutions, including model serving (vLLM, TGI), API integration, and user interaction layers.
- Design and implement RAG architecture using vector stores, embeddings, hybrid search, and re?ranking to embed enterprise knowledge into LLMs.
- Create agentic systems, enabling multi?agent collaboration for complex, stateful workflows and reasoning?driven automation.
- Develop and govern Copilots in Copilot Studio, including connectors, actions, plugins, DLP rules, environment strategy, and integration with Microsoft 365 and enterprise systems.
- Leverage Azure AI Foundry (prompt flow, evaluators, safety, model orchestration) to operationalize LLM applications at scale.
- Evaluate and optimize AI system performance, balancing quality, latency, throughput, cost efficiency, and safety compliance.
- Implement Responsible AI, security, and HITL (HumanintheLoop) controls, ensuring compliance in regulated environments.?in?the?Loop) controls, ensuring compliance in regulated environments.
- Produce clear, maintainable documentation for architecture, patterns, and operational processes.
Requirements
- 10+ years of experience in cloud architecture or enterprise software engineering.
- 3+ years of hands?on experience designing or delivering Generative AI or LLM applications.
- Proven experience with Azure AI Foundry, Azure OpenAI, and Copilot Studio (actions, connectors, governance, M365 integration).
- Experience deploying AI solutions on AWS (Bedrock, SageMaker) and/or GCP (Vertex AI).
- Hands?on experience with RAG, vector databases (Azure AI Search, Pinecone, OpenSearch, Vertex Matching Engine), embeddings, and hybrid search.
- Deep understanding of cloud security (IAM/RBAC, Key Vault/KMS, VPC/PrivateLink, token safety).
- Experience with Kubernetes (AKS/EKS/GKE), containerization, API frameworks (FastAPI, Node.js, .NET), Python, TypeScript, or C#/.NET.
- Working knowledge of transformer architectures and model adaptation techniques (fine?tuning, LoRA, prompt engineering).
- Familiarity with AI Ops / MLOps tools such as Prompt Flow, MLflow, SageMaker Pipelines, or Vertex Pipelines.
- Bachelor's/ Masters in Computer Science, Engineering, Information Systems, Data Science, or related field (required).
About the company
GyanSys is a leading global system integrator company supporting enterprise customers worldwide. We specialize in solutions implementations, managed services, and data analytics spanning SAP, Salesforce, Microsoft, and other prime enterprise platforms. Using a mature blended delivery model with over 3,000 consultants, we support over 350 enterprise customers across the Americas, Europe, and APAC.