GenAI / Agentic Lead / Architect

Nityo Infotech Corporation
Santa Clara, United States of America
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Santa Clara, United States of America

Tech stack

.NET
Microsoft Windows
API
Artificial Intelligence
Amazon Web Services (AWS)
Azure
Google BigQuery
C Sharp (Programming Language)
Cloud Computing
Cloud Computing Security
Cloud Engineering
Information Systems
Human-Computer Interaction
Identity and Access Management
Python
Node.js
Role-Based Access Control
Search Technologies
TypeScript
Enterprise Data Management
Google Cloud Platform
Enterprise Software Applications
Large Language Models
Snowflake
Prompt Engineering
Multi-Cloud
Generative AI
FastAPI
Containerization
Kubernetes
Information Technology
Low Latency
Machine Learning Operations
Key Vault
Redshift
Databricks

Job description

We are seeking a highly skilled Cloud Architect with expertise in Generative AI, Copilot Studio, and multi-cloud platforms spanning Azure (including Azure AI Foundry), AWS, and Google Cloud. This role will design scalable, secure, and production-ready AI systems, enabling RAG, agentic workflows, and enterprise copilots.

Core Responsibilities:

  1. Architect end-to-end Generative AI solutions, including model serving (vLLM, TGI), API integration, and user interaction layers.
  2. Design and implement RAG architecture using vector stores, embeddings, hybrid search, and reranking to embed enterprise knowledge into LLMs.
  3. Create agentic systems, enabling multi-agent collaboration for complex, stateful workflows and reasoning-driven automation.
  4. Develop and govern Copilots in Copilot Studio, including connectors, actions, plugins, DLP rules, environment strategy, and integration with Microsoft 365 and enterprise systems.
  5. Leverage Azure AI Foundry (prompt flow, evaluators, safety, model orchestration) to operationalize LLM applications at scale.
  6. Evaluate and optimize AI system performance, balancing quality, latency, throughput, cost efficiency, and safety compliance.
  7. Implement Responsible AI, security, and HITL (Human-in-the-Loop) controls, ensuring compliance in regulated environments.
  8. Produce clear, maintainable documentation for architecture, patterns, and operational processes.

Requirements

  • 8-10 years of experience in cloud architecture or enterprise software engineering.
  • 3+ years of hands-on experience designing or delivering Generative AI or LLM applications.
  • Proven experience with Azure AI Foundry, Azure OpenAI, and Copilot Studio (actions, connectors, governance, M365 integration).
  • Experience deploying AI solutions on AWS (Bedrock, SageMaker) and/or Google Cloud Platform (Vertex AI).
  • Hands-on experience with RAG, vector databases (Azure AI Search, Pinecone, OpenSearch, Vertex Matching Engine), embeddings, and hybrid search.
  • Deep understanding of cloud security (IAM/RBAC, Key Vault/KMS, VPC/PrivateLink, token safety).
  • Experience with Kubernetes (AKS/EKS/GKE), containerization, API frameworks (FastAPI, Node.js, .NET), Python, TypeScript, or C#/.NET.
  • Working knowledge of transformer architectures and model adaptation techniques (fine-tuning, LoRA, prompt engineering).
  • Familiarity with AI Ops / MLOps tools such as Prompt Flow, MLflow, SageMaker Pipelines, or Vertex Pipelines.
  • Experience implementing agent-based systems using frameworks like LangChain, LlamaIndex, Semantic Kernel, or AutoGen.
  • Background working with enterprise data ecosystems (Databricks, Snowflake, BigQuery, Redshift).
  • Knowledge of Responsible AI frameworks, guardrails, safety filters, PII redaction, and evaluation methodologies.
  • Experience in regulated industries (healthcare, finance, government), with understanding of compliance controls.
  • Experience with observability (OpenTelemetry, Prometheus/Grafana, App Insights) for AI workloads.

Education:

  • Bachelor's/Master's in Computer Science, Engineering, Information Systems, Data Science, or related field (required).
