GEN AI Architect--Edison, NJ / New York, NY--Full Time

Okaya Inc
New York, United States of America
6 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English

Job location

New York, United States of America

Tech stack

API
Artificial Intelligence
Amazon Web Services (AWS)
Azure
Cloud Engineering
Python
TensorFlow
PyTorch
Flask
Large Language Models
Prompt Engineering
Keras
FastAPI
Kubernetes
HuggingFace
Machine Learning Operations
Virtual Agents
Data Pipelines
Docker
Microservices

Requirements

  • Architecture-level expertise in Python for enterprise GenAI solutions
  • Design and implementation of large-scale GenAI platforms
  • API and microservices architecture using FastAPI / Flask
  • Deep experience with LLMs, Prompt Engineering, RAG, and Agentic AI
  • Hands-on experience with LangChain, LlamaIndex, Hugging Face
  • AI-enabled data pipelines and workflows
  • Model lifecycle management, observability, and monitoring
  • Responsible AI, data privacy, security, and compliance

Good to Have Skills

  • Gemini LLM and multi-LLM orchestration
  • ML frameworks: PyTorch, TensorFlow, Keras
  • NLP and vector databases
  • Cloud-native AI (AWS / Azure / GCP)
  • Docker, Kubernetes
  • Presales and client-facing solutioning

Apply for this position