GenAI Architect - Edison, NJ / New York, NY - Full Time
Okaya Inc
New York, United States of America
6 days ago
Role details
Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Job location
New York, United States of America
Tech stack
API
Artificial Intelligence
Amazon Web Services (AWS)
Azure
Cloud Engineering
Python
TensorFlow
PyTorch
Flask
Large Language Models
Prompt Engineering
Keras
FastAPI
Kubernetes
HuggingFace
Machine Learning Operations
Virtual Agents
Data Pipelines
Docker
Microservices
Requirements
- Architecture-level expertise in Python for enterprise GenAI solutions
- Design and implementation of large-scale GenAI platforms
- API and microservices architecture using FastAPI / Flask
- Deep experience with LLMs, Prompt Engineering, RAG, and Agentic AI
- Hands-on experience with LangChain, LlamaIndex, Hugging Face
- AI-enabled data pipelines and workflows
- Model lifecycle management, observability, and monitoring
- Responsible AI, data privacy, security, and compliance
Good-to-Have Skills
- Gemini LLM and multi-LLM orchestration
- ML frameworks: PyTorch, TensorFlow, Keras
- NLP and vector databases
- Cloud-native AI (AWS / Azure / GCP)
- Docker, Kubernetes
- Presales and client-facing solutioning