AI Engineer

Diamondpick, Inc.

Germantown, United States of America

yesterday

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Job location

Germantown, United States of America

Tech stack

API

Artificial Intelligence

Data analysis

Computer Vision

User Authentication

Google BigQuery

Cloud Computing

Cloud Storage

Code Review

Databases

Data Cleansing

Data Retrieval

Linux

Web Development

Python

PostgreSQL

Machine Learning

MongoDB

MySQL

NumPy

TensorFlow

Systems Integration

Unstructured Data

Web Applications

Web Services

Scripting (Bash/Python/Go/Ruby)

Google Cloud Platform

Cloud Platform System

Feature Engineering

PyTorch

Transfer Learning

Flask

Large Language Models

Grafana

Prompt Engineering

Deep Learning

Model Validation

Generative AI

GIT

FastAPI

Pandas

Containerization

Scikit Learn

HuggingFace

XGBoost

Machine Learning Operations

REST

GPT

Data Pipelines

Api Management

Docker

Unsupervised Learning

Web Api

Job description

We're seeking enthusiastic AI Engineers to design, build, and deploy intelligent systems spanning generative AI, traditional machine learning, and deep learning. The role offers hands-on opportunities with large language models, AI agents, classical ML pipelines, deep learning architectures, observability tools, responsible AI practices, and cloud infrastructure on GCP.

Responsibilities include:

Building Python applications powered by LLMs (GPT, Claude, Gemini, LLaMA), utilizing prompt engineering, evaluation, and model customization techniques.
Developing Retrieval-Augmented Generation (RAG) pipelines with vector databases (FAISS, ChromaDB, Pinecone) and frameworks like LangChain or LlamaIndex.
Designing and constructing agent workflows that use planning, tool calling, and memory for reliable multi-step tasks, leveraging frameworks such as LangChain, LlamaIndex, CrewAI, or AutoGen.
Evaluating and improving model outputs using automated metrics and human feedback.

Traditional ML & Deep Learning:

Training and deploying ML models for classification, regression, and clustering using Python libraries like scikit-learn and XGBoost.
Performing feature engineering, data preprocessing, and exploratory data analysis on both structured and unstructured datasets.
Building deep learning models with PyTorch or TensorFlow for NLP and computer vision tasks.
Applying transfer learning and optimizing models for production (quantization, distillation).

Web Applications & APIs:

Building web applications and REST APIs with Flask or FastAPI to serve ML, deep learning, and LLM-powered features to end users.
Designing API endpoints for model inference, data retrieval, and integration with frontend applications.
Integrating AI capabilities into web services with robust error handling, authentication, and scalable architecture.

Cloud & Deployment:

Deploying and managing ML, deep learning, and generative AI workloads on Google Cloud Platform (Vertex AI, Cloud Run, GKE, BigQuery).
Using Vertex AI for training, serving, and orchestrating pipelines across traditional ML models, deep learning models, and LLM-based applications.
Working with GCP storage (Cloud Storage, BigQuery) for data pipelines and feature stores.
Containerizing applications with Docker for deployment on GKE or Cloud Run.
Writing Python scripts for data pipelines, API integrations, and automation tasks on GCP.
Monitoring deployed ML/DL/LLM models and setting up retraining and evaluation workflows using GCP tools.

Requirements

Bachelor's or Master's in CS, AI, Data Science, or related field.

Strong proficiency in Python (NumPy, Pandas, Flask/FastAPI) and Hugging Face ecosystem.
Hands-on experience with LLMs, including prompt engineering, evaluation, or building LLM-powered applications.
Understanding of ML fundamentals: supervised/unsupervised learning, model evaluation, and feature engineering.
Deep learning concepts knowledge (Transformers, CNNs, attention mechanisms), with experience in PyTorch or TensorFlow.
Experience with at least one major cloud platform; GCP strongly preferred.
Familiarity with Linux, Git, Docker, and building REST APIs using Flask or FastAPI.
Understanding of databases and SQL for querying and integrating structured data (PostgreSQL, MySQL, BigQuery, MongoDB, or similar).
Continuous learner with a growth mindset who keeps up with rapidly evolving AI research, tools, and best practices.
Strong communication skills for explaining complex technical concepts to both technical and non-technical stakeholders.
Team-oriented mindset with a collaborative approach to problem-solving, code reviews, and knowledge sharing.
Good documentation habits for writing clear technical docs and maintaining well-commented code.

Role details

Job location

Tech stack

Job description

Requirements

Apply for this position

Good distractions

Moments

Videos View all