Data AI Engineer with Vector Databases

Lorvenk Technologies LLC

Plano, United States of America

yesterday

Role details

Contract type

Temporary contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Job location

Plano, United States of America

Tech stack

Airflow

Amazon Web Services (AWS)

Big Data

Encodings

Data Architecture

ETL

Data Transformation

Data Security

Python

TensorFlow

SQL Databases

Workflow Management Systems

PyTorch

Snowflake

Spark

Build Management

Scikit Learn

Kafka

Data Management

GPT

Data Pipelines

Databricks

Job description

Design and build ETL/ELT pipelines and data processing workflows

Develop batch and real-time data pipelines using modern frameworks

Work with Python and SQL for data transformation and analytics

Implement GenAI data architectures, including RAG pipelines and vector indexing

Manage and optimize Vector Databases for embedding storage and similarity search

Build secure data solutions on AWS, ensuring data quality and compliance

Support analytics, reporting, and data modernization initiatives

Requirements

Strong experience in Python and SQL

Hands-on experience with ETL/ELT and data pipelines

Mandatory: Experience with Vector Databases

Experience with GenAI / LLM frameworks (LangChain or LangGraph)

Experience with Big Data frameworks (Apache Spark, Apache Kafka)

Workflow orchestration using Apache Airflow

Experience with data platforms like Databricks or Snowflake

AWS services: S3, Glue, Redshift

Nice to Have

Experience with ML frameworks (Scikit-learn, PyTorch)

Knowledge of RAG architectures and embedding pipelines

Experience in financial services / fintech environments