Data AI Engineer with Vector Databases

Lorvenk Technologies LLC
Plano, United States of America
yesterday

Role details

Contract type
Temporary contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Plano, United States of America

Tech stack

Airflow
Amazon Web Services (AWS)
Amazon Web Services (AWS)
Big Data
Encodings
Data Architecture
ETL
Data Transformation
Data Security
Python
TensorFlow
SQL Databases
Workflow Management Systems
PyTorch
Snowflake
Spark
Build Management
Scikit Learn
Kafka
Data Management
GPT
Data Pipelines
Databricks

Job description

Design and build ETL/ELT pipelines and data processing workflows Develop batch and real-time data pipelines using modern frameworks Work with Python and SQL for data transformation and analytics Implement GenAI data architectures, including RAG pipelines and vector indexing Manage and optimize Vector Databases for embedding storage and similarity search Build secure data solutions on AWS, ensuring data quality and compliance Support analytics, reporting, and data modernization initiatives

Requirements

Strong experience in Python and SQL Hands-on experience with ETL/ELT and data pipelines Mandatory: Experience with Vector Databases Experience with GenAI / LLM frameworks (LangChain or LangGraph) Experience with Big Data frameworks (Apache Spark, Apache Kafka) Workflow orchestration using Apache Airflow Experience with data platforms like Databricks or Snowflake AWS services: S3, Glue, Redshift

Nice to Have: Experience with ML frameworks (Scikit-learn, PyTorch) Knowledge of RAG architectures and embedding pipelines Experience in financial services / fintech environments

Apply for this position