Lead Data Platform Engineer (Palantir Or Databricks)

Luxoft Spain
Municipality of Alicante, Spain
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate

Job location

Municipality of Alicante, Spain

Tech stack

Artificial Intelligence
Amazon Web Services (AWS)
Azure
Big Data
Databases
Data Validation
Information Engineering
Data Infrastructure
ETL
Data Systems
Dimensional Modeling
Distributed Systems
Monitoring of Systems
Python
PostgreSQL
MySQL
Oracle Applications
Prometheus
Search Technologies
Retrieval-Augmented Generation
System Availability
Snowflake
Grafana
GIT
PySpark
Kubernetes
Codebase
Data Management
Terraform
Docker
Databricks

Job description

pbProject description /b /ppbr/ppRole Description - We are seeking an expert with deep proficiency as a Palantir Platform Engineer, possessing experience in data engineering and designing, building, and operationalizing AI-powered workflows, agents, and applications that drive tangible business outcomes.The ideal candidate is a self-starter, able to translate complex business needs into scalable technical solutions, and confident working directly with stakeholders to maximize the value of Foundry and AIP./ppbr/ppbResponsibilities /b /ppManage and optimize Palantir data platform./ppEnsure high availability, security, and performance of data systems./ppProvide valuable insights about data platforms usage./ppOptimize computing and storage for large-scale data processing./ppDesign and maintain system libraries (Python) used in ETL pipelines and platform governance./ppOptimize ETL Processes - Enhance and tune existing ETL processes for better performance, scalability, and reliability./ppAIP AI Enablement: /ppSupport the design and deployment of AIP use cases such as copilots, retrieval workflows, and decision-support agents./ppGround agents and logic flows using RAG (retrieval-augmented generation) by connecting to relevant data sources, embedding/vector search, ontology content./ppUse Ontology-Augmented Generation (OAG) when needed: operational decision-making where logic, data, actions and relationships are embedded in the Ontology./ppCollaborate with senior engineers on agent design, instructions, and evaluation using AIP's native features./ppbr/ppbSkills /b /ppbr/ppMust have /ppMinimum 10 Years of experience in IT/Data./ppMinimum 5 years of experience as a Data Platform Engineer/Data Engineer./ppMinimum 3 years of experience with Palantir Foundry./ppPractical experience using or supporting AIP features such as RAG workflows, copilots, or agent-based applications./ppBachelor's in IT or related field./ppInfrastructure Cloud: Azure, AWS (expertise in storage, networking, compute)./ppProficiency in PySpark for distributed computing /ppProficiency in Python for ETL development./ppSQL: Expertise in writing and optimizing SQL queries, preferably with experience in databases such as PostgreSQL, MySQL, Oracle, or Snowflake./ppETL Tools: Familiarity with ETL tools processes /ppData Modelling: Experience with dimensional modelling, normalization/denormalization, and schema design./ppVersion Control: Proficiency with version control tools like Git to manage codebases and collaborate on development./ppData Pipeline Monitoring: Familiarity with monitoring tools (e.g., Prometheus, Grafana, or custom monitoring scripts) to track pipeline performance./ppData Quality Tools: Experience implementing data validation, cleaning, and quality frameworks, ideally Monte Carlo./ppbr/ppNice to have /ppContainerization Orchestration: Docker, Kubernetes./ppInfrastructure as Code (IaC): Terraform./ppUnderstanding of Investment Data domain (desired)./p

Requirements

ppbr/ppbSkills /b /ppbr/ppMust have /ppMinimum 10 Years of experience in IT/Data. /ppMinimum 5 years of experience as a Data Platform Engineer/Data Engineer. /ppMinimum 3 years of experience with Palantir Foundry. /ppPractical experience using or supporting AIP features such as RAG workflows, copilots, or agent-based applications. /ppBachelor's in IT or related field. /ppInfrastructure Cloud: Azure, AWS (expertise in storage, networking, compute). /ppProficiency in PySpark for distributed computing /ppProficiency in Python for ETL development. /ppSQL: Expertise in writing and optimizing SQL queries, preferably with experience in databases such as PostgreSQL, MySQL, Oracle, or Snowflake. /ppETL Tools: Familiarity with ETL tools processes /ppData Modelling: Experience with dimensional modelling, normalization/denormalization, and schema design. /ppVersion Control: Proficiency with version control tools like Git to manage codebases and collaborate on development. /ppData Pipeline Monitoring: Familiarity with monitoring tools (e.g., Prometheus, Grafana, or custom monitoring scripts) to track pipeline performance. /ppData Quality Tools: Experience implementing data validation, cleaning, and quality frameworks, ideally Monte Carlo. /ppbr/ppNice to have /ppContainerization Orchestration: Docker, Kubernetes. /ppInfrastructure as Code (IaC): Terraform. /ppUnderstanding of Investment Data domain (desired).

Apply for this position