Senior Data Engineer
Role details
Job location
Tech stack
Job description
At CDW, we make it happen, together. Trust, connection, and commitment are at the heart of how we work together to deliver for our customers. It's why we're coworkers, not just employees. Coworkers who genuinely believe in supporting our customers and one another. We collectively forge our path forward with a level of commitment that speaks to who we are and where we're headed. We're proud to share our story and Make Amazing Happen at CDW.
The Senior Data Engineer designs, builds, and maintains scalable data pipelines and Lakehouse solutions on the Databricks platform to support enterprise data and AI initiatives. This role partners closely with architects, analytics, and data science teams to deliver highquality, reliable data products. The engineer operates with significant autonomy and contributes deep technical expertise across the full data lifecycle.
What you'll do:
- Build and maintain scalable data pipelines, ETL and ELT processes, and data models within the Databricks platform.
- Design, develop, and deploy data and AI solutions using Databricks, Spark, Delta Lake, and related technologies.
- Develop batch and streaming pipelines using tools such as Databricks Workflows and Azure Data Factory.
- Design logical data flow diagrams and normalized schemas, implementing Lakehouse patterns such as the Medallion Architecture (Bronze, Silver, Gold layers).
- Ensure data quality, integrity, security, and governance throughout the data lifecycle, including use of Unity Catalog.
- Optimize Spark jobs and data transformations through effective partitioning, caching, and join strategies.
- Monitor pipeline execution, identify failures, and troubleshoot complex data processing issues.
- Collaborate with data architects, analysts, data scientists, and business stakeholders to understand requirements and deliver solutions.
- Support documentation of data processes, standards, and data flows.
Requirements
- 5 Years of experience designing, developing, and deploying data solutions on the Databricks platform.
- Proficiency in Python, including PySpark, and SQL.
- Handson experience with Spark, Delta Lake, and Lakehouse architectures.
- Experience implementing data quality, governance, and security practices across data pipelines.
- Familiarity with machine learning concepts, tools, and libraries such as TensorFlow, PyTorch, Scikitlearn, and MLflow is a plus.
- Experience configuring and integrating external AI models and working with AI governance and monitoring tools is a plus.
- Experience with asynchronous programming patterns in Python for building scalable data or AI workloads is a plus.
- Strong problemsolving, collaboration, and communication skills., We're looking for people who bring curiosity, a learner's mindset, and a willingness to engage with ever-evolving technology and tools. We value adopting AI as a partner, openness to experimentation, and a shared interest in learning together on AI. Our goal is to create a culture where AI enhances-not replaces-human creativity and decision-making. You don't need to be an expert today; what matters is your readiness to explore, adapt, and grow with us as we integrate AI responsibly and effectively into our work.