Associate / Senior Data Engineer

BI GmbH
Biberach an der Riß, Germany
2 days ago

Role details

Contract type
Permanent contract
Employment type
Part-time / full-time
Working hours
Regular working hours
Languages
English, German
Experience level
Senior

Job location

Biberach an der Riß, Germany

Tech stack

Artificial Intelligence
Airflow
Amazon Web Services (AWS)
Data analysis
Apache HTTP Server
Cloud Computing
Software Documentation
Computer Programming
Continuous Integration
Data Architecture
Information Engineering
ETL
Data Systems
Data Warehousing
Software Design Patterns
DevOps
Python
Machine Learning
NoSQL
SQL Databases
Systems Integration
Unstructured Data
Parquet
Snowflake
Spark
IT Architecture
Containerization
Data Lake
Kubernetes
Storage Technologies
Information Technology
Data Analytics
Kafka
Data Management
Data Pipelines
GxP
Docker
Databricks

Job description

As an Associate or Senior Data Engineer, you will take a leading role in shaping and steering the data domain for Chemistry, Manufacturing and Controls (CMC) within Research, Development and Medicine. You will be responsible for driving the design, evolution, and long-term direction of data capabilities that support CMC business processes, decision making, and innovation across the global organization.

While this role includes hands-on development and delivery of data solutions, it also covers strategic ownership of the CMC data domain from the IT perspective. You will act as a key interface between business, product teams, and IT, ensuring that data architectures, platforms, and integrations are aligned with business priorities, regulatory requirements, and Boehringer Ingelheim's overall data and IT strategy.

Embedded in a cross-functional product team, you will contribute to building a scalable, high-quality, and future-proof data ecosystem that enables advanced analytics, AI, and machine learning use cases. Depending on your experience, you will join as an Associate Data Engineer with a strong focus on implementation and domain understanding, or as a Senior Data Engineer with extended responsibility for strategic direction, architectural leadership, and IT decision making within the CMC data landscape.

  • One of your key tasks will be to design, develop, and operate scalable data pipelines as well as ETL/ELT processes that integrate CMC data from both internal and external sources.
  • In regard to CMC data solutions, you will closely collaborate with stakeholders, analysts, researchers, and data scientists to translate business and regulatory requirements into robust, future-proof data architectures.
  • Moreover, you will implement and continuously evolve cloud-based data solutions across data lakes, data warehouses, and analytics platforms.
  • In addition, you will ensure high data quality, integrity, security, and governance by defining, applying, and maintaining validation, monitoring, and documentation standards.
  • Furthermore, you will leverage modern data platforms and tools such as AWS, Databricks, Snowflake, and dbt to enable reporting, advanced analytics, as well as AI and ML use cases.
  • You will be the main contact person for monitoring, optimizing, and troubleshooting data pipelines, ensuring reliable operations, scalability, and cost efficiency across the entire data landscape.

  • You will take end-to-end ownership of the IT data architecture and the overall delivery of CMC data initiatives, ensuring long-term consistency, robustness, and scalability across the data landscape.
  • In your role as a senior expert, you will lead system and solution design discussions and act as the primary IT counterpart for all CMC-related data topics, providing architectural guidance and technical direction.
  • Moreover, you will orchestrate the implementation of large-scale and complex data solutions, taking them from initial concept through development and into successful go-live and steady-state operations.
  • Furthermore, you will drive the strategic alignment of the CMC data domain with enterprise-wide data, cloud, and IT architecture standards, ensuring coherence and future readiness.
  • You will also evaluate, select, and recommend modern data engineering technologies, platforms, and design patterns that best support current and emerging CMC use cases.
  • In addition, you will actively contribute to knowledge sharing and mentoring activities, supporting the development of data engineering best practices and fostering technical excellence within the team.

Requirements

  • Bachelor's degree in computer science, engineering, mathematics, or a related field with extensive relevant professional experience, or a Master's degree in a related field with relevant professional experience, or equivalent practical experience
  • Proven experience in data engineering, including the development and operation of data pipelines and integrations
  • Hands-on experience with cloud-based data platforms, preferably AWS (e.g. S3, Glue, Lambda, Kinesis, Step Functions)
  • Practical knowledge of modern data platforms and tools such as Databricks, Snowflake, dbt, and object storage technologies
  • Strong SQL skills and experience working with relational, NoSQL, and unstructured data
  • Programming experience in Python or Scala and familiarity with CI/CD and DevOps practices
  • It would be a plus to have experience with the Apache ecosystem and modern data tooling (e.g., Spark, Kafka, Airflow, Parquet, Iceberg) as well as infrastructure-as-code and CI/CD practices
  • It would be beneficial to have familiarity with containerization and cloud-native environments (e.g., Docker, Kubernetes)
  • Also beneficial would be experience enabling analytics, AI or machine-learning use cases on enterprise data platforms and relevant cloud or data certifications (e.g. AWS Cloud Practitioner, Data Analytics, or Solutions Architect)
  • Language skills: fluent English and preferably good German

  • Master's degree in computer science, engineering, mathematics, or a related field, or equivalent practical experience with a strong track record in professional data engineering roles
  • Several years of experience designing, implementing, and evolving enterprise-scale data and integration architectures
  • Ability to lead technical workstreams, refine requirements, and coordinate between business stakeholders and IT delivery teams
  • Strong strategic thinking and decision-making capabilities, with the ability to balance short-term delivery and long-term architecture
  • Excellent communication skills and a customer-centric mindset in a global environment
  • Experience working in regulated environments (e.g. GxP) and familiarity with pharmaceutical development or CMC processes is an advantage
