Data Engineer
Teksystems
Basel, Switzerland
7 days ago
Role details
Contract type
Permanent contract Employment type
Full-time (> 32 hours) Working hours
Regular working hours Languages
English, GermanJob location
Basel, Switzerland
Tech stack
API
Airflow
Databases
Data Deduplication
Information Engineering
ETL
Data Mining
Data Warehousing
Python
MongoDB
NoSQL
Parsing
Text Mining
Scripting (Bash/Python/Go/Ruby)
Large Language Models
GraphQL
Api Design
Text Analysis
Job description
- Design and implement ETL pipelines and MongoDB schemas
- Manage document databases, including curation and FAIRification (cleaning, parsing, disambiguation, deduplication, harmonization)
- Integrate new data sources and develop robust parsers and text-cleaning workflows
- Improve user experience by building data warehouses and APIs
- Provide user support for dataset usage (documentation, training, use case guidance)
Requirements
We are seeking a technically strong data professional with robust experience in:
- Data engineering, data warehousing, and database management
- NoSQL databases (especially MongoDB)
- Python Scripting
- ETL orchestration tools (eg, Airflow)
A solid understanding of Text Analytics, Text and Data Mining (TDM), and Large Language Models (LLMs) is highly desirable. experience with scientific or other text-based datasets, high-performance computing environments, and API development (eg, GraphQL) is a plus. A biomedical Background is beneficial but not required., * Strong experience in data engineering, data warehousing, and database management.*
- Hands-on experience with NoSQL/MongoDB.*
- Strong proficiency in Python Scripting.*
- experience with ETL orchestration tools (eg, Airflow).*
- Understanding of Text Analytics, Text Mining, TDM, and LLMs
- Knowledge of FAIR data principles
- Familiarity with scientific literature or other text-based datasets
- Excellent communication skills in English (fluent); German is a plus
- Ability to explain technical topics to non-technical stakeholders
- Exposure to high-performance computing environments
- experience with API development (eg, GraphQL) is advantageous
- Biomedical Background/education is a plus