Data Engineer
Job description
- Set up data warehouse and manage data pipelines.
- Implement MLOps in NLP and Computer Vision projects.
- Collaborate with teams to optimize data processes.
Skills
Data Ops, SQL, NoSQL, Machine Learning, Data Processing, Python, NLP, Computer Vision, Azure Data Services, Apache Airflow, Snowflake, Apache Flink
Job description
Are you obsessed with the possibilities of the emerging fields of artificial intelligence? Do you believe we can design & develop our own futures? Are you proactive and passionate?
Requirements
A leading company focused on AI for humanity is seeking a Data Engineer (medior/senior) to join their fast-growing team in A Coruña, Spain. The role involves managing data architecture, implementing MLOps, and collaborating on various AI projects. Ideal candidates will have a strong background in data engineering, SQL proficiency, and a passion for AI applications.
- Experience in Data Ops with knowledge of SQL/NoSQL databases.
- Proficiency in SQL for data manipulation and reporting.
- Experience with machine learning frameworks like TensorFlow or PyTorch.
- Degree in Computer Science or Artificial Intelligence.
We're looking for an experienced Data Engineer (medior/senior) who has a talent for problem solving, a keen interest in technology, and wants to help deliver real-world solutions.
- You have a degree in Computer Science or Artificial Intelligence, with experience in Data Ops combined with in-depth knowledge of SQL/NoSQL databases.
- You like working with large amounts of data. We are in charge of the data warehouse and pipeline, designing the architecture and centralising data, ensuring integrity across different sources and system performance. Together with the data science team we look to optimise existing processes and set up new ones.
- You'll join a small but fast-growing data team (the D in DEUS), responsible for the technical aspects of various AI projects. Some examples of tasks would be setting up a data warehouse, extracting, transforming and loading data, and above all task automation. You will also collaborate closely with the other teams to conceptualise, and most of all to make things happen.
- You're interested in helping us implement MLOps in the NLP and Computer Vision projects we are currently working on.
- You have experience bringing a Proof of Concept ML model into production.
You have experience with:
- SQL, which is essential for effective data manipulation, querying, and reporting within our data systems.
- Databricks as a platform, including Unity Catalog and Delta Protocol.
- PySpark/Spark for distributed data processing.
- Distributed data processing techniques.
- Azure data services like ADF, Synapse, and Event Hubs. Familiarity with Microsoft Fabric is also valuable.
You know or have an understanding of:
- Apache Airflow
- Snowflake
- Apache Flink
...and some more points:
- Excellent written and verbal English is the only language requirement.
- Working experience with at least one machine learning framework (e.g., TensorFlow, PyTorch, scikit-learn)
- Programming experience with Python
- Experience with databases, cloud computing, version control, best practices in data engineering and software development
- Knowledge of ML fundamentals, an interest in NLP and computer vision applications
- We will provide top hardware (MacBook Pro, etc.) and the latest software; we use the right tools for the job.