Member of Technical Staff, Data Research Engineer - MAI Superintelligence Team

Microsoft
Redmond, United States of America
5 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English

Job location

Charing Cross, United Kingdom

Tech stack

Artificial Intelligence
Data analysis
Information Engineering
Python
NumPy
Management of Software Versions
Data Processing
Spark
Pandas
Information Technology
Data Pipelines
Apache Beam

Job description

We are seeking Data Research Engineers to join our Multimodal team, where we are building the next generation of foundation models across vision, language, audio, and beyond. If you are passionate about exploring, designing, and building high-quality datasets to drive frontier AI models, this role is for you. At Microsoft AI, data is at the heart of innovation-and in this role, you will collaborate closely with scientists, engineers, and annotators to curate, analyse, and evaluate diverse multimodal data sources critical to model development. You'll lead efforts in developing novel data collection strategies, improving dataset quality, understanding data-driven model behaviours, and aligning datasets with ethical and societal values. This is a cross-disciplinary, high-impact role ideal for engineers who want to push the boundaries of what AI can learn from data, especially in multimodal contexts. Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond. Responsibilities

  • Create high-quality datasets for training and evaluation; run experiments on new datasets (data ablations) to assess their impact and determine the most effective data
  • Develop and maintain scalable data pipelines for multimodal ingestion, pre-processing, filtering, and annotation
  • Analyse real-world multimodal datasets to assess quality, diversity, relevance, and identify areas for improvement
  • Build lightweight tools and workflows for dataset auditing, visualization, and versioning
  • Collaborate with Safety, Ethics, and Governance teams to ensure datasets meet standards for quality, privacy, and responsible AI practices

Requirements

Do you have experience in Spark?, Do you have a Master's degree?, * Bachelor's Degree in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or a related technical field AND technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.)

  • OR equivalent experience
  • Experience in data analysis or data engineering
  • Proficiency in statistics and exploratory data analysis methods
  • Ability to communicate technical findings effectively to research and product teams, * Master's Degree in Computer Science or related technical field AND technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.)
  • Familiarity with data processing frameworks such as Spark, Ray, Apache Beam
  • Experience working with large-scale, real-world datasets that are unstructured or semi-structured

This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.

About the company

Microsoft is a global technology company headquartered in Redmond, Washington. Our mission is to empower every person and every organization on the planet to achieve more. We develop, license, and support a wide range of software products, services, and devices that help individuals and businesses realize their full potential.

Our flagship products include the Microsoft 365 productivity cloud, Windows operating system, Azure cloud platform, and Dynamics 365 business applications. We are also a leader in areas such as artificial intelligence, cybersecurity, developer tools, and gaming through Xbox and Game Pass.

With operations in more than 190 countries and over 220,000 employees worldwide, Microsoft is committed to responsible innovation, inclusive economic growth, and sustainability. We work closely with governments, industries, and communities to ensure that technology serves the public good and helps address some of the world’s most pressing challenges.

As we celebrate our 50th anniversary in 2025, we continue to look forward—investing in AI, cloud, and quantum computing to shape the future of work, education, and society at large scale.

Apply for this position