Data Scientist Direct Hire-2749

JAB Recruitment
Houston, United Kingdom
14 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Houston, United Kingdom

Tech stack

Artificial Intelligence
C++
Nvidia CUDA
Python
Machine Learning
NumPy
TensorFlow
SciPy
Sensor Fusion
Signal Processing
Feature Engineering
PyTorch
Deep Learning
Keras
Pandas
Recurrent Neural Networks

Job description

  • Process and model univariate and multivariate time series; perform augmentation, segmentation, alignment, and feature engineering as needed
  • Work with diverse sensor modalities (e.g., vibration, temperature, accelerometers, audio, images) including synchronization, sampling-rate challenges, and noise/artifact handling
  • Build robust pipelines for ingesting and cleaning large-scale, time-synchronized multi-sensor datasets

Foundation Models & Deep Learning

  • Develop self-supervised / semi-supervised methods (masked modeling, contrastive learning, temporal predictive coding, multimodal alignment/fusion)
  • Design and evaluate architectures such as Transformers, CNNs (1D/2D/3D), TCNs, RNN/LSTM/GRU, and optional GNNs / diffusion / generative models
  • Fine-tune at scale using domain adaptation, few-shot learning, and adapter/prompt strategies where applicable
  • Define evaluation strategies across ML metrics (MSE, F1, AUC) and time-series/event metrics (DTW, correlation, IoU) tied to end-user outcomes

Software & Infrastructure

  • Build training and inference workflows for multi-GPU / multi-node environments (mixed precision, scalable data loaders, distributed training/optimization)
  • Contribute to high-performance data preprocessing (Python; C++/CUDA a strong plus)
  • Implement production-ready data/model packaging for downstream teams and deployment

Collaboration & Communication

  • Work cross-functionally with domain experts, product stakeholders, and engineering teams
  • Present model behavior clearly (interpretability/attention analysis, uncertainty quantification) and communicate value/impact

Requirements

Work Authorization: Must be authorized to work in the U.S. (no sponsorship) Experience Level: 3+ years (MS/PhD preferred), * 3+ years of relevant experience in Data Science / Machine Learning / AI

  • Strong experience with time series / sequential data and real-world sensor challenges
  • Expertise in Python (NumPy, SciPy, Pandas)
  • Deep learning experience with PyTorch (preferred) and/or TensorFlow/Keras, JAX/Flax
  • Understanding of core math foundations (linear algebra, probability, optimization) and signal processing basics
  • Ability to communicate effectively with technical and non-technical stakeholders
  • Authorized to work in the U.S. and willing to relocate

Preferred / Strongly Valued

  • MS or PhD in CS, Data Science, AI, or related field
  • Experience building self-supervised foundation models for time series
  • Multi-modal learning (time series + vision/text/audio/structured) and sensor fusion experience
  • Distributed training at scale (ZeRO, mixed precision, multi-node)
  • C++/CUDA and high-performance preprocessing
  • Signal processing methods (Fourier/wavelets, Kalman/Savitzky-Golay, resampling, noise modeling) #LI-DNI

Apply for this position