Senior Data Scientist II

RELX Group plc

Norwalk, United States of America

3 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Compensation

$184K

Job location

Remote

Norwalk, United States of America

Tech stack

Artificial Intelligence

Amazon Web Services (AWS)

Data analysis

Big Data

Code Review

Computer Programming

Hadoop

Information Extraction

Python

Machine Learning

Language Modeling

Product Management

Large Language Models

Spark

Deep Learning

Information Technology

Data Analytics

XGBoost

Feature Extraction

GPT

Job description

Work on new product development. Propose and build data-driven solutions for high-value customer problems by discovering, extracting, and modeling knowledge from large-scale natural language datasets. Prototype new ideas, collaborate with other data scientists as well as product designers, data engineers, front-end developers, and a team of expert legal data annotators. Evaluate and help maintain our data assets and training/evaluation data sets. Develop and implement NLP-based information extraction solutions. Propose and identify trade-offs of various algorithmic solutions. Interface with other technical personnel or team members to finalize requirements. Work closely with other development team members to understand moderately complex product requirements and translate them into software designs. Successfully implement development processes, coding best practices, and code reviews for production environments. Perform other duties as needed.

Requirements

Master's degree (or foreign equivalent) in Data Science, Data Analytics, Statistics, Computer Science, or a related field required.
4 years of experience in job offered or related occupations required.
Also required: 2 years of experience in the following:
Working directly with large language models and transformer-based architectures, including BERT, RoBERTa, and T5, to develop an in-depth understanding of the existing AI project architecture and to enhance product features by tuning the language models as needed;
Applying LLMs, including ChatGPT, GPT-3.5, Claude, and Mistral, to understand the nuances of the large language models on the market and to build or enhance AI product features with them on a day-to-day basis, as these LLMs are widely used in developing our AI product;
Working with big data technologies and tools, including Hadoop, Spark, or AWS, to handle large volumes of data and perform data science activities on that data, namely data analysis, feature extraction, model development, and reporting;
Working with machine learning algorithms, including deep learning, gradient boosting, and random forests, to develop, fine-tune, and deploy regression or classification models;
Using advanced programming skills in Python, R, or other relevant languages for data analysis to perform day-to-day activities such as exploratory data analysis, model development, testing, and deployment.
Employee reports to LexisNexis USA office in Norwalk, CT, but may telecommute from any location within the U.S.
Experience can be concurrent.

SALARY RANGE FOR REQ# R114245
