Data Scientist III

RELX Group plc
Philadelphia, United States of America
1 month ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate
Compensation
$ 116K

Job location

Remote
Philadelphia, United States of America

Tech stack

Java
Microsoft Excel
Artificial Intelligence
Artificial Neural Networks
Clinical Data Repository
Cloud Computing
Cluster Analysis
Databases
Data Cleansing
Data Mining
Data Visualization
Information Retrieval
Python
Matlab
Machine Learning
MySQL
Natural Language Processing
Named Entity Recognition
Tableau
Unstructured Data
Jupyter Notebook
Data Processing
Deep Learning
Data Analytics
Free and Open-Source Software
Unsupervised Learning

Job description

  • Support our Data Sciences team within Health Content Operations and work with the Clinical Solutions and Education business units to provide research around data science and analytics that drives growth, revenue generation, outreach, and innovation. Focus on solving data science problems such as entity extraction, named-entity recognition, word-sense disambiguation, information retrieval, clustering, supervised and unsupervised learning using NLP, machine learning, deep learning, and statistical methods. Compare and recommend the use of latest GenAI technologies to solve these problems when the traditional approaches cannot solve them. Drive innovation by leveraging the latest research, literature, latest developments in GenAI, Responsible AI, GenAI Evaluation, RAG to Build POC's and solve complex problems. Collaborate with inter disciplinary teams across organization to build solutions. Conduct research on concept indexing, relationship extraction, and data extraction from clinical data and scientific literature. Analyze vast amounts of unstructured data and design, prototype, and operationalize machine learning and automation solutions for our health business. Provide data analytics support including designing automated approaches for ontology and graph development, ontology validation and terminology mappings. Analyze extracted information to drive such processes as automated and manual data cleansing, linking, and populating knowledge graphs. Coordinate with stakeholders as needed. Manage contractors as needed to get new entities reviewed for ingestion into Emmet. Ensure compliance with DPR documentation. Lead the Rising Tide program and Mentor interns. Coordinate with IT developers and (content) subject matters experts to translate information needs into data science solutions. Drive new developments and implement process changes and disruptive technologies in the organization. Perform other duties as needed.

Requirements

  • Master's degree (or foreign equivalent) in Data Science, Data Analytics, Enterprise Intelligence, or a related field required.

  • 2 years of experience in job offered or related occupations required.

  • Also required is: 2 years of experience: with doctor, nurse, and patient information needs to design Data Science, Machine Learning (ML) and Natural Language Processing (NLP) solutions to improve patient outcomes; working with deep learning models, neural networks, and state-of-the-art transformer language models, putting data science into production; utilizing nix systems, open-source software, Jupyter notebook hubs, cloud computing, MATLAB for data modeling, machine learning and visualization purposes, and Java, Python or R; and with utilization of database, data manipulation, and visualization tools, such as MySQL, Excel, and Tableau.

  • Employee reports to Elsevier, Inc. office in Philadelphia, PA but may telecommute from any location within the U.S.

  • Experience can be concurrent.

SALARY RANGE FOR REQ# R111460

Apply for this position