Data Scientist

Talent Portus
Ellicott City, United States of America
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Ellicott City, United States of America

Tech stack

HTML
Artificial Intelligence
Airflow
Amazon Web Services (AWS)
Big Data
Cloudera Impala
Nvidia CUDA
Databases
Data Systems
IBM DB2
Distributed Systems
Hadoop
Hive
Python
Latex
Machine Learning
Mathematica
Microsoft SQL Server
Language Modeling
Natural Language Processing
NLTK
Oracle Applications
TensorFlow
SQL Databases
Unstructured Data
PyTorch
Large Language Models
Spark
Generative AI
Gpu Programming
GIT
Pandas
Scikit Learn
Information Technology
Machine Learning Operations
Spacy
Data Pipelines

Job description

A Fortune 1000 organization in Baltimore is looking for a senior NLP/AI engineer to lead the development of advanced language models, automate data processes, and support large-scale analytical initiatives. This role is fully on-site and requires strong hands-on technical expertise.

Core Responsibilities

Build and improve NLP and LLM models using Python and modern AI libraries.

Work with large, complex datasets to extract insights and support business needs.

Create scalable data pipelines and ML solutions within AWS environments.

Experiment with new NLP techniques and enhance model performance.

Translate real-world challenges into automated, reliable data solutions.

Partner with analysts and data teams to address data gaps and deliver meaningful insights.

Support the deployment and maintenance of ML/LLM systems using MLOps practices.

Requirements

10+ years in IT with strong experience in Python, NLP tools, SQL, Pandas, NLTK, and spaCy

Hands-on experience with Generative AI and LLMs

Strong understanding of ML frameworks such as TensorFlow, PyTorch, and scikit-learn

Experience working with Git, cloud platforms (AWS), Spark, Airflow, or similar technologies

Ability to handle, clean, and process real-world structured and unstructured data

Background working with various database systems (DB2, Oracle, SQL Server, Hadoop)

Strong communication, analytical thinking, and problem-solving skills

Must obtain and maintain Public Trust clearance

On-site in Woodlawn, MD, five days a week

Preferred Qualifications

Experience supporting government (federal or state) IT projects

Knowledge of distributed systems (Spark, Hive, Impala)

Research or analytical lab experience

Familiarity with GPU programming (CUDA), Mathematica, LaTeX/HTML

Exposure to NLP applications for anomaly detection

more

Requirements added by the job poster

3+ years of work experience with spaCy

Working in an onsite setting

5+ years of work experience with NLTK

5+ years of work experience with Large Language Models (LLM)

5+ years of work experience with PyTorch

8+ years of work experience with Python (Programming Language)

Apply for this position