Mid-Level NLP Data Scientist [$280k/yr+] TS/SCI-FS Poly
SYSTOLIC, INC.
Herndon, United States of America
yesterday
Role details
Contract type
Permanent contract Employment type
Full-time (> 32 hours) Working hours
Regular working hours Languages
English Experience level
Intermediate Compensation
$ 280KJob location
Herndon, United States of America
Tech stack
Artificial Intelligence
Amazon Web Services (AWS)
Cloud Computing
Nvidia CUDA
Data Cleansing
Data Files
ETL
Data Visualization
Data Flow Control
Python
Machine Learning
Language Modeling
Natural Language Processing
NLTK
Oracle
Oracle Applications
Performance Tuning
Raw Data
TensorFlow
SQL Databases
Tableau
Unstructured Data
Web Services
Graphics Processing Unit (GPU)
PyTorch
Flask
Large Language Models
Deep Learning
Topic Modeling
Generative AI
Keras
GIT
HuggingFace
Data Analytics
Gensim
Spacy
Document Classification
Software Version Control
Jenkins
Custom Reports
Job description
- Perform data-driven business analysis, specializing in natural language processing (NLP) and data preparation.
- Transform structured and unstructured data into clear and supported analytic insights.
- Design and implement advanced ETL code and table configurations for complex data sets.
- Develop and organize relevant information with supporting analytics using SQL in Oracle databases.
- Author analytic publications and produce ad-hoc reports, including data visualizations.
- Conduct sophisticated analysis using NLP, Python, deep learning frameworks (PyTorch, Tensorflow, Keras), and machine learning models for text classification and topic modeling.
- Utilize generative language models, HuggingFace Transformers, and Python NLP packages such as Spacy, Gensim, or NLTK.
- Leverage GPUs for accelerated computing.
- Work with Git and Jenkins for version control and automation.
- Conduct advanced statistical analysis and effectively communicate methodological choices and model results.
- Optional skills include cloud computing (AWS), Flask, Tableau for visualizations, and tuning LLMs., * Work closely with data scientists and technical teams to implement requirements.
- Conduct sophisticated analysis using deployed tools and natural language processing.
- Analyze large amounts of raw data, including text data, to provide business insights.
- Preprocess or clean structured and unstructured data, including text data.
- Design and implement advanced ETL code and table configurations for complex data sets.
- Use Structured Query Language (SQL) in Oracle database to develop and organize relevant information with supporting analytics.
- Independently, or with a team, author analytic publications and produce ad-hoc reports to include data visualizations.
- Stay current with enterprise metadata collection tools.
- Implement existing coordination processes.
- Provide technical education to staff on an ad-hoc basis.
- Provide subject matter expertise in NLP to support initiatives.
- Key skills: Fine-tuning, LLM, Model Training, Git, Jenkins/Hudson, Communications/Technical Writing, Data Science, Database Engineering, Requirements Analysis, Data Visualization, Dataflow, Human Language Processing, Machine Learning/Artificial Intelligence, Web Services, CUDA, PyTorch, Tensorflow, Python, SQL, AWS, Oracle.
Requirements
- Degree: Technical bachelor's degree or equivalent experience
- Years of experience: 13+ years
- Total Compensation: $280k+ yearly (tentative)