Data Scientist - NLP

ANALYTICA
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate

Job location

Remote

Tech stack

Artificial Intelligence
Amazon Web Services (AWS)
Artificial Neural Networks
Azure
Data Files
Python
Machine Learning
Machine Translation
Natural Language Processing
Named Entity Recognition
NLTK
Open Source Technology
TensorFlow
SAS (Software)
Sentiment Analysis
Stemming
Feature Engineering
PyTorch
Large Language Models
Prompt Engineering
Spark
Deep Learning
Topic Modeling
Keras
Information Technology
Machine Learning Operations
Gensim
spaCy
Software Version Control
Unsupervised Learning
Databricks

Job description

  • Pre-processing - Demonstrate the skills and experience to collect, clean, and prepare data sets for input into a computational model using Python. Strong candidates will explain the methods they have applied using common pre-processing functions such as stop word removal, stemming, lemmatization, and tokenization.

To enhance efficiency, fairness, and accuracy, Analytica may use AI-assisted tools to support certain aspects of our hiring process.
  • Application Review: AI tools may help identify skills and experiences relevant to the role.
  • Interview Support: AI-powered notetaking tools may be used during interviews to document discussions and summarize key points.

These tools are used to assist our team. All hiring decisions are made by Analytica recruiters and hiring managers.

Requirements

  • Feature Engineering and Attribute Evaluation - Candidates must demonstrate experience with NLP feature engineering methods such as TF-IDF, word2vec, GloVe, and FastText, with identifying the key determinants for modeling that exist in the business process and within existing data sets, and with selecting evaluation protocols (model techniques).

  • Modeling - Candidates will have practiced skills and experience selecting classification modeling techniques to fit the business problem. Examples include supervised and unsupervised machine learning (ML), regression, neural networks and deep learning, and natural language processing.

  • Validation - Strong candidates will describe their experience with investigating, reporting, and justifying model results.

  • Visualization - Experience in presenting the results of their modeling activities, depicting the insights realized, and explaining the relevance of their results to the organization's business challenges.

  • Master's degree required, and PhD preferred, in Statistics, Mathematics, Computer Science, or similar

  • High degree of experience utilizing SAS, R, or Python to support NLP use cases such as Document Summarization, Named Entity Recognition, Sentiment Analysis, and/or Topic Modeling

  • At least four years of experience developing scalable, production-ready NLP solutions using scikit-learn, Keras, TensorFlow, PyTorch, and Spark NLP

  • Experience using Git/GitHub to version control source code

  • Experience leveraging transformer architecture to develop NLP models

  • Experience with open source NLP packages such as Gensim, spaCy, or NLTK

  • Experience with BERT, GPT-J, RoBERTa, T5 or other transformers

  • Experience with GenAI and Prompt Engineering is a plus

  • Experience with Databricks and MLflow is a plus

  • Experience with machine translation and transcription of foreign language documents using Microsoft Azure translation services is a plus

  • Experience working in an AWS cloud environment and with related AWS services such as Bedrock and Textract

  • Experience coordinating and maintaining user stories

  • Must be a US citizen

  • Must be able to obtain and maintain a Public Trust security clearance

Benefits & conditions

Analytica has been recognized by Inc. for 3 consecutive years as one of the 250 fastest-growing businesses. We offer competitive compensation with opportunities for bonuses, employer-paid health care, training and development funds, and 401k match.

The following questions are entirely optional. To comply with government Equal Employment Opportunity and/or Affirmative Action reporting regulations, we are requesting (but NOT requiring) that you enter this personal data. This information will not be used in connection with any employment decisions, and will be used solely as permitted by state and federal law. Your voluntary cooperation would be appreciated.

Invitation for Job Applicants to Self-Identify as a U.S. Veteran

  • A "disabled veteran" is one of the following:
      • a veteran of the U.S. military, ground, naval or air service who is entitled to compensation (or who but for the receipt of military retired pay would be entitled to compensation) under laws administered by the Secretary of Veterans Affairs; or
      • a person who was discharged or released from active duty because of a service-connected disability.
  • A "recently separated veteran" means any veteran during the three-year period beginning on the date of such veteran's discharge or release from active duty in the U.S. military, ground, naval, or air service.
  • An "active duty wartime or campaign badge veteran" means a veteran who served on active duty in the U.S. military, ground, naval or air service during a war, or in a campaign or expedition for which a campaign badge has been authorized under the laws administered by the Department of Defense.
  • An "Armed forces service medal veteran" means a veteran who, while serving on active duty in the U.S. military, ground, naval or air service, participated in a United States military operation for which an Armed Forces service medal was awarded pursuant to Executive Order 12985.

About the company

About ANALYTICA: Analytica is a leading consulting and information technology solutions provider to public sector organizations supporting health, civilian, and national security missions. Founded in 2009 and headquartered in Bethesda, MD, the company is an established SBA small business that has been recognized by Inc. Magazine each of the past three years as one of the 250 fastest-growing companies in the U.S. Analytica specializes in providing software and systems engineering, information management, analytics & visualization, agile project management, and management consulting services. The company is appraised by the Software Engineering Institute (SEI) at CMMI® Maturity Level 3 and is an ISO 9001:2008 certified provider.

When receiving email communication from Analytica, please ensure that the email domain is analytica.net to verify its authenticity.
