Data Scientist

Spectraforce
Newark, United States of America
9 days ago

Role details

Contract type
Temporary contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English

Job location

Remote
Newark, United States of America

Tech stack

API
Artificial Intelligence
Amazon Web Services (AWS)
Data analysis
Big Data
Cloud Computing
Databases
Data Visualization
Python
Machine Learning
Raw Data
Software Deployment
Software Engineering
SQL Databases
Data Processing
Large Language Models
Generative AI
Information Technology
Text Analysis
Data Pipelines

Job description

  • Responsible for the hands-on development of advanced data science solutions comprising the portfolio developed by the Lead Data Scientist and the technical requirements specified by the Lead Data Scientist. Perform hands-on data analysis, model development, model training, model testing.
  • Write production-level code and partner with machine learning engineers to push development code into production.
  • Continuously research new methods for problem solution, including new algorithms, modeling techniques, and data analytics techniques.
  • Partner with machine learning engineers to productionized machine learning models. Partner with data engineers to build data pipelines. Partner with software engineers to integrate solutions with business platforms. The Skills and expertise you bring

Requirements

  • Advanced degree (Masters, Ph.D.) in Mathematics, Statistics, Engineering, Econometrics, Physics, Computer Science, Actuarial, Data Science, or comparable quantitative disciplines.

  • Working on complex problems in which analysis of situations or data requires an in-depth evaluation of various factors. Exercises judgment within broadly defined practices and policies in selecting methods, techniques and evaluation criteria for obtaining results.

  • Ability to learn new skills and knowledge on an ongoing basis through self-initiative and seeking challenges.

  • Excellent problem solving, communication and collaboration skills. Applied experience with several of the following:

  • Machine Learning: Understanding of machine learning theory, including the mathematics underlying machine learning algorithms. Expertise in the application of machine learning theory to building, training, testing, interpreting and monitoring machine learning models.

  • Generative AI & Natural Language Processing: Experience with modeling and interpreting text analysis including NLP, LLMs (BERT, etc), and Generative AI. Experience in modern Gen AI technologies including RAG, LangChain, LangGraph, vector DB and their application in Individual Retirement Strategies area.

  • Statistics and Computing: Exceptional understanding of: Multivariable Calculus, Linear Algebra, Differential Equations, Applied Probability, Applied Statistics, Computer Science (Programming Methodologies), and Cloud.

  • Knowledge of statistical techniques such as the use of descriptive, inferential, Bayesian statistics, time series analysis etc. to extract business insights and experimentation to solve business problems.

  • Data Acquisition and Transformation: Acquiring data from disparate data sources using API's and SQL. Transform data using SQL and Python. Visualizing data using a diverse tool set including but not limited to Python.

  • Database Management System: Knowledge of how databases are structured and function in order to use them efficiently. May include multiple data environments, cloud/AWS, primary and foreign key relationships, table design, database schemas, etc.

  • Data Wrangling: Preparing data for further analysis; Redefining and mapping raw data to generate insights; Processing of large datasets (structured, unstructured).

  • AWS DevOps: Experience in the project development life cycle in an AWS environment. Familiar with development, QA, staging and production deployment stages.

  • Programming Languages: Python, SQL.

Apply for this position