Data Scientist

Cadent, LLC
Philadelphia, United States of America
6 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Philadelphia, United States of America

Tech stack

Artificial Intelligence
Amazon Web Services (AWS)
Data analysis
Application Frameworks
Cloud Computing
Encodings
Data as a Services
Information Engineering
Relational Databases
DevOps
High-Level Architecture
Python
Machine Learning
Scrum
TensorFlow
Software Engineering
SQL Databases
Statistics
Feature Engineering
PyTorch
Large Language Models
Deep Learning
Model Validation
Generative AI
PySpark
Scikit Learn
Information Technology
Machine Learning Operations

Job description

Right now we are looking for a highly motivated and experienced Lead Data Scientist within the Data Services Organization who will be responsible for shaping our data science strategy, managing a team of data scientists, delivering ML products and collaborating with cross-functional teams to translate data into actionable business strategies. The Sr. Director will apply scientific methods to identify business optimization strategies and develop, evaluate, and demonstrate prototypes and production grade builds. Data Scientists collaborate directly with the Business, Product, Data Engineering, DevOps and QA team members to productize AI/ML research to drive business growth. This is a critical role that needs knowledge of mathematics and engineering, output from this role will be leveraged by business, engineers and by senior executives to define the future of Cadent. Responsibilities:

  • Hands-on design, train and apply statistics, mathematical models, & machine learning techniques to create scalable ML solutions such as identity to solve business problems and build ML data products to enable speed to market and rapid experimentation
  • Contribute iterative improvements to predictive models using latest ML techniques, algos and tech stack
  • Leverage model governance techniques and frameworks to ensure performance and stability of data science products
  • Define project scope, objectives, and success metrics, ensuring projects are delivered on time and within budget
  • Oversee the end-to-end lifecycle of data science projects, from problem formulation to model deployment and monitoring
  • Present results and findings to technical audience, product and business stakeholders
  • Strive to innovate leveraging latest algos, tools, data and systems while staying abreast of industry trends and best practices in data science
  • Participate in researching new data, tools, algorithms and tech stack to align with evolving AI & ML industry
  • Work with machine learning engineers, data engineers, DevOps and software developers to deploy models and modeling pipelines to be leveraged by business
  • Align with Product and business on project deliverables, timelines, provide updates on progress
  • Foster a collaborative and innovative team culture that encourages knowledge sharing and continuous learning
  • Participate in the Agile / scrum process
  • Follow the CRISP-DM process to generate robust documentation associated with iterative work
  • Collaborate effectively with broader data services group including but not limited to Machine Learning Engineers, Data Engineers, Analytics Engineers, Software Engineers, Quality Assurance Engineers, and Business Intelligence analysts

Requirements

  • M.S. or higher in computer science, mathematics, operations research, statistics or related discipline with a focus on machine learning; or the equivalent of 6-7 years' experience in a Data Science and Machine learning role
  • 6+ years of data science and machine learning developer hands on keyboard experience
  • Experience working with LLM technologies, including developing generative and embedding techniques, modern model architectures, retrieval-augmented generation (RAG), fine tuning / pre-training LLM (including parameter efficient fine-tuning), and evaluation benchmarks
  • Proven background answering open ended research questions using data, tools and technology
  • Ability to write clean, expressive code in Python, use of open source frameworks such as TensorFlow, PyTorch, scikit learn) and or other tools including PySpark, Scala etc.
  • Experience with SQL and reading from relational databases, experienced using cloud computing ecosystems (e.g., AWS, GCP)
  • Experience with the practical application of computational statistics and complex ML algos including deep learning, GraphDB SVMs, time series forecasting etc. to build and evaluate models
  • Experienced in deploying models in production and enable model governance
  • Strong analytical and quantitative problem-solving ability
  • Experienced effectively communicating technical concepts and insights to non-technical stakeholders, influencing decision-making across the organization.
  • Fundamental understanding of the mathematical workings of standard feature engineering, dimension reduction, machine learning algorithms and model validation & measurement
  • Familiarity with best practices for software engineering and the use of the scientific Python ecosystem
  • Media or ad-tech experience a plus

Apply for this position