Sr. Data Scientist
Role details
Job location
Tech stack
Job description
-
Design, develop, and deploy Generative AI and LLM-driven solutions for complex data processing and analysis
-
Build and optimize NLP pipelines for extracting and transforming structured and unstructured data
-
Develop and maintain machine learning models, including LLM-based workflows and advanced analytics solutions
-
Work within Databricks to build scalable data and ML pipelines, leveraging modern tooling and infrastructure
-
Implement and enhance Retrieval-Augmented Generation (RAG) systems and explore model fine-tuning approaches
-
Collaborate with engineering, product, and data teams to translate business needs into technical solutions
-
Leverage AI-assisted development tools to accelerate engineering workflows and improve efficiency
-
Continuously evaluate and integrate emerging AI/ML technologies and best practices
-
Contribute to the evolution of a modern AI-first environment, blending data science and software engineering practices
-
Communicate complex technical concepts and results to both technical and non-technical stakeholders
Requirements
-
Strong hands-on experience with Python (required)
-
Experience working with Generative AI and Large Language Models (LLMs)
-
Background in Natural Language Processing (NLP) and machine learning fundamentals
-
Experience with Databricks, particularly for machine learning workflows (strongly preferred)
-
Solid software engineering skills with the ability to build scalable, production-ready solutions
-
Experience with Retrieval-Augmented Generation (RAG) or similar GenAI architectures
-
Exposure to model fine-tuning techniques, particularly with open-source models
-
Ability to work in a fast-paced, evolving environment with a blend of data science and engineering responsibilities
-
Strong communication skills and the ability to collaborate across cross-functional teams
Nice to have:
-
Experience working with clinical or healthcare data
-
Background in information extraction techniques (e.g., named entity recognition)
-
Familiarity with modern AI tooling and development environments
Benefits & conditions
-
100% Remote work environment
-
Healthcare Medical including HSA
-
Dental and Vision Insurance
-
401k