Data Scientist Engineer
International Technologies & Systems Corporation
Baltimore, United States of America
2 days ago
Role details
Contract type: Permanent contract
Employment type: Full-time (> 32 hours)
Working hours: Regular working hours
Languages: English
Experience level: Senior
Compensation: $120K
Job location: Baltimore, United States of America
Tech stack
Artificial Intelligence
Amazon Web Services (AWS)
Data analysis
Computer Vision
Azure
Big Data
Cloud Computing
Data as a Service
Python
Machine Learning
Natural Language Processing
NumPy
TensorFlow
SQL Databases
Data Processing
PyTorch
Large Language Models
Spark
Deep Learning
Generative AI
Pandas
PySpark
Scikit Learn
Information Technology
Machine Learning Operations
Databricks
Job description
This role involves leveraging advanced machine learning models and AI-driven solutions to address complex business problems.
- AI/ML Model Development: Design and train machine learning models using various algorithms, including deep learning, NLP, and computer vision.
- Databricks Orchestration: Build and optimize end-to-end AI/ML pipelines on Databricks, utilizing Unity Catalog for governance and MLflow for experiment tracking.
- Generative AI & LLMs: Implement advanced AI patterns such as Retrieval-Augmented Generation (RAG) and fine-tune pre-trained models for specific enterprise tasks.
- Python Expertise: Write production-quality, idiomatic PySpark and Python code that leverages Spark's distributed nature.
- Collaboration: Partner with Engineering and Product teams to translate business problems into scalable analytical solutions.
- Insight Extraction: Perform exploratory data analysis (EDA) and extract meaningful insights from massive, complex datasets to drive strategic decisions.
Requirements
- Education: MS or PhD in a quantitative field such as Computer Science, Statistics, or Math.
- Experience: 5+ years of hands-on experience in data science or AI engineering in high-growth environments.
- Technical Proficiency: Expert-level Python (pandas, NumPy, scikit-learn, PySpark).
- Extensive experience with Apache Spark for large-scale data processing.
- Proficiency in SQL for data manipulation and querying in Lakehouse environments.
- AI Foundations: Strong understanding of statistics, probability, and advanced ML lifecycle management (MLOps).
Preferred Skills
- Experience with Deep Learning frameworks (TensorFlow, PyTorch).
- Familiarity with Cloud Platforms (AWS, Azure, or GCP) and their native data services.
- Databricks certifications, such as Databricks Certified Machine Learning Professional.
Benefits & conditions
- Health insurance
- Paid time off
- Employee assistance program