Data Scientist
Role details
Job location
Tech stack
Job description
Agile Defense is seeking a Data Scientist / Engineer to support the design, development, and operational deployment of scalable, AI-enabled data solutions within the Department of Defense's CDAO ADA IR program. This role is part of a multidisciplinary team integrating advanced analytics, machine learning, and engineering practices into mission-critical environments at Combatant Commands.
You will help shape and deploy data pipelines, pre-processing workflows, feature engineering strategies, and machine learning services within secure, containerized environments. The ideal candidate brings a hybrid of statistical modeling fluency and hands-on software engineering expertise. You will collaborate closely with product managers, full-stack developers, platform engineers, and mission stakeholders to transform raw data into meaningful insights and decision-support tools., * Happy - Be Infectious. Happiness multiplies and creates a positive and connected environment where motivation and satisfaction have an outsized effect on everything we do.
- Helpful - Be Supportive. Being helpful is the foundation of teamwork, resulting in a supportive atmosphere where collaboration flourishes, and collective success is celebrated.
- Honest - Be Trustworthy. Honesty serves as our compass, ensuring transparent communication and ethical conduct, essential to who we are and the complex domains we support.
- Humble - Be Grounded. Success is not achieved alone, humility ensures a culture of mutual respect, encouraging open communication, and a willingness to learn from one another and take on any task.
- Hungry - Be Eager. Our hunger for excellence drives an insatiable appetite for innovation and continuous improvement, propelling us forward in the face of new and unprecedented challenges.
- Hustle - Be Driven. Hustle is reflected in our relentless work ethic, where we are each committed to going above and beyond to advance the mission and achieve success.
Requirements
This role requires strong technical communication skills, a collaborative mindset, and experience working in agile environments that value reproducibility, testing, and continuous delivery. Familiarity with cloud-based data platforms such as Databricks, Palantir, or AWS-native data services is highly preferred. Education and Background A bachelor's degree plus 3 years of recent specialized experience, OR, an associate's degree plus 7 years of recent specialized experience, OR, a major certification plus 7 years of recent specialized experience, OR, 11 years of recent specialized experience Required Skills
- 4+ years of experience in applied data science, machine learning engineering, or data pipeline development.
- Proficient in Python, SQL, and distributed data frameworks (e.g., Spark, Databricks, PySpark
Preferred Skills
- Experience developing ML models from training to deployment using industry-standard tools and libraries (e.g., scikit-learn, TensorFlow, XGBoost, MLflow).
- Familiarity with MLOps, API development, and secure cloud-based environments (e.g., AWS, Azure, Palantir Foundry).
- Strong understanding of data validation, model testing, and performance evaluation techniques.
- Experience with data visualization and storytelling using tools such as Tableau, Plotly, or Matplotlib.
- Excellent technical communication skills, with the ability to explain complex concepts to non-technical audiences.