Data Scientist I
Llc (green)
Warrendale, United States of America
2 days ago
Role details
Contract type
Permanent contract Employment type
Full-time (> 32 hours) Working hours
Regular working hours Languages
English Experience level
JuniorJob location
Warrendale, United States of America
Tech stack
Artificial Intelligence
Amazon Web Services (AWS)
Azure
Encodings
Databases
Continuous Integration
Statistical Hypothesis Testing
Iterative and Incremental Development
Python
Machine Learning
NLTK
NoSQL
Release Management
SQL Databases
Google Cloud Platform
Retrieval-Augmented Generation
Large Language Models
Prompt Engineering
Deep Learning
Backend
Keras
GIT
Pandas
Scikit Learn
Information Technology
Software Version Control
Job description
- Analyze and identify the linkages and interactions between the component parts of an entire system.
- Take ownership of projects, ensuring their successful planning and technical execution.
- Partner with team leadership to ensure collective ownership of quality, timelines, and deliverables.
- Develop skills outside your comfort zone and encourage others to do the same.
- Use the review of work as an opportunity to deepen the expertise of team members.
- Address conflicts or issues, engaging in difficult conversations with clients, team members and other stakeholders, escalating where appropriate.
Requirements
Do you have experience in Version control systems?, Do you have a Bachelor's degree?, * Bachelor's Degree
- 3 year(s) in a quantitative field (Computer Science, Mathematics, Machine Learning, AI, Statistics, Operational research or equivalent)
- At least 1 year of direct experience with feature identification in developing predictive models, data acquisition and preparation, and hypothesis testing.
- At least 1 year experience in Python, R or other relevant language.
You demonstrate experience in:
- Heavy contributor in building of AI and GenAI solutions, including but not limited to analytical modeling, prompt engineering, general all-purpose programming (e.g., Python), testing, communication of results, front end and back-end integration, and iterative development with clients
- Documenting and analyzing business processes for AI and Generative AI opportunities, including gathering of requirements, creation of initial hypotheses, and development of AI/GenAI solution approach
- Experience processing unstructured and structured data to be consumed as context for LLMs, including but not limited to embedding of large text corpus, generative development of SQL queries, building connectors to structured databases; and
- Demonstrates abilities and/or a proven record of success learning and performing in functional and technical capacities, including the following areas:
- Managing GenAI application including back-end and front-end integrations
- Using Python (e.g., Pandas, NLTK, Scikit-learn, Keras, etc.), common LLM development frameworks (e.g., Langchain, Semantic Kernel), Relational storage (SQL), Non-relational storage (NoSQL);
- Experience in analytical techniques such as Machine Learning, Deep Learning and Optimization
- Vectorization and embedding, prompt engineering, RAG (retrieval augmented generation) workflow development
- Understanding or hands on experience with Azure, AWS, and / or Google Cloud platforms
- Experience with Git Version Control, Unit/Integration/End-to-End Testing, CI/CD, release management, etc.
About the company
At Green Cabbage we empower our customers in the procurement and negotiation process with industry data to save money, time and risk. The company is growing rapidly, and we want to set the foundation for accelerating our development team. We are currently designing, building, and optimizing our core data science offerings and looking for a Data Scientist to join our team to steward experimentation and innovation., Green Cabbage, the Global Leader in Procurement Intelligence, provides mid-market and enterprise clients with the data and expertise needed to achieve better deals across technology, third-party labor, marketing, and travel & expense contracts. Our flagship platform, Harvest, empowers procurement teams worldwide to access Green Cabbage's precise, governed intelligence.
At Green Cabbage, we foster a collaborative, high-energy, and growth-minded culture. We move quickly, value accountability, and celebrate team wins. If you're passionate about building secure, modern IT systems that keep a business running efficiently and safely, we'd love to hear from you.