NLP Data Scientist
Role details
Job location
Tech stack
Job description
Are you passionate about applying NLP and LLMs to real-world, human-focused challenges? As an NLP Data Scientist at Workhuman, you'll contribute to building AI tools that help organizations understand how their people work, grow, and connect. You'll work across multiple active projects simultaneously rather than embedded on a single product team - one quarter you might be building a text classification model, the next contributing to an LLM evaluation pipeline or translating a new client need into a tool or analysis we haven't built before. What stays constant is working with large volumes of unstructured workplace text and shipping models that drive real product outcomes. **This is a remote position based in the US. ** Applicants must have valid permanent work authorization in the U.S. (e.g., U.S. citizen, permanent resident). We are unable to offer visa sponsorship for this role. What We Can Offer You The opportunity to design and deploy NLP and LLM-powered models that directly power Workhuman's AI products. Hands-on work with large volumes of unstructured text - employee recognition data, feedback, survey responses, and more. Exposure to the full applied NLP lifecycle: problem framing, data preparation, model development, evaluation, and production handoff. Collaboration with a cross-functional team of Data Scientists, ML Engineers, Software Engineers, and Product Managers. A learning-forward environment with access to the latest research, tools, and best practices in NLP and generative AI., Thank you for considering joining our team. We strive to create an inclusive and accessible application process for all candidates. If you encounter any difficulties or have specific accessibility requirements while applying for this position, please don't hesitate to reach out to us at for assistance. We are committed to ensuring equal access to opportunities for all individuals.
Requirements
Foundational experience with NLP techniques, including text classification, named entity recognition, information extraction, and working with unstructured text data. Proficiency in Python and comfort with the modern NLP/ML stack - HuggingFace Transformers, spaCy, NLTK, Gensim, scikit-learn, statsmodels, pandas, and pyarrow. Familiarity with LLMs and their practical applications - including prompt design and basic LLM evaluation techniques (e.g., LLM-as-judge, semantic similarity scoring). Genuine curiosity about model evaluation is a strong plus. Some experience with cloud-based ML infrastructure; AWS and Bedrock experience is a plus. Comfort working with boto3 or similar SDKs to integrate ML capabilities into larger systems. A collaborative, solution-oriented approach to ambiguous problems. Strong communication skills, with the ability to translate technical findings into clear language for non-technical audiences. The base salary range for this position is $130,000 - $155,000. Offered base compensation within this range will be determined based on the candidate's qualifications, experience, geographic location, and other factors relevant to the scope and responsibilities of this role. This role is also eligible for bonus or other variable compensation based on job performance and our standard benefits package, which supports employee well-being and work-life balance.