NLP Data Scientist
Certain Advantage
Charing Cross, United Kingdom
2 days ago
Role details
Contract type
Permanent contract Employment type
Full-time (> 32 hours) Working hours
Regular working hours Languages
English Compensation
£ 55KJob location
Charing Cross, United Kingdom
Tech stack
Artificial Intelligence
Data analysis
Software Debugging
Information Retrieval
Python
Natural Language Processing
Data Processing
Large Language Models
Prompt Engineering
Generative AI
Information Technology
Job description
- In this role, you will design, implement, and maintain scalable NLP and GenAI pipelines, which include data processing, preprocessing, and evaluation. You will perform advanced data analysis on real-world datasets to extract meaningful insights to support decision-making processes. Staying current with state-of-the-art research in the areas of LLMs and NLP is crucial, as you will propose new ideas and methodologies that unlock business value. You will contribute to the development of RAG systems and retrieval pipelines, which involve chunking, embedding, re-ranking, and evaluation. Additionally, you will participate in experiments by designing experimental details, writing reusable code, running evaluations, and organizing results. Collaboration with your team is essential as you help prioritize research initiatives that have direct value, and you will work closely with stakeholders, project managers, and architects to gather requirements, plan project scopes, and ensure timely project delivery.
Technologies:
- AI
- Support
- Python
- Marketing
More:
At Certain Advantage, we are looking for a passionate NLP/GenAI Data Scientist who can contribute to innovative R&D efforts within our GenAI/NLP team. Generative AI is poised to transform operations across all major lines of our global energies client's business. Applications may include conversational AI, intelligent information retrieval, AI-assisted system design, intelligent plant monitoring, and autonomous exploratory systems. If you possess strong Python skills and a focus on Natural Language Processing, we encourage you to apply today!
Requirements
- We require candidates to have a strong understanding of modern Natural Language Processing (NLP), Large Language Models (LLMs), transformer architectures, prompt engineering, Retrieval-Augmented Generation (RAG), agentic architectures, and evaluation methodologies. A solid foundation in Python programming for developing and debugging AI models is essential. While an educational background at the degree level, ideally in computer science, electrical engineering, or a related technical subject, is preferred, we are open to considering applicants without a Master's degree. Excellent communication skills and a collaborative mindset are vital, as is a strong interest in cross-disciplinary collaboration that supports research delivering both business value and scientific impact.