Python Web Scraping Engineer

Tenth Revolution Group
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
£ 70K

Job location

Remote

Tech stack

JavaScript
Artificial Intelligence
Information Engineering
Web Scraping
Software Debugging
Python
Reverse Engineering
Selenium
Cloud Platform System
Captcha
Reliability of Systems
Puppeteer (Software)
Playwright

Job description

My client is hiring a Senior Python Scraping Engineer to design, build, and operate high-volume, highly resilient web scraping systems, with a specific and sustained focus on scraping Google at scale.

This is a specialist role. It is not suited to generalist data engineers who have only worked on light or opportunistic scraping. You will be working in hostile, rapidly changing environments where naïve techniques fail and deep expertise in bot detection and evasion is essential.

The role sits at the intersection of data engineering, reverse engineering, and large-scale systems reliability, and plays a critical role in delivering accurate, timely, and trusted data.

My client is a leader in adopting AI-assisted and agentic coding practices, and this role is ideal for engineers who actively use AI tools to improve productivity, reasoning, and system design., * Design and operate large-scale scraping systems handling 10+ million requests per day, primarily targeting Google and Google-like platforms

  • Build robust scrapers for dynamic, JavaScript-heavy environments using browser automation and hybrid approaches
  • Continuously adapt to changes in markup, request flows, ranking logic, and anti-automation mechanisms
  • Engineer extraction pipelines with a strong emphasis on correctness, consistency, and observability
  • Implement and maintain advancedanti-bot evasion strategies, including:
  • Proxy and request routing strategies
  • Browser and headless fingerprinting
  • CAPTCHA handling and mitigation
  • Monitor system health, detect anomalies early, and debug complex production issues
  • Optimise performance, cost, and latency across large-scale scraping infrastructure
  • Collaborate closely with data engineers, data scientists, and product teams to ensure scraped data is reliable and usable
  • Produce clear documentation and operational runbooks to support long-term maintainability

Requirements

  • Expert-level web scraping skills using Python
  • Direct, hands-on experience scraping Google at scale - this is essential
  • Deep understanding of:
  • Anti-bot and bot-detection systems
  • Browser and network fingerprinting
  • CAPTCHA systems and mitigation techniques
  • Scaling scraping infrastructure reliably
  • Strong knowledge of HTTP, TLS, cookies, headers, redirects, and browser networking behaviour
  • Experience with tools such as Playwright, Selenium, Puppeteer, or equivalent frameworks
  • Comfortable designing asynchronous and concurrent scraping architectures
  • Proven experience running scraping systems at scale in cloud environments
  • Excellent debugging skills and the ability to reason about complex failure modes
  • Strong communication skills, with the ability to clearly explain complex technical behaviour

About the company

Tenth Revolution Group are the go-to recruiter for Data & AI roles in the UK offering more opportunities across the country than any other recruitment agency. We're the proud sponsor and supporter of SQLBits, Power Platform World Tour, and the London Fabric User Group. We are the global leaders in Data & AI recruitment.

Apply for this position