Machine Learning Engineer

CareerCircle
Cupertino, United States of America
4 days ago

Role details

Contract type
Temporary contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$ 146K

Job location

Remote
Cupertino, United States of America

Tech stack

Artificial Intelligence
Big Data
Computer Programming
Data Visualization
Distributed Computing Environment
Python
Machine Learning
Natural Language Processing
Standard Sql
SQL Databases
Large Language Models
Prompt Engineering
Model Validation
Build Management
PySpark
Data Pipelines

Job description

  • Design and build LLM-based evaluation frameworks, including automated scoring pipelines and rubric-based grading systems
  • Build and maintain data pipelines for evaluation datasets using Python, SQL, and scalable processing tools
  • Translate complex evaluation results into clear, actionable insights for technical and non-technical stakeholders
  • Implement automation workflows and agentic evaluation systems to improve efficiency and reduce manual efforts
  • Develop prompt engineering strategies to evaluate output quality, accuracy, and consistency
  • Create and maintain metrics, KPIs, and dashboards to track and communicate model performance
  • Conduct error analysis, root-cause investigations, and quality deep dives to guide model improvements
  • Partner cross-functionally to define evaluation methodologies and integrate them into production workflows

Requirements

  • 5+ years of experience in ML engineering, NLP, or AI/ML automation
  • Strong programming skills in Python and SQL
  • Deep understanding of machine learning concepts with a focus on NLP and advanced LLM capabilities (e.g., Chain-of-Thought, agentic workflows)
  • Experience working with large-scale datasets and data pipelines
  • Strong experience with LLM evaluation, prompt engineering, or auto grading systems
  • Experience developing metrics and KPIs to measure model output quality and consistency

Nice-to-Have:

  • Experience with LLM-as-judge systems or human + model evaluation frameworks
  • Background in inter-rater reliability, evaluation calibration, or judged systems design
  • Experience with PySpark or distributed data processing tools
  • Exposure to building dashboards or visualization tools for model performance tracking, Python, SQL, NLP, LLM Evaluation, Prompt Engineering, Machine Learning, Data Pipelines, Automation Systems

Benefits & conditions

This is a Contract position based out of Cupertino, CA. Pay and Benefits

The pay range for this position is $60.00 - $70.00/hr.

Eligibility requirements apply to some benefits and may depend on your job classification and length of employment. Benefits are subject to change and may be subject to specific elections, plan, or program terms. If eligible, the benefits available for this temporary role may include the following:

  • Medical, dental & vision
  • Critical Illness, Accident, and Hospital
  • 401(k) Retirement Plan - Pre-tax and Roth post-tax contributions available
  • Life Insurance (Voluntary Life & AD&D for the employee and dependents)
  • Short and long-term disability
  • Health Spending Account (HSA)
  • Transportation benefits
  • Employee Assistance Program
  • Time Off/Leave (PTO, Vacation or Sick Leave) Workplace Type

About the company

PySpark Research Dashboard Operations Automation Scalability Investigation Data Pipelines Machine Learning Business Valuation Scalability Design Prompt Engineering Workflow Management Product Engineering Full Stack Development Distributed Data Store Artificial Intelligence Business Transformation SQL (Programming Language) Critical Illness Insurance Python (Programming Language) Large Language Model Evaluation Key Performance Indicators (KPIs) Error Analysis (Numerical Analysis), We're partners in transformation. We help clients activate ideas and solutions to take advantage of a new world of opportunity. We are a team of 80,000 strong, working with over 6,000 clients, including 80% of the Fortune 500, across North America, Europe and Asia. As an industry leader in Full-Stack Technology Services, Talent Services, and real-world application, we work with progressive leaders to drive change. That's the power of true partnership. TEKsystems is an Allegis Group company., We're a leading provider of business and technology services. We accelerate business transformation for our customers. Our expertise in strategy, design, execution and operations unlocks business value through a range of solutions. We're a team of 80,000 strong, working with over 6,000 customers, including 80% of the Fortune 500 across North America, Europe and Asia, who partner with us for our scale, full-stack capabilities and speed. We're strategic thinkers, hands-on collaborators, helping customers capitalize on change and master the momentum of technology. We're building tomorrow by delivering business outcomes and making positive impacts in our global communities. TEKsystems and TEKsystems Global Services are Allegis Group companies. Learn more at TEKsystems.com. The company is an equal opportunity employer and will consider all applications without regard to race, sex, age, color, religion, national origin, veteran status, disability, sexual orientation, gender identity, genetic information or any characteristic protected by law. San Francisco Fair Chance Ordinance: Pursuant to the San Francisco Fair Chance Ordinance, for all positions located in the city and county of San Francisco, we will consider for employment qualified applicants with arrest and conviction records. Massachusetts Lie Detector: It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability. Use of Artificial Intelligence (AI): We may use Artificial Intelligence (AI) to support parts of our hiring process, including sourcing, screening, and evaluating candidates. AI helps assess applications and qualifications, but final decisions are made by our hiring team. By applying, you acknowledge and agree that your application may be reviewed using AI tools. Related Jobs Machine Learning Engineer - LLM Evaluation & Automation TEKsystems Cupertino, CA*Remote PySpark Research Dashboard Operations Automation Scalability Investigation Data Pipelines Machine Learning Business Valuation Scalability Design Prompt Engineering Workflow Management Product Engineering Full Stack Development Distributed Data Store Artificial Intelligence Business Transformation SQL (Programming Language) Critical Illness Insurance Python (Programming Language) Large Language Model Evaluation Key Performance Indicators (KPIs) Error Analysis (Numerical Analysis) +0

Apply for this position