Remote Software Developer (LLM Evaluation)

Turing

yesterday

Role details

Contract type

Temporary contract

Employment type

Part-time / full-time

Working hours

Regular working hours

Languages

English

Experience level

Intermediate

Job location

Remote

Tech stack

Java

JavaScript

API

Artificial Intelligence

C Sharp (Programming Language)

C++

Python

Machine Learning

Software Architecture

Software Engineering

Rust

React

Large Language Models

FastAPI

Api Design

Job description

Curate code examples, build solutions, and correct code for AI model training in Python, JavaScript (including ReactJS), C/C++, Java, Rust, and Go
Evaluate and refine AI-generated code to ensure efficiency, scalability, and reliability
Collaborate with cross-functional teams to enhance AI-driven coding solutions against industry performance benchmarks
Build agents that can verify the quality of code and identify error patterns
Hypothesize on steps in the software engineering cycle (prototyping, architecture design, API design, production implementation, launch, experiments, monitoring, operational maintenance) and evaluate model capabilities on them
Design verification mechanisms that can automatically verify a solution to a software engineering task

Technologies:

AI
API
C#
FastAPI
Java
JavaScript
LLM
Python
Rust

Requirements

3+ years of software engineering experience
Strong expertise in building full-stack applications and deploying scalable, production-grade software using modern languages and tools
Deep understanding of software architecture, design, development, debugging, and code quality/review assessment
Excellent oral and written communication skills for clear, structured evaluation rationales
Candidates must be based out of US, Canada, or WEU countries (UK, Netherlands, Italy, Germany, )

Benefits & conditions

Salary: ? - ? € per year, Salary: 200 - 300 USD per HOUR

Tech stack: Python, JavaScript, C++, C#, Java, FastAPI
Category: Python Developer / Engineer
Location address: 548 Market Street, PMB 18282, San Francisco, United States
Benefits & perks: Fully home office / remote work; Flexible work time
About Us: Based in San Francisco, California, Turing is the worlds leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing supports customers by accelerating frontier research with high-quality data, advanced training pipelines, and top AI researchers specializing in software engineering, logical reasoning, STEM, multilinguality, multimodality, and agents; and by helping enterprises transform AI from proof of concept into proprietary intelligence that performs reliably and delivers measurable impact on the P&L.
Project Overview: Create cutting-edge datasets for training, benchmarking, and advancing large language models; curate code examples; provide precise solutions and corrections across multiple languages; evaluate and refine AI-generated code; collaborate with researchers and cross-functional teams to enhance enterprise-level AI-driven coding solutions.
Engagement Details: Commitment flexible, minimum 10 hrs/week up to 40 hrs/week (partial PST overlap required); Type: Contractor (no medical/paid leave); Duration: 1 month (starting next week; potential extensions based on performance and fit)
View this job and over 500 other transparent jobs with salaries & tech stacks on DevITJobs
Are you looking for Python jobs in San-Francisco?

Role details

Job location

Tech stack

Job description

Requirements

Benefits & conditions

Apply for this position

Good distractions

Moments

Videos View all