Software Engineer - AI Interaction Evaluator
Job description
We're looking for a highly experienced software engineer (Sr+) to help evaluate the quality of interactions with modern coding agents such as OpenAI Codex and Claude Code. You will assess how AI coding agents behave in real-world scenarios, focusing on:
- Whether the response makes sense
- Whether the preamble and reasoning are useful
- Whether the output reflects strong engineering judgment
- Whether the interaction feels right to an experienced developer
This role is about engineering taste - not syntax correctness.
What You'll Be Doing
- Evaluate AI-generated coding interactions end-to-end
- Judge whether outputs are:
  - Useful
  - Correct (at a high level)
  - Aligned with how a strong engineer would think
- Assess the quality of explanations and reasoning, not just code
- Distinguish between different levels of response quality (e.g. what makes something a 2 vs 4)
- Provide clear, opinionated feedback on:
  - What worked
  - What didn't
  - What felt "off" or misleading
- Help define what great looks like when interacting with tools like Cursor
What We Mean by "Taste"
We're specifically looking for engineers who can answer questions like:
- Does this feel like something a strong engineer would actually say?
- Is this explanation helpful, or just technically correct?
- Is the model guiding the user well, or just dumping output?
- Would this interaction build or erode trust?
Requirements
- Staff / Principal-level engineer (or equivalent experience)
- Strong background in at least one of the following:
  - TypeScript / JavaScript
  - Python
- Hands-on experience using:
  - OpenAI Codex
  - Claude Code
  - Cursor
- Deep familiarity with modern AI-assisted dev workflows
- Able to evaluate code without needing to fully execute or deeply review every line
- Comfortable giving direct, opinionated feedback
- High bar for what "good engineering" looks like
Nice to Have
- Experience with tools like Cursor or similar AI-first IDEs
- Prior exposure to prompt design or evaluation workflows
- Experience mentoring senior engineers or defining engineering standards