AI Evaluation Engineer

Help track model quality and improve results with clear checks.

Remote, Full-time, Mid

Evaluation, AI, LLM, Quality

Snapshot

Team: Engineering

Category: AI / Machine Learning / LLM Engineering

Type: Full-time

Location: Remote

Level: Mid

Focus

Evaluation, scoring, quality checks

Role overview

You will create evaluation steps, score model outputs, and help teams understand model quality. You will support model updates with clear results.

Responsibilities

Requirements

Perks

Remote work

Learning support

Tooling help

Weekly syncs

Ready?

Apply now or talk to us about the team.