Apply

Founding AI Engineer - Staff

Engineering

Full time

Bay area, CA

About the Role

The engineering problem: general-purpose LLMs aren't good enough for legal work in their default state. We're building expert-in-the-loop systems which capture lawyer feedback continuously, feed it into our knowledge base so that the right lawyer guidance can be provided to LLMs via an agentic RAG process. The process involves a robust evaluation process to ensure high quality. As a Staff AI Engineer, you'll own the core AI systems that solve this problem. You'll set the technical direction for how we build, evaluate, and ship AI that works alongside lawyers in production.

What You'll Do

Build and maintain LLM-based agents that handle complex legal workflows with lawyer oversight at critical decision points
Design expert-in-the-loop systems where attorney feedback continuously improves AI quality — the core differentiator of our product
Build evaluation infrastructure including LLM-as-a-judge evals for continuous monitoring, internal tooling for collecting feedback, and frameworks for running eval sets efficiently
Stay on top of the frontier — track latest LLM architectures and model updates, build a framework to evaluate potential enhancements to the product
Work directly with lawyers and product to understand how AI should behave in legal contexts and translate that into engineering decisions

What We're Looking For

Required

5+ years of production backend engineering experience in Python — you write code that ships and scales, not just notebooks
1+ year building LLM-based agents in production - agents that call external APIs, manage state, and make decisions.
Built evaluation systems for measuring prompt and agent performance pre and post deployment.
Tech lead experience: you've owned architecture decisions, led development, handled deployment and monitoring
AI-native workflow: you're already writing 90%+ of your code with tools like Claude Code, Cursor, or Codex
In-person in SF: we build together in the office — this isn't a remote role

Nice to Have

Experience building planning-based agents — systems that reason about goals, decompose tasks, and decide on actions autonomously
Experience designing systems with expert/human feedback loops — where domain experts continuously improve AI output

About Company

Inhouse is the #1 AI lawyer for small to midsize businesses. We combine AI, lawyers from our own law firm, and an expert feedback loop to deliver fast, compliant, high-quality legal work at a fraction of the cost and turnaround time of traditional firms.

We're a small team, early stage, building in SF. You'll have real ownership over architecture and direction, ship to production fast, and work directly with attorneys who'll make you smarter about a domain most engineers never touch.

Apply