Founding AI Engineer - Staff
Engineering
Full time
Bay area, CA
About the Role
The engineering problem: general-purpose LLMs aren't good enough for legal work in their default state. We're building expert-in-the-loop systems which capture lawyer feedback continuously, feed it into our knowledge base so that the right lawyer guidance can be provided to LLMs via an agentic RAG process. The process involves a robust evaluation process to ensure high quality. As a Staff AI Engineer, you'll own the core AI systems that solve this problem. You'll set the technical direction for how we build, evaluate, and ship AI that works alongside lawyers in production.
What You'll Do
- Build and maintain LLM-based agents that handle complex legal workflows with lawyer oversight at critical decision points
- Design expert-in-the-loop systems where attorney feedback continuously improves AI quality — the core differentiator of our product
- Build evaluation infrastructure including LLM-as-a-judge evals for continuous monitoring, internal tooling for collecting feedback, and frameworks for running eval sets efficiently
- Stay on top of the frontier — track latest LLM architectures and model updates, build a framework to evaluate potential enhancements to the product
- Work directly with lawyers and product to understand how AI should behave in legal contexts and translate that into engineering decisions
What We're Looking For
Required
- 5+ years of production backend engineering experience in Python — you write code that ships and scales, not just notebooks
- 1+ year building LLM-based agents in production - agents that call external APIs, manage state, and make decisions.
- Built evaluation systems for measuring prompt and agent performance pre and post deployment.
- Tech lead experience: you've owned architecture decisions, led development, handled deployment and monitoring
- AI-native workflow: you're already writing 90%+ of your code with tools like Claude Code, Cursor, or Codex
- In-person in SF: we build together in the office — this isn't a remote role
Nice to Have
- Experience building planning-based agents — systems that reason about goals, decompose tasks, and decide on actions autonomously
- Experience designing systems with expert/human feedback loops — where domain experts continuously improve AI output
Inhouse is the #1 AI lawyer for small to midsize businesses. We combine AI, lawyers from our own law firm, and an expert feedback loop to deliver fast, compliant, high-quality legal work at a fraction of the cost and turnaround time of traditional firms.
We're a small team, early stage, building in SF. You'll have real ownership over architecture and direction, ship to production fast, and work directly with attorneys who'll make you smarter about a domain most engineers never touch.