AI Engineer
About
We’re building Jenny, a home assistant. Jenny helps homeowners take a project from first idea to final payment — understanding scope, navigating bids, and keeping everything on track without the chaos that usually comes with it. We’re a small team in San Francisco working on a problem that touches nearly every household.
The Role
We’re hiring an AI Engineer to own the agentic systems at the core of Jenny. This is not a research role — you’ll design, build, and ship the agent workflows that homeowners and contractors depend on every day.
You’ll work at the intersection of LLM systems and product, making real architectural decisions about how agents reason, plan, use tools, and hand off between each other. The problems are hard: multi-step workflows, ambiguous user intent, long-horizon task management, and accuracy requirements that matter when real money is on the line.
What You’ll Work On
- Design and own end-to-end agentic workflows — from user intent to structured output across multiple steps and tool calls
- Build the planning and reasoning layer that allows Jenny to scope projects, clarify ambiguity, and drive tasks to completion
- Develop tool-use and memory systems that give agents persistent context across a project’s lifecycle
- Build eval frameworks to measure agent accuracy, reliability, and regression across workflows
- Integrate external data sources (cost databases, permit records, contractor profiles) as grounded context for agent decisions
- Collaborate closely with product to translate user needs into agent behavior — and push back when the right solution is simpler
What We’re Looking For
- 3+ years of software engineering experience, with at least 1 year focused on LLM or agent systems in production
- Deep understanding of agentic architectures — tool use, planning loops, multi-agent coordination, retrieval, and memory
- Strong Python engineering skills; comfort working across the stack when needed
- Experience building and running evals — not just shipping prompts, but measuring whether they work
- High agency: you identify problems, propose solutions, and drive them to done without waiting to be told
- Familiarity with frameworks like LangGraph, LlamaIndex, or similar is a plus — but we care more about first-principles thinking than framework knowledge
Details
- Location: In-person, San Francisco — 1700 Montgomery St
- Type: Full-time
- Compensation: Competitive salary + equity
Apply
Send a short note about what you’ve built and why this problem interests you to jobs@hijenny.ai. No cover letter required — just tell us something real.