01 Zakres zadań
As Principal AI Engineer you are the technical owner of the AI core. You lead the audit of the existing codebase, define the architecture we evolve toward, build the test and evaluation harness that lets us ship changes safely, and you are the engineer who is paged when the AI surface misbehaves in production.
What you'll do
First 90 days
- Audit atlas-ai: agent flows, LangGraph state machines, Celery topology, datastore usage, OpenAI integration patterns
- Produce a written assessment of operational risk: failure modes, race conditions, retry semantics, idempotency, checkpoint integrity
- Quantify token cost per agent flow and per user session
- Identify the highest-risk subsystems and propose stabilisation plans
- Build (or harden) an evaluation harness for the agent flows — golden cases, regression suites, hallucination/safety tests
- Lead the knowledge-transfer sessions from the client's AI team
Ongoing
- Set the technical direction for the AI core
- Lead design for new agent flows and major changes to existing ones
- Own the production health of the AI surface (with platform/SRE support)
- Hire and mentor the rest of the AI squad (~10 engineers at full scale)
- Represent the AI core in cross-team architecture conversations with the client