Senior Product Manager - Tech, GenAI, Amazon Rufus
Amazon.com, Inc, City of Westminster
Senior Product Manager - Tech, GenAI, Amazon Rufus
Salary not available. View on company website.
Amazon.com, Inc, City of Westminster
- Full time
- Permanent
- Onsite working
Posted 1 day ago, 10 May | Get your application in today.
Closing date: Closing date not specified
Job ref: 02606f0a6dcf4ddcb459231cfc7d33a8
Location ref: City of Westminster
Full Job Description
Amazon's Rufus AI team is building the future of conversational shopping. Rufus helps hundreds of millions of customers find and discover products through natural language, and behind every response is an automated quality measurement system powered by LLM-as-a-Judge (LLMAJ) technology. We are seeking a Sr. Product Manager-Tech to own the quality governance, global scaling, and operational excellence of this judge portfolio.
You will work alongside Language Engineers who build and tune judges, Product Managers who define quality criteria and evaluation standards, Data Scientists who operate evaluation pipelines, and Engineering teams who build the infrastructure that runs evaluations. This is a high-autonomy role: you own your domain end-to-end and are expected to drive decisions, not just track workstreams.
This role sits at the intersection of AI evaluation, product management, and applied tooling. You will own the governance framework for a portfolio of dozens of LLM judges that power critical evaluation metrics used for release decisions, competitive benchmarking, and leadership reporting. You will drive the localization of judges from en-US to 5+ international marketplaces, facilitate model evaluation and debugging workflows, and build purpose-built tools and agents to automate governance operations at scale., Own the LLMAJ governance framework: judge registry, versioning standards, quality validation gates, deprecation policies, and agreement rate monitoring across the full judge portfolio
- Own the international LLMAJ expansion: drive judge localization from en-US to global marketplaces, identify coverage gaps, define remediation plans, and validate judge quality per locale
- Facilitate model evaluation and debugging: work with Language Engineers and Scientists to trace response quality issues, inspect production logs, and root-cause judge disagreements or quality regressions
- Build purpose-built tools and agents: code automation using internal agent frameworks to streamline governance workflows, judge monitoring, data extraction, and reporting
- Define and own partner-facing quality metrics powered by LLMAJ, including defect rates, agreement rates, and evaluation dimension reporting across partner teams
- Drive human-in-the-loop validation workflows, coordinating between evaluation platforms and annotation teams to maintain judge calibration
- Drive discipline on evaluation requests by enforcing data-driven problem statements, clear scoping, and definition of done before work begins
- Write business requirements documents, contribute to leadership updates, and represent LLMAJ governance in cross-functional forums
A day in the life
You start the morning checking agreement rate dashboards for drift across international locales and triaging alerts. A new prompt release is shipping, so you pull evaluation results, spot two judges regressing in the Japanese marketplace, and open a debugging session with a Language Engineer to trace the root cause. After lunch, you present international judge coverage in a cross-functional review. In the afternoon, you ship an update to a governance agent you built that auto-generates weekly judge health reports. You close the day pushing back on an under-scoped evaluation request.
About the team
We are the team responsible for measuring whether Amazon's AI shopping assistant is actually good. We build LLM judges, define quality standards, and run evaluations that directly inform what ships to hundreds of millions of customers. Our team includes Language Engineers, Data Scientists, and Product Managers who work closely with Science, Engineering, and Product teams across the organization. We move fast, care deeply about measurement rigor, and believe that if you cannot measure quality automatically, you cannot improve it at scale.
Bachelor's degree
- Experience in technical product management, program management or engineering
- Experience owning/driving roadmap strategy and definition
- Experience with end to end product delivery
- Experience with feature delivery and tradeoffs of a product
- Experience contributing to engineering discussions around technology decisions and strategy related to a product
- Experience in representing and advocating for a variety of critical customers and stakeholders during executive-level prioritization and planning
Preferred Qualifications
- Experience in using analytical tools, such as Tableau, Qlikview, QuickSight
- Experience in building and driving adoption of new tools
Amazon is an equal opportunities employer. We believe passionately that employing a diverse workforce is central to our success. We make recruiting decisions based on your experience and skills. We value your passion to discover, invent, simplify and build. Protecting your privacy and the security of your data is a longstanding top priority for Amazon. Please consult our Privacy Notice (https://www.amazon.jobs/en/privacy_page) to know more about how we collect, use and transfer the personal data of our candidates.
Direct job link
Relevant jobs
- Biotechnology / Life Sciences Jobs in Bexley, Bexley
- Biotechnology / Life Sciences Jobs in Bexleyheath, Bexley
- Biotechnology / Life Sciences Jobs in Bromley, Barnsley
- Biotechnology / Life Sciences Jobs in Bromley Common, Bromley
- Biotechnology / Life Sciences Jobs in Camden Town, Greater London
- Biotechnology / Life Sciences Jobs in City of Westminster
- Biotechnology / Life Sciences Jobs in Croydon, Cambridgeshire
- Biotechnology / Life Sciences Jobs in Ealing, Ealing
- Biotechnology / Life Sciences Jobs in Enfield, Hyndburn
- Biotechnology / Life Sciences Jobs in Greenwich, Amber Valley
- Biotechnology / Life Sciences Jobs in Hackney
- Biotechnology / Life Sciences Jobs in Hammersmith and Fulham, Hammersmith and Fulham
- Biotechnology / Life Sciences Jobs in Harrow
- Biotechnology / Life Sciences Jobs in Hillingdon, Hillingdon
- Biotechnology / Life Sciences Jobs in Hounslow
- Biotechnology / Life Sciences Jobs in Islington, Leeds
- Biotechnology / Life Sciences Jobs in Kensington and Chelsea, Kensington and Chelsea
- Biotechnology / Life Sciences Jobs in Kingston upon Thames
- Biotechnology / Life Sciences Jobs in Lambeth, Lambeth
- Biotechnology / Life Sciences Jobs in Lewisham
- Biotechnology / Life Sciences Jobs in Merton, Oxfordshire
- Biotechnology / Life Sciences Jobs in Orpington, Greater London
- Biotechnology / Life Sciences Jobs in Richmond upon Thames
- Biotechnology / Life Sciences Jobs in Sidcup, Greater London
- Biotechnology / Life Sciences Jobs in Southwark
- Biotechnology / Life Sciences Jobs in Sutton, Doncaster
- Biotechnology / Life Sciences Jobs in Tower Hamlets, Tower Hamlets
- Biotechnology / Life Sciences Jobs in Twickenham, Greater London
- Biotechnology / Life Sciences Jobs in Wandsworth, Wandsworth
- Biotechnology / Life Sciences Jobs in Wimbledon, Greater London