Pursue your passion and potential
Lead Full Stack Engineer- AI Tester
Gurgaon, India
Caring. Connecting. Growing together.
With these values to guide us, our people are committed to making a meaningful difference in the lives of those we are honored to serve.
Optum is a global organization that delivers care, aided by technology to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by inclusion, talented peers, comprehensive benefits and career development opportunities. Come make an impact on the communities we serve as you help us advance health optimization on a global scale. Join us to start Caring. Connecting. Growing together.
Primary Responsibilities:
- QE & Automation Foundations
- Strong QE fundamentals: test strategy, test planning, defect lifecycle
- Functional, integration, regression testing
- API testing (REST, JSON, Postman)
- Test automation using Python / JavaScript
- CI/CD awareness (Git, pipelines, build verification)
- Log analysis & debugging (CloudWatch / centralized logging)
- Strong QE fundamentals: test strategy, test planning, defect lifecycle
- Conversational AI Testing
- Intent, utterance, slot/entity testing (Amazon Lex or similar)
- Multi turn conversation validation
- Context & session management testing
- Interruptions, re phrasing, intent switching
- Fallback, disambiguation, error handling scenarios
- Voice vs Chat behavior differences (ASR errors, latency, barge in)
- Bot → Agent handoff validation
- Intent, utterance, slot/entity testing (Amazon Lex or similar)
- Agentic AI Testing
- Clear understanding of agentic behavior (plan → reason → act → observe)
- Testing non deterministic responses
- Multi step workflow validation
- Tool / API invocation correctness by agents
- Agent memory & state validation across turns
- Boundary based validation (acceptable vs unacceptable outcomes)
- Guardrail enforcement testing (what agent must NOT do)
- Clear understanding of agentic behavior (plan → reason → act → observe)
- LLM & Prompt Quality Engineering
- Prompt structure understanding (system vs dynamic prompts)
- Prompt regression testing
- Prompt versioning & rollback validation
- Hallucination detection & containment
- Response tone, empathy, policy compliance validation
- Safety, toxicity, and unsafe response testing
- Prompt structure understanding (system vs dynamic prompts)
- Production Readiness & Observability
- AI production defect RCA
- Monitoring AI KPIs:
- o Containment rate
- Escalation rate
- Conversation success/failure
- Hallucination incidents
- Latency, retries, throttling, timeout scenarios
- Release readiness sign off for AI systems
- Monitoring AI KPIs:
- AI production defect RCA
- Security, Privacy & Compliance
- PII / PHI masking validation
- Data boundary validation (what is sent to LLM vs retained)
- Prompt leakage risk testing
- Secure handling of logs and conversation transcripts
Compliance aware test scenarios (healthcare grade rigor)
- PII / PHI masking validation
- Comply with the terms and conditions of the employment contract, company policies and procedures, and any and all directives (such as, but not limited to, transfer and/or re-assignment to different work locations, change in teams and/or work shifts, policies in regards to flexibility of work benefits and/or work environment, alternative work arrangements, and other decisions that may arise due to the changing business environment). The Company may adopt, vary or rescind these policies and directives in its absolute discretion and without any limitation (implied or otherwise) on its ability to do so
Required Qualifications:
- Fulltime Graduation degree
- Skills:
- Advanced AI Evaluation
- LLM as Judge concepts
- Human in the loop evaluation
- Golden dataset creation
- Automated quality scoring frameworks
- LLM as Judge concepts
- Advanced Automation & Scale
- Conversation simulation frameworks
- Synthetic utterance / conversation generation
- Data driven AI test automation
- AI assisted test case generation
- Conversation simulation frameworks
- Platform & Architecture Awareness
- Experience with Agent Orchestrators
- Multi agent coordination testing
- RAG pipeline understanding
- Integration testing across IVR, CRM, backend systems
- Experience with Agent Orchestrators
- Quality Leadership & Collaboration
- Risk based testing mindset
- Ability to challenge product assumptions
- Strong collaboration with Product, AI Engineers, Ops
- Executive ready quality metrics & reporting
- Comfort with ambiguity and evolving AI behavior
- Risk based testing mindset
- Advanced AI Evaluation
At UnitedHealth Group, our mission is to help people live healthier lives and make the health system work better for everyone. We believe everyone-of every race, gender, sexuality, age, location and income-deserves the opportunity to live their healthiest life. Today, however, there are still far too many barriers to good health which are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes. We are committed to mitigating our impact on the environment and enabling and delivering equitable care that addresses health disparities and improves health outcomes - an enterprise priority reflected in our mission.
Benefits
Our mission of helping people live healthier lives extends to our team members. Learn more about our range of benefits designed to help you live well.
Life
Resources and support to focus on what matters most to you, in every facet of your life.
Emotional
Education, tools and resources to help you reduce and manage stress, build resilience and more.
Physical
Health plans and other coverage to support wellness for you and your loved ones.
Financial
Benefits for today and to help you plan for the future, including your retirement.
We’re honored to be recognized for our exceptional work culture
Connect with us


