Lead AI Test Automation Engineer

Philippines Posted on March 6, 2026

Job Description

Lead AI Test Automation Engineer
DevRev

About the role: As our Lead AI Test Automation Specialist, you'll develop testing strategies, evaluation frameworks, and quality metrics specifically designed for LLM-powered applications. This role requires a unique blend of QA expertise, understanding of GenAI behaviour, and automation skills to ensure our AI features are reliable, accurate, and trustworthy.

Key Responsibilities:

Design and implement comprehensive testing strategies for GenAI features, including conversational AI, agentic systems, and LLM-powered workflows

Develop automated test suites for prompt testing, including regression tests that detect unintended changes in model behaviour

Create evaluation frameworks to measure GenAI quality across multiple dimensions (accuracy, relevance, safety, consistency, latency)

Build and maintain test datasets and golden examples that represent diverse user scenarios and edge cases

Implement monitoring and alerting systems to detect quality degradation in production GenAI features

Perform adversarial testing to identify potential failures, hallucinations, biases, or security vulnerabilities in AI systems

Collaborate with engineers to define acceptance criteria and quality gates for AI feature releases

Develop tools and frameworks that make it easy for engineers to test their GenAI implementations

Conduct user acceptance testing and gather feedback on AI feature performance from internal users

Document testing procedures, known issues, and quality metrics in clear, accessible formats

Partner with Product and Design teams to ensure AI features meet user experience standards

Stay current with GenAI testing methodologies, tools, and industry best practices

Your Qualifications

PRE or test engineering experience, preferably with AI/ML systems.

Strong understanding of GenAI technologies including LLMs, prompt engineering, and AI application patterns

Experience with test automation frameworks and scripting (Python, JavaScript, Selenium, Pytest)

Knowledge of software testing methodologies (functional, integration, regression, performance, security testing)

Ability to design test cases and evaluation criteria for non-deterministic systems

Strong analytical and problem-solving skills with attention to detail

Experience with API testing tools (Postman, REST Assured) and backend testing

Familiarity with CI/CD pipelines and automated testing integration

Excellent communication skills for documenting issues and collaboration

Preferred Qualifications

Experience testing conversational AI, chatbots, or agentic systems

Knowledge of ML model evaluation metrics and techniques

Familiarity with LLM evaluation frameworks (LangSmith, PromptFoo, Ragas)

Experience with performance testing and load testing AI APIs

Understanding of responsible AI principles, including fairness, transparency, and safety testing

Background in enterprise software or SaaS QA

Experience with test management tools (TestRail, Zephyr, Jira)

Knowledge of security testing methodologies for AI systems

Scripting experience with Python, including working with LLM APIs

What Makes This Role Exciting

Define Quality practices for GenAI applications

Work on cutting-edge AI technologies and help ensure they're reliable and trustworthy

Shape quality standards that will impact millions of enterprise users

Collaborate closely with engineers, data scientists, and product teams

Grow expertise in a highly specialized and increasingly important domain

Influence the entire AI product development lifecycle from design to release

Join a team that values quality as a first-class concern, not an afterthought

Join us in innovating our testing processes and ensuring the delivery of high-quality software products through advanced automation techniques.

About DevRev

AI is changing the way we work. At DevRev, we’re making it conversational. DevRev’s AI-native business operating system designed for you to experience conversational AI at work—transforming complex processes into simple, intuitive conversations that drive real results. Our conversational interface unlocks highly accurate search, hyper-personalized workflows, and lightning-fast analytics for your teams—all through a single conversational interface that feels as natural as talking to a teammate. Our patented AI solutions, Airdrop and Knowledge Graph, take you beyond automation. By unifying data across all your systems, we give your teams the power to access all siloed information in one place, build self-service user experiences, ship impactful products faster, and boost customer satisfaction—without missing a beat. Founded in 2020 by Dheeraj Pandey, co-founder and former CEO of Nutanix, and Manoj Agarwal, former SVP of Engineering at Nutanix, DevRev was built from the ground up for the AI era with a singular mission: to build the world’s most customer-centric companies. Our platform bridges Dev (your teams) and Rev (your customers), bringing them together through the power of AI. Global by design, with offices in North America, South America, Europe, Asia, and Australia, DevRev is backed by $100.8 million from Khosla Ventures and Mayfield Fund, with a $1.15 billion valuation. Recognized among CRN’s top 10 cloud startups to watch in 2025 and by Forbes as one of America’s best startup employers in 2024, we’re just getting started. Join us and see how AI-powered conversations can connect your teams, delight your customers, and accelerate your business.

Industry: Software Development