Development PracticesDevelopment Practices
Conference50min
INTERMEDIATE

Testing Challenges in the Age of AI

AI apps challenge traditional QA due to non-determinism, security needs, and complex agent workflows. This session introduces a layered testing approach—mock-based, behavioral, and live-model tests—using mokksy.dev and promptfoo, with real-world JetBrains Koog examples. Attendees learn practical strategies to ensure quality and reliability in evolving AI systems.

Konstantin Pavlov
Konstantin PavlovJetBrains

talkDetail.whenAndWhere

Wednesday, October 8, 12:00-12:50
Room 6
talks.description
AI apps break QA: non‑determinism, latency, rate limits and schema violations make assertion tests flaky, turning CIs red and eroding confidence. Security/privacy add complexity: data must be anonymized, prompts sanitized, leaks prevented—areas beyond standard scans. Real users submit jailbreaks, informal queries and RAG patterns vary between staging and production. Agent workflows worsen this: multi‑step tool orchestration with retries, approvals and inter‑agent coordination exceed unit or integration scopes.
This session presents a layered testing strategy: fast, mock‑based tests with (mokksy.dev); behavioral tests validating tool calls and security; and live‑model sanity checks with LLMs as evaluation and assertion mechanisms (promptfoo). Examples using JetBrains Koog agentic framework demonstrate these patterns i practice.
Attendees will gain a clear map of AI testing pitfalls and actionable techniques to balance development speed with quality in unpredictable environments.

workflow
security
llms
testing
talks.speakers
Konstantin Pavlov

Konstantin Pavlov

JetBrains

Estonia

A software engineer with over two decades of experience in IT, specializing in server‑side development in Kotlin and Java. Expert in designing and delivering production systems in AI, communications, and financial technology domains. Past work includes developing AI‑powered contact center solutions at Twilio, and currently contributing to open‑source AI frameworks such as JetBrains Koog and LangChain4j. Deeply committed to software quality, with a hands‑on approach to testing, resilient architecture, and practical solutions in real‑world AI systems.
comments.title

comments.speakerNotEnabledComments