Development PracticesConference50min
Testing Challenges in the Age of AI
AI apps challenge traditional QA due to non-determinism, security needs, and complex agent workflows. This session introduces a layered testing approach—mock-based, behavioral, and live-model tests—using mokksy.dev and promptfoo, with real-world JetBrains Koog examples. Attendees learn practical strategies to ensure quality and reliability in evolving AI systems.
Konstantin PavlovJetBrains
talkDetail.whenAndWhere
Wednesday, October 8, 12:00-12:50
Room 6
AI apps break QA: non‑determinism, latency, rate limits and schema violations make assertion tests flaky, turning CIs red and eroding confidence. Security/privacy add complexity: data must be anonymized, prompts sanitized, leaks prevented—areas beyond standard scans. Real users submit jailbreaks, informal queries and RAG patterns vary between staging and production. Agent workflows worsen this: multi‑step tool orchestration with retries, approvals and inter‑agent coordination exceed unit or integration scopes.
This session presents a layered testing strategy: fast, mock‑based tests with (mokksy.dev); behavioral tests validating tool calls and security; and live‑model sanity checks with LLMs as evaluation and assertion mechanisms (promptfoo). Examples using JetBrains Koog agentic framework demonstrate these patterns i practice.
Attendees will gain a clear map of AI testing pitfalls and actionable techniques to balance development speed with quality in unpredictable environments.
This session presents a layered testing strategy: fast, mock‑based tests with (mokksy.dev); behavioral tests validating tool calls and security; and live‑model sanity checks with LLMs as evaluation and assertion mechanisms (promptfoo). Examples using JetBrains Koog agentic framework demonstrate these patterns i practice.
Attendees will gain a clear map of AI testing pitfalls and actionable techniques to balance development speed with quality in unpredictable environments.
Konstantin Pavlov
A software engineer with over two decades of experience in IT, specializing in server‑side development in Kotlin and Java. Expert in designing and delivering production systems in AI, communications, and financial technology domains. Past work includes developing AI‑powered contact center solutions at Twilio, and currently contributing to open‑source AI frameworks such as JetBrains Koog and LangChain4j. Deeply committed to software quality, with a hands‑on approach to testing, resilient architecture, and practical solutions in real‑world AI systems.
comments.speakerNotEnabledComments