Beyond the Prompt: Evaluating, Testing, and Securing LLM Applications
This talk discusses evaluating and securing LLM applications by measuring the effect of changes to prompts or RAG pipelines. It highlights evaluation frameworks such as Vertex AI Evaluation, DeepEval, and Promptfoo, a...