Войти
  • 14157Просмотров
  • 1 год назадОпубликованоGoogle Cloud Tech

How to evaluate AI applications

Vertex AI Evaluation Service Tutorial Notebooks → How do developers know if their AI applications are working effectively? How can developers measure AI performance? In this episode of Real Terms for AI, Googlers Aja Hammerly and Jason Davenport delve into creating golden datasets, defining essential metrics, and utilizing tools to measure any AI application's performance. Chapters: 0:00 - Welcome 0:34 - Evaluating models versus evaluating apps 1:31 - Grounding 2:17 - Sources of evaluation data 3:47 - Define metrics and evaluation 5:07 - Analyzing and understanding metrics 6:19 - Ongoing evaluation 7:48 - Summary Watch more Real Terms for AI → Subscribe to Google Cloud Tech → #GoogleCloud #GenerativeAI Speakers: Aja Hammerly, Jason Davenport Products Mentioned: Gemini, Cloud General, Vertex AI