🎬 Videos — Observability, LLM Evaluation, Testing & DevOps

Overview

Videos covering LLM evaluation frameworks, observability, and testing. PUMA uses: CodeCarbon, Arize Phoenix, Wilcoxon tests, Promptfoo. See also: Carbon-Tracking-Log · LN-Tools-Dev-Environment


LLM Evaluation & Observability

#TitleChannelURLPUMA Relevance
1Construyendo IA Fiable: Evals, Trazabilidad y Observabilidad | LambdaCast 34LambdaLoopershttps://www.youtube.com/watch?v=qZ2Eu3kqA_g⭐⭐⭐⭐⭐ PUMA’s evaluation design; evals + traceability
2Curso evaluacion LLM con Promptfoo - episodio 1La Hora Makerhttps://www.youtube.com/watch?v=nGaHoH9HHu0⭐⭐⭐⭐ Promptfoo for LLM evaluation
3AI Testing Series Day 1 || Test AI 10× Faster with promptfoo!AB Automation Hubhttps://www.youtube.com/watch?v=vfHu2-YLBWEPromptfoo basics
4AI Testing Series Day 2 || Variable Injection & Assertions in promptfooAB Automation Hubhttps://www.youtube.com/watch?v=9S9UbvxO60cPromptfoo advanced
5Introduction to Observability and Prometheus TutorialNullSafe Architecthttps://www.youtube.com/watch?v=sNk9NkgTOLsObservability fundamentals

AI Testing (Playwright & TestSprite)

#TitleChannelURLPUMA Relevance
6Este IA hace el testing por ti (TestSprite)Fazt Codehttps://www.youtube.com/watch?v=-BKm_wUg9P8AI-automated testing
7TestSprite MCP + GitHub Copilot CLI = Agentic AI Agent TestExecute Automationhttps://www.youtube.com/watch?v=iSZMfK6SqRIMCP-based testing
8Claude Code + Playwright Claude Code + Playwright = INSANE Browser AutomationsChase AIhttps://www.youtube.com/watch?v=I9kO6-yPkfMPlaywright automation
9Claude Code + Playwright CLI: Automate QA with Less TokensEric Techhttps://www.youtube.com/watch?v=nN5R9DFYsXYQA automation
10Multi-Agent Code Review - AI Tools Compare Verilog AnalysisCraig Hollabaughhttps://www.youtube.com/watch?v=YdS45rcqHl0Multi-agent code review

Carbon & Sustainability

#TitleChannelURLPUMA Relevance
11(CodeCarbon official docs)https://codecarbon.io⭐⭐⭐⭐⭐ Primary CodeCarbon resource
12How AI is Automating AI Research: The Agentic Loop Explained!AINexLayerhttps://www.youtube.com/watch?v=KjbaFUjPkpMResearch automation measurement

DevOps & CI/CD

#TitleChannelURLPUMA Relevance
13Railway CLI + IA: Despliega TODO sin tocar nada 🤯Fazt Codehttps://www.youtube.com/watch?v=kzeNxAdpV6gRailway deployment
14De saturar 4GB a Infraestructura Mínima: Refactorización de un MVP en GoDevExperthttps://www.youtube.com/watch?v=zcID70f04g4MVP infrastructure
15Google Cloud Tech: Monitoring configuration and automating detection & remediationGoogle Cloudhttps://www.youtube.com/watch?v=uaa6VNxcn2sCloud monitoring automation

PydanticAI (Output Validation)

#TitleChannelURLPUMA Relevance
16E124 - Creando agentes con PydanticAIen_codershttps://www.youtube.com/watch?v=txRPLlkK4KE⭐⭐⭐⭐ PydanticAI agent creation
17Pydantic AI + DeepSeek V3 - The BEST AI Agent ComboCole Medinhttps://www.youtube.com/watch?v=zf_D2Eafvk0PydanticAI + DeepSeek
18LLM Tutorial REVOLUTIONIZED with PydanticAI’s AI-Powered Tech SupportAtef Atayahttps://www.youtube.com/watch?v=hDoN9AetTmsPydanticAI for structured outputs

MOCs