PUMA Vault
Tag: agents
50 articles with this tag.
01 May 2026
Context Engineering — Designing the LLM Context Window as a System
permanent
context-engineering
prompt-engineering
llm
context-window
rag
memory
tools
system-prompt
puma-core
research
agents
architecture
prompting
planning
agent-design
information-retrieval
01 May 2026
Tree of Thoughts — Deliberate Multi-Path Reasoning for LLMs
permanent
tree-of-thoughts
tot
deliberate-reasoning
planning
backtracking
bfs
dfs
beam-search
cot
llm-reasoning
puma-core
research
prompting
agents
reasoning
issue-triage
backlog-prioritisation
14 Apr 2026
Pensar con Prompts: La guía definitiva de la ingeniería de prompts
literature
prompting
prompt-engineering
costar
chain-of-thought
few-shot
zero-shot
structured-output
puma-core
book
methodology
research
literature-note
keshav
moc
llm
agents
human-ai-co-creation
generative-cognition
spanish
14 Apr 2026
LLM Wiki: Personal Knowledge Base Pattern
literature
knowledge-management
llm-wiki
rag-alternative
persistent-wiki
obsidian
markdown
agent-memory
karpathy
puma-core
blog-gist
agents
architecture
context-engineering
zettelkasten
knowledge-graph
retrieval
compounding-knowledge
ai-tools
research-tools
literature-note
moc
13 Apr 2026
Incident Management in the Age of AI: A Survey
literature
aiops
incident-management
triage
survey
llm
automation
puma-core
agents
benchmark
critical-thinking
effort-estimation
ict
issue-triage
keshav
literature-note
moc
project-management
research
sla
story-points
13 Apr 2026
AssistGUI: Task-Oriented Desktop Graphical User Interface Automation
literature
llm-agents
gui-automation
desktop
task-completion
multimodal
puma-core
agents
architecture
benchmark
coding
keshav
literature-note
llm
moc
project-management
research
tool-use
13 Apr 2026
Risks from Learned Optimization in Advanced Machine Learning Systems
literature
ai-safety
inner-alignment
deceptive-alignment
mesa-optimization
learned-optimization
puma-core
agents
architecture
ethics
keshav
literature-note
llm
moc
red-teaming
research
safety
13 Apr 2026
Generative Agents: Interactive Simulacra of Human Behavior
literature
llm-agents
generative-agents
simulacra
human-behavior
multi-agent
memory
reflection
puma-core
agents
architecture
benchmark
citation
critical-thinking
effort-estimation
keshav
literature-note
llm
moc
project-management
research
social-simulation
story-points
13 Apr 2026
AgentBench: Evaluating LLMs as Agents
literature
llm-agents
benchmark
agentbench
evaluation
puma-core
agents
architecture
baseline
coding
critical-thinking
effort-estimation
gpt
keshav
literature-note
llm
local-llm
moc
multi-agent
ollama
open-source
project-management
react
red-teaming
research
story-points
triage
web
13 Apr 2026
Reflexion: Language Agents with Verbal Reinforcement Learning
literature
llm-agents
reflexion
self-reflection
verbal-reinforcement
puma-core
agents
architecture
benchmark
chain-of-thought
cot
critical-thinking
effort-estimation
keshav
literature-note
llm
moc
multi-agent
prompting
react
reasoning
red-teaming
research
self-critique
story-points
13 Apr 2026
OpenAgents: An Open Platform for Language Agents in the Wild
literature
llm-agents
open-platform
data-agent
web-agent
plugins
puma-core
agents
architecture
benchmark
coding
critical-thinking
keshav
literature-note
llm
local-llm
moc
multi-agent
project-management
python
rag
research
tool-use
web
13 Apr 2026
Collaborating with AI Agents: Field Experiments on Teamwork, Productivity, and Performance
literature
human-ai-collaboration
field-experiment
teamwork
productivity
hitl
puma-core
agents
benchmark
critical-thinking
effort-estimation
ethics
ict
keshav
literature-note
llm
moc
multi-agent
project-management
research
13 Apr 2026
LLM-based Multi-Agent Systems for Software Engineering: Vision and Challenges
literature
llm-agents
multi-agent
software-engineering
mas
vision
puma-core
agents
architecture
benchmark
coding
critical-thinking
effort-estimation
ict
keshav
literature-note
llm
moc
project-management
research
sdd
spec-driven-development
story-points
triage
13 Apr 2026
Prompting Frameworks — CO-STAR, Self-Consistency, and Structured Prompting
permanent
prompting
costar
self-consistency
chain-of-thought
few-shot
zero-shot
structured-output
prompt-engineering
puma-core
research
agents
llm
benchmark
issue-triage
effort-estimation
architecture
13 Apr 2026
Fine-Tuning LLMs — LoRA, QLoRA, GGUF Quantization, and PUMA Considerations
permanent
fine-tuning
lora
qlora
quantization
gguf
ollama
llm
training
puma-core
research
agents
benchmark
local-models
effort-estimation
issue-triage
architecture
13 Apr 2026
Generative Agents — Memory Stream, Reflection, and Planning Architecture
permanent
generative-agents
memory-stream
reflection
planning
agent-architecture
llm
simulation
emergent-behavior
puma-core
research
agents
architecture
multi-agent
smart-pmo
persistent-memory
13 Apr 2026
Human-in-the-Loop (HITL) and Bounded Autonomy for AI Agents
permanent
hitl
human-in-the-loop
bounded-autonomy
ai-safety
ethics
oversight
control
human-ai-collaboration
puma-core
research
agents
architecture
project-management
accountability
alignment
13 Apr 2026
LLM Models Used in PUMA — Technical Reference
permanent
llm
models
llama
mistral
phi
gemma
deepseek
gpt4o
claude
qwen
moe
quantization
ollama
puma-core
research
benchmark
effort-estimation
issue-triage
agents
architecture
13 Apr 2026
Reflexion — Verbal Self-Reflection for Agent Self-Improvement
permanent
reflexion
self-reflection
agent
llm
verbal-reinforcement
self-critique
episodic-memory
actor-evaluator
puma-core
research
agents
architecture
benchmark
coding
issue-triage
effort-estimation
iterative-improvement
06 Apr 2026
MASAI: Modular Architecture for Software-Engineering AI Agents
literature
llm-agents
masai
software-engineering
modular
microsoft
puma-core
academic-writing
agents
architecture
benchmark
bibliography
citation
critical-thinking
effort-estimation
github
issue-triage
keshav
literature-note
llm
moc
multi-agent
orchestration
project-management
react
reading-method
reasoning
reasoning-action
red-teaming
research
scheduling
smart-pmo
story-points
swe-bench
triage
06 Apr 2026
GraphAgent: Agentic Graph Language Assistant
literature
graph-agent
knowledge-graph
agentic-ai
language-assistant
academic-writing
agents
bibliography
citation
graph-rag
keshav
literature-note
llm
llm-agents
moc
project-management
reading-method
research
sprint
06 Apr 2026
MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution
literature
multi-agent
github-issues
software-engineering
issue-resolution
puma-core
academic-writing
agents
benchmark
bibliography
citation
github
issue-triage
keshav
literature-note
llm
llm-agents
moc
pipeline
project-management
reading-method
research
swe-bench
triage
06 Apr 2026
The Landscape of Emerging AI Agent Architectures for Reasoning, Planning, and Tool Calling: A Survey
literature
survey
agent-architectures
reasoning
planning
tool-calling
puma-core
academic-writing
agents
architecture
benchmark
bibliography
chain-of-thought
citation
cot
critical-thinking
effort-estimation
gpt
keshav
literature-note
llm
llm-agents
local-llm
masai
metagpt
moc
multi-agent
ollama
openai
orchestration
project-management
rag
react
reading-method
reasoning-action
red-teaming
research
retrieval
story-points
tool-use
tree-of-thoughts
06 Apr 2026
A Taxonomy of Architecture Options for Foundation Model-based Agents: Analysis and Decision Model
literature
taxonomy
agent-architectures
foundation-models
decision-model
puma-core
academic-writing
agents
architecture
bibliography
citation
keshav
literature-note
llm
llm-agents
memory
moc
multi-agent
project-management
reading-method
research
06 Apr 2026
OpenHands: An Open Platform for AI Software Developers as Generalist Agents
literature
open-source
platform
ai-agents
software-engineering
openhands
agents
architecture
benchmark
bibliography
citation
github
keshav
literature-note
llm
llm-agents
moc
multi-agent
project-management
reading-method
smart-pmo
swe-bench
tool-use
06 Apr 2026
MemGPT: Towards LLMs as Operating Systems
literature
llm-agents
memory
operating-system
context-management
agents
architecture
bibliography
citation
keshav
literature-note
llm
memgpt
moc
project-management
rag
reading-method
retrieval
smart-pmo
sprint
06 Apr 2026
ReAct: Synergizing Reasoning and Acting in Language Models
literature
llm-agents
react
reasoning
acting
prompting
puma-core
academic-writing
agents
api
architecture
benchmark
bibliography
chain-of-thought
citation
cot
critical-thinking
effort-estimation
embeddings
few-shot
fine-tuning
issue-triage
jira
keshav
literature-note
llm
moc
multi-agent
permanent-note
project-management
rag
reading-method
reasoning-action
red-teaming
research
retrieval
sdd
smart-pmo
spec-driven-development
story-points
triage
vector-db
06 Apr 2026
¡No construyas Agentes IA hasta que veas esto! [Secretos de Anthropic]
video
agents
anthropic
context
best-practices
chain-of-thought
cot
data-formats
effort-estimation
issue-triage
json
metrics
moc
precision-recall
prompt-engineering
pydantic
python
story-points
triage
video-note
06 Apr 2026
Aprende los Agent Skills y usalos en cualquier herramienta IA
video
skills
agents
tools
context
anthropic
chain-of-thought
claude
cot
dev-tools
few-shot
ide
issue-triage
llm
moc
opencode
reasoning
triage
video-note
zero-shot
06 Apr 2026
Ingeniería de Contexto: La Habilidad CLAVE para crear AGENTES de IA ahora mismo
video
context-engineering
agents
key-skill
architecture
chain-of-thought
code-review
cot
few-shot
github
issue-triage
llama
meta
metrics
moc
para
precision-recall
project-management
triage
video-note
06 Apr 2026
Karpathy Just Replaced RAG With Obsidian + Claude Code
video
agents
karpathy
obsidian
anthropic
architecture
claude
dev-tools
embeddings
graph-rag
ide
knowledge-graph
knowledge-management
llm
metrics
moc
precision-recall
project-management
rag
retrieval
vault
vector-db
video-note
06 Apr 2026
12-Factor Agents: Patterns of reliable LLM applications
video
agents
reliability
design-patterns
checklist
effort-estimation
human-in-the-loop
langgraph
llm
moc
project-management
pydantic
python
story-points
video-note
06 Apr 2026
3 Advanced AI agent design patterns
video
agents
design-patterns
google
architecture
issue-triage
llm
metrics
moc
multi-agent
orchestration
planning
precision-recall
project-management
react
reasoning-action
smart-pmo
tool-use
triage
video-note
06 Apr 2026
Flujos de trabajo utilizando Agentes - Andrew Ng explica
video
agents
andrew-ng
workflows
few-shot
hypothesis
llm
metrics
moc
multi-agent
planning
precision-recall
project-management
rag
react
reasoning-action
research-methodology
retrieval
tool-use
video-note
workflow
06 Apr 2026
Building AI Agents that actually work (Full Course)
video
agents
course
practical
api
autogen
crewai
jira
langgraph
llm
metrics
moc
non-parametric
precision-recall
project-management
pydantic
python
statistics
video-note
wilcoxon
06 Apr 2026
Construyendo IA Fiable: Evals, Trazabilidad y Observabilidad
video
agents
observability
evals
arize
ai-ethics
ethics
evaluation
hypothesis
issue-triage
llm
moc
non-parametric
promptfoo
research-methodology
statistics
testing
triage
video-note
wilcoxon
06 Apr 2026
Karpathy's Autoresearch: We Achieved Near-Human Scores in 2 Hours!
video
agents
karpathy
autoresearch
benchmark
anthropic
claude
dev-tools
ide
llm
metrics
moc
obsidian
precision-recall
project-management
rag
retrieval
vault
video-note
workflow
06 Apr 2026
The only AutoResearch tutorial you'll ever need
video
agents
karpathy
autoresearch
tutorial
anthropic
claude
dev-tools
hypothesis
ide
keshav
metrics
moc
obsidian
pipeline
precision-recall
reading-method
research-methodology
vault
video-note
06 Apr 2026
¡No construyas Agentes IA hasta que veas esto! [Secretos de Anthropic]
video
agents
anthropic
design-principles
effort-estimation
human-in-the-loop
llm
metrics
moc
precision-recall
project-management
pydantic
python
story-points
video-note
06 Apr 2026
AI Agent Specialization. RAG vs Fine-tuning — T3chFest 2026
video
agents
rag
fine-tuning
specialisation
academic-writing
benchmark
embeddings
issue-triage
jira
llm
metrics
moc
precision-recall
project-management
research
retrieval
swe-bench
triage
vector-db
video-note
06 Apr 2026
Orquestación de Agentes: Control Determinista con Hooks
video
agents
orchestration
determinism
hooks
effort-estimation
llm
moc
pydantic
python
story-points
video-note
06 Apr 2026
Agentes de IA y LangGraph: cómo las empresas reducen costos
video
agents
langgraph
enterprise
cost
api
automation
code-review
github
issue-triage
llm
metrics
moc
orchestration
precision-recall
project-management
triage
video-note
06 Apr 2026
Codelab: Construyendo un Sistema Multi-Agente Multimodal para Análisis de Eviden
video
agents
multi-agent
multimodal
evidence
architecture
benchmark
github
jira
langgraph
llm
mas
masai
moc
para
project-management
smart-pmo
sprint
video-note
06 Apr 2026
MAIA Master Class — De NLP a la IA Agéntica: Una visión general
video
agents
academic
nlp
overview
code-review
critical-thinking
github
llm
metrics
moc
multi-agent
precision-recall
project-management
prompt-engineering
rag
reasoning
red-teaming
retrieval
tool-use
video-note
06 Apr 2026
How to Permanently Fix Your Forgetful AI Agents (Full Guide)
video
agents
memory
persistence
langgraph
architecture
effort-estimation
embeddings
issue-triage
moc
rag
retrieval
semantic-search
sprint
story-points
triage
vector-db
video-note
06 Apr 2026
E124 — Creando agentes con PydanticAI
video
pydanticai
agents
structured-output
python
effort-estimation
issue-triage
local-llm
moc
ollama
pydantic
reasoning
story-points
triage
video-note
06 Apr 2026
🤖 BMAD Agent Roster — PUMA Project
bmad
agents
agentic
multi-agent
puma
agile
ai-ethics
ai-tools
anthropic
architecture
backlog
benchmark
bias
carbon-footprint
checklist
claude
codecarbon
cornell-notes
critical-thinking
dataset
dev-tools
effort-estimation
ethics
falsifiability
finer
github
human-in-the-loop
hypothesis
ide
issue-triage
jira
keshav
langgraph
literature-review
local-llm
mas
metrics
moc
non-parametric
note-taking
obsidian
ollama
openhands
openspec
orchestration
pec
perplexity
planning
popper
precision-recall
prisma
project-management
python
reading-method
red-teaming
research-methodology
research-tools
scrum
sdd
semantic-scholar
slr
smart-pmo
software-engineering
spec-driven-development
spec-kit
sprint
statistics
story-points
sustainability
tawos
triage
vault
wilcoxon
workflow
06 Apr 2026
🤖 BMAD Agent Prompts — PUMA Project
bmad
prompts
agents
rcoif
cdd
puma
academic-writing
agile
api
architecture
baseline
benchmark
bibliography
carbon-footprint
chain-of-thought
checklist
citation
codecarbon
cot
data-formats
dataset
drca
effect-size
effort-estimation
egi
evaluation
falsifiability
few-shot
github
gtd
human-in-the-loop
ict
issue-triage
jira
json
keshav
literature-review
llama
llm
local-llm
meta
metrics
mistral
multi-agent
non-parametric
ollama
openspec
pec
planning
popper
precision-recall
project-management
prompt-template
prompting
python
reading-method
research
research-methodology
rest-api
sdd
slr
software-engineering
spec-driven-development
spec-kit
sprint
statistics
story-points
sustainability
tawos
triage
validity
wilcoxon
zero-shot
06 Apr 2026
📊 MOC — LLM Benchmarks, PM-AI Convergence & Agent Architectures (v2)
moc
llm-benchmarks
pm-ai
agents
architectures
academic-writing
agentscope
aiops
aiopslabs
architecture
autogen
baseline
benchmark
bibliography
chain-of-thought
chatdev
citation
cot
critical-thinking
devops
embeddings
evaluation
gaia
github
gpt
langgraph
llm
local-llm
mas
masai
mcp
memgpt
memory
metagpt
multi-agent
navigation
ollama
openai
openhands
orchestration
project-management
protocol
rag
react
reasoning
reasoning-action
red-teaming
research
retrieval
root-cause-analysis
security
smart-pmo
software-engineering
swarm-intelligence
swe-bench
tree-of-thoughts
vector-db
workflow
01 Mar 2026
LLM Agents — Definition and Taxonomy
permanent
concept
llm-agents
agents
orchestration
ai-ethics
ami
anthropic
api
architecture
artefact
baseline
bias
chain-of-thought
claude
code-review
cot
crewai
critical-thinking
data-formats
dataset
dev-tools
devops
docker
drca
dsr
effort-estimation
ethics
evaluation
few-shot
github
hypothesis
iipr
issue-triage
jira
json
langgraph
llama
llm
local-llm
memory
meta
mistral
moc
multi-agent
ollama
opencode
permanent-note
planning
project-management
prompt-engineering
prompting
python
rag
rcoif
reasoning
red-teaming
research-methodology
retrieval
software-engineering
story-points
tawos
tool-use
triage
validity
zero-shot
zettelkasten