πΊοΈ MOC β PUMA Master Map
Overview
Central navigation hub for the PUMA project. PUMA: Platform for Understanding and Management with Agents. βCan language models manage ICT projects?β
π― Project Identity
Full title: Pueden los modelos de lenguaje gestionar proyectos tecnolΓ³gicos? PUMA: Plataforma de benchmark para la evaluaciΓ³n empΓrica de agentes en tareas de gestiΓ³n de proyectos.
Research question: Do different LLM models and prompting strategies produce statistically significant differences in issue triage quality (F1-macro) and effort estimation (MAE) on real PM datasets with verified labels?
Hypotheses: EX-Hypotheses-H1-H2
MVP: Triage module (Stage 1) + statistical validation. Self-contained academic contribution.
π PUMA Project Structure β Vault Mapping
| PUMA Project | Content | Vault Location |
|---|---|---|
| 1. Introduction | Context, objectives, methodology, planning | PR-PUMA-Ch1-Introduction |
| 2. Materials & Methods | DSR + SLR + experiment design + stack | PR-PUMA-Ch3-Methods |
| 3. Results | F1-macro, MAE, Wilcoxon, carbon | PR-PUMA-Ch4-Results |
| 4. Conclusions | H1/H2 decision + future work (Smart PMO) | PR-PUMA-Ch5-Discussion |
| 5. Glossary | All definitions | Glossary-Master |
| 6. Bibliography | APA 7, β₯40 references | BIB-Master-APA7 |
| 7. Annexes | Templates, dataset prep, extended results | in project folders |
π¬ Experiment Design
| Stage | Task | Dataset | Metric | Status |
|---|---|---|---|---|
| 1 (MVP) π’ | Issue triage | Jira SR (200 stratified) | F1-macro β₯ 0.55 | π PEC2 |
| 2 π’ | Effort estimation | TAWOS | MAE β€ 3.0 SP | β³ PEC3 |
| 3 π‘ | Backlog prioritisation | TAWOS | Spearman β₯ 0.50 | β³ Conditional |
| 4 π΄ | RAG-enhanced triage | Jira SR | F1-macro > Stage 1 | β³ Optional |
| 5 π΄ | Smart PMO multi-agent | β | MTTD -30% | π Future work |
Prompting strategies: Zero-Shot Β· Few-Shot-3 Β· Few-Shot-6 Β· Chain-of-Thought
Models: Llama 3.2 8B Β· Mistral 7B Β· (Phi-3.5 Mini as fallback)
Reproducibility: seed=42, temperature=0, fixed requirements.txt
ποΈ Architecture
- SP-Architecture β 7-layer SwarmPM architecture
- SP-PUMA-Constitution β Non-negotiable principles
- SP-Triage-Agent β Triage agent spec
- BMAD-Agent-Roster β Multi-agent team
- BMAD-PRD-PUMA β Product requirements
π Key Literature
- LN-KeyPapers-CoGEE-Angermeir-Flyvbjerg β Core papers
- LN-Datasets-JiraSR-TAWOS β Datasets
- BIB-Master-APA7 β Full bibliography (42 refs)
Books β AI & Society:
- LN-Lawrence-2024-AtomicHuman β The Atomic Human: embodied intelligence, HITL theoretical basis
- LN-Suleiman-2023-ComingWave β The Coming Wave: AI governance and containment context
- LN-Shum-2025-PensarConPrompts β Pensar con Prompts: CO-STAR, prompt engineering taxonomy
Books β Agile & PM:
- LN-Beck-1999-XPExplained β XP Explained (2nd ed.): story points origin, TDD, adaptive development
- LN-Goldratt-2004-TheGoal β The Goal: Theory of Constraints; issue backlog as constraint system
Books β Business Systems (SmartPMO context):
- LN-Carpenter-2025-WorkTheSystem β Work the System: SOP documentation; systems mindset
- LN-Gerber-2009-EMythRevisited β The E-Myth Revisited: franchise prototype; working ON the business
- LN-Wickman-2012-Traction β Traction / EOS: execution operating system; Rocks; scorecard
- LN-Harnish-2022-ScalingUp β Scaling Up: Rockefeller Habits; Four Decisions framework
- LN-Price-2022-Frictionless β Frictionless Organization: CES; friction-free PM design
π§ Key Permanent Notes
PM & Experiment concepts:
- PN-IssueTriage-StoryPoints β F1-macro, MAE, priority schema
- PN-CoT-FewShot-Prompting β Prompting strategies (S1βS4)
- PN-LLM-Local-vs-Cloud β Why local inference
- PN-RAG-Embeddings-VectorDB β RAG for Stage 4
- PN-ToolSelection-PUMA β Tool selection rationale for PUMA
Agent patterns & AI science:
- PN-KeyConcepts-Agents-Reproducibility-RedTeam β Agents, Reproducibility, Uniqueness Trap, Red Teaming
- PN-MultiAgent-ArchitecturePatterns β Specialisation (β Smart PMO)
- PN-ReAct-AgentPattern β Stage 4 reasoning pattern
- PN-Agentic-Science-Paradigm β AI as active scientific agent
- PN-AI-Scientific-Knowledge-Generation β AI-generated scientific knowledge
- PN-PUMA-within-AgenticScience-Trajectory β PUMAβs place in the agentic science trajectory
- PN-ActiveReading-CognitivePractice β Active reading as cognitive practice
Research methods:
- PN-DSR-SLR-Methods β DSR + PRISMA
- PN-Wilcoxon-FINER-Cornell-PRISMA β Statistical protocol
Frameworks:
- PN-SDD-Framework β SDD + BDD + BMAD
- PN-RCOIF-Framework β Structured prompting
- PN-EGI-Framework β Exploratory guided interaction
- PN-AMI-DRCA-IIPR-Frameworks β AMI + DRCA + IIPR advanced prompting
- PN-MIT-Student-Method β MIT AI Lab active reading method
- PN-MIT-Student-Method-Complete β MIT AI Lab full Q1/Q2/Q3 + Keshav
- PN-PARA-GTD-Zettelkasten β PARA + GTD + Zettelkasten integration
Knowledge hub & Structure notes:
- ZK-Hub-PUMA β Full Zettelkasten index
- ST-Prompting-Strategies β Prompting strategies thematic cluster
- ST-Reproducibility-Cluster β Reproducibility crisis cluster
Sources & Persons:
- SRC-Keshav-2007-HowToReadPaper β Keshav 2007 Three-Pass paper
- SRC-MITAILab-WP316 β MIT AI Lab Working Paper 316
- PER-Keshav-Srinivasan β Three-Pass Method author
- PER-Flyvbjerg-Bent β Uniqueness Trap / Reference Class Forecasting
- PER-Yao-Shunyu β ReAct + Tree of Thoughts
- PER-Hong-Sirui-MetaGPT β MetaGPT multi-agent framework
- PER-Assalaarachchi-Nuwan β Agentic SPM vision
Results:
- RES-Results-Placeholders β Experiment results placeholders (Stage 1 & 2)
π Progress Dashboard
TABLE status AS "Status", deadline AS "Deadline", pec AS "PEC"
FROM "40 - Projects/PUMA"
WHERE type = "project-note"
SORT deadline ASCπ Linked MOCs
- MOC-Research-Pipeline β Research workflow
- MOC-Literature-Review β SLR state of the art
- MOC-LLM-Benchmarks-PM-AI β Benchmark landscape
- MOC-Methods-Frameworks β All methodologies
- MOC-Prompts-Library β Prompt templates
- MOC-Tools-Stack β Technology stack
MOC updated: April 2026 (PEC2)