πŸ“– MOC β€” Literature Review (SLR)

Overview

Systematic Literature Review for PUMA following PRISMA 2020 + PRISMA-DFLLM + PRISMA-trAIce protocols. Target: β‰₯ 40 papers Β· Period: 2022–2026 Β· SLR Workflow: WF-SLR-Pipeline


πŸ“Š PRISMA Status

StageCountProgress
Identified~[N]⏳
Screened (title/abstract)~[N]⏳
Eligible (full text)~[N]⏳
Includedβ‰₯40⏳

Full log: PRISMA-Log


πŸ—‚οΈ Papers by Topic

Issue Triage & Priority Classification

PaperYearDatasetMetricNotes
LN-Datasets-JiraSR-TAWOS2015Jira SRβ€”Dataset paper (Ortu et al.)
LN-Chen-2025-AIOpsLab2025AIOpsF1AIOps benchmark lab
LN-MAGIS-2024-GitHubIssues2024GitHubF1GitHub issue resolution via MAS
LN-Arora-2024-MASAI2024SWE-benchResolve%Modular agent for SE
LN-Wang-2024-OpenHands2024SWE-benchResolve%OpenHands coding agent
20 - Literature/20.1 Papers/LN-Manzoor2025-AI-PM2025MultipleVariousSurvey (note not yet created)
[Add papers here as SLR progresses]

Effort Estimation & Story Points

PaperYearDatasetBest MAENotes
LN-Datasets-JiraSR-TAWOS2022TAWOSβ€”Dataset paper (Tawosi et al.)
LN-KeyPapers-CoGEE-Angermeir-Flyvbjerg2024TAWOS~1.9 SPState of art (GPT-4) β€” CoGEE section
LN-Assalaarachchi-2026-AgenticSPM2026Variousβ€”Agentic SPM vision
LN-Cinkusz-2025-CognitiveAgentsAgilePM2025Variousβ€”Cognitive agents in Agile PM
LN-Li-2018-MultiProjectScheduling2018MRCPSPβ€”Multi-project scheduling
20 - Literature/20.1 Papers/LN-Yonathan2025-LocalLLMs2025TAWOS~3.2 SPLocal LLMs (note not yet created)
[Add papers here]

LLM Benchmarks in Software Engineering

PaperYearTasksReproducible?Notes
LN-Angermeir-2025-Reproducibility2025SE generalMeta-studyReproducibility gap
LN-Jimenez-2023-SWEbench2023SWE-benchPartialSE coding benchmark
LN-Mialon-2023-GAIA2023GAIAPartialGeneral AI assistant benchmark
LN-Hong-2023-MetaGPT2023HumanEval+YesMulti-role SE agents
20 - Literature/20.1 Papers/LN-Berti2024-PM-LLM-Benchmark2024Process miningPartialPM+LLM (note not yet created)
[Add papers here]

Prompting Strategies

PaperYearStrategyTaskKey finding
LN-Yao-2022-ReAct2022ReActReasoning+ActCombines reasoning with action
LN-Zelikman-2024-QuietSTaR2024Chain-of-thoughtReasoningImplicit CoT rationales
LN-Calikli-2025-RequestFormats2025MultipleEstimationNon-monotonic effect
20 - Literature/20.1 Papers/LN-Wei2022-CoT2022CoTReasoningCoT helps β‰₯100B models (note not yet created)
20 - Literature/20.1 Papers/LN-Brown2020-GPT3-FewShot2020Few-shotVariousICL discovery (note not yet created)
[Add papers here]

Research Methodology

PaperYearMethodUse in PUMA
LN-MITAILab-WP316-HowToDoResearch1988AI Lab methodActive reading + Q1/Q2/Q3
LN-Spichkova-2025-CognitiveAgents2025Cognitive agentsAgent design patterns
20 - Literature/20.1 Papers/LN-Hevner2004-DSR2004DSRParadigm (note not yet created)
20 - Literature/20.1 Papers/LN-Peffers2007-DSRM2007DSRMProcess (note not yet created)
20 - Literature/20.1 Papers/LN-Kitchenham2007-SLR2007SLRProtocol (note not yet created)
20 - Literature/20.1 Papers/LN-Page2021-PRISMA20202021PRISMAReporting (note not yet created)
20 - Literature/20.1 Papers/LN-Wohlin2012-Experimentation2012ExperimentsDesign (note not yet created)

πŸ” Research Gap Summary

Based on SLR, three gaps define PUMA’s contribution:

GapEvidencePUMA Response
Reproducibility5/18 artefacts executable (Angermeir et al., 2025)100% local, seed=42, MIT
Prompting comparisonNo systematic PM prompting study found4 strategies Γ— 2 models
Carbon footprint0/N PM+LLM papers measure COβ‚‚CodeCarbon per condition

πŸ“Œ Comparison Table (Extract)

TABLE authors, year, datasets_used, metrics, reproducible, puma_relevance
FROM "20 - Literature/20.1 Papers"
WHERE type = "literature-note" AND prisma_decision = "include"
SORT year DESC

WF-SLR-Pipeline | PRISMA-Log PR-PUMA-Ch2-Ch3-Ch4-Ch5