π arXiv β Frontier Scan for PUMA Topics
Tool: arXiv (https://arxiv.org) β Categories: cs.SE, cs.AI, cs.MA Phase: Phase 1 β Research Step: 01 β Literature Exploration
Prompt A β Core AI Agents + PM
("AI agents" OR "multi-agent systems" OR "agentic workflow") AND ("project management" OR "PMO" OR "software engineering")
arXiv search tip: Set date range to 2023-01-01 β 2026-12-31, category cs.SE OR cs.AI
Prompt B β Benchmark + Reproducibility
(benchmark OR evaluation OR dataset OR reproducible) AND ("LLM agent" OR "multi-agent") AND software
Prompt C β Governance + Safety
("human-in-the-loop" OR governance OR observability OR traceability) AND ("LLM agent" OR "agentic system" OR "multi-agent")
Prompt D β AIOps + DevOps
(AIOps OR "DevOps" OR "SRE" OR "cloud operations") AND ("LLM" OR "language model" OR agent) AND (automation OR monitoring OR incident)
Usage Notes
arXiv is the primary source for the most recent (2024β2026) preprints. Papers here often appear 6β12 months before journal publication. Always verify arXiv DOIs via Semantic Scholar before final citation.
PUMA Relevance
Key PUMA papers (e.g., Assalaarachchi 2026, Cinkusz 2025, DynTaskMAS 2025) originated as arXiv preprints. This query family ensures PUMA captures frontier work not yet indexed in IEEE/ACM. Feed results to PRISMA-Log.