πŸ” arXiv β€” Frontier Scan for PUMA Topics

Tool: arXiv (https://arxiv.org) β€” Categories: cs.SE, cs.AI, cs.MA Phase: Phase 1 β€” Research Step: 01 β€” Literature Exploration


Prompt A β€” Core AI Agents + PM

("AI agents" OR "multi-agent systems" OR "agentic workflow") AND ("project management" OR "PMO" OR "software engineering")

arXiv search tip: Set date range to 2023-01-01 β†’ 2026-12-31, category cs.SE OR cs.AI

Prompt B β€” Benchmark + Reproducibility

(benchmark OR evaluation OR dataset OR reproducible) AND ("LLM agent" OR "multi-agent") AND software

Prompt C β€” Governance + Safety

("human-in-the-loop" OR governance OR observability OR traceability) AND ("LLM agent" OR "agentic system" OR "multi-agent")

Prompt D β€” AIOps + DevOps

(AIOps OR "DevOps" OR "SRE" OR "cloud operations") AND ("LLM" OR "language model" OR agent) AND (automation OR monitoring OR incident)

Usage Notes

arXiv is the primary source for the most recent (2024–2026) preprints. Papers here often appear 6–12 months before journal publication. Always verify arXiv DOIs via Semantic Scholar before final citation.


PUMA Relevance

Key PUMA papers (e.g., Assalaarachchi 2026, Cinkusz 2025, DynTaskMAS 2025) originated as arXiv preprints. This query family ensures PUMA captures frontier work not yet indexed in IEEE/ACM. Feed results to PRISMA-Log.


MOCs