PUMA Vault
Search
Buscar
Modo oscuro
Modo claro
Explorador
Etiqueta: reinforcement-learning
8 artículos con esta etiqueta.
16 abr 2026
📚 Bibliography: Spec-Driven Development (SDD) and Agentic Software Engineering
bibliography
apa7
references
supplement
verified
academic-writing
agentscope
agile
aiops
aiopslabs
architecture
autogen
automation
benchmark
chatdev
citation
devops
gaia
github
langgraph
llm
mas
masai
mcp
memgpt
metagpt
multi-agent
openhands
orchestration
planning
project-management
protocol
react
reasoning
reasoning-action
reinforcement-learning
research
root-cause-analysis
scheduling
security
swe-bench
tool-use
tree-of-thoughts
workflow
sdd
spec-driven-development
07 abr 2026
Literature Note — Magnetic control of tokamak plasmas through deep reinforcement learning
literature-note
reinforcement-learning
plasma
fusion
ai-discovery
deepmind
puma
pec2
academic-writing
ai-science
alphafold
bibliography
chain-of-thought
citation
cot
issue-triage
metrics
moc
pec
plasma-physics
precision-recall
protein-folding
reasoning
research
triage
07 abr 2026
AI systems can generate new scientific knowledge, but only within human-defined research frameworks
permanent-note
ai-science
knowledge-generation
puma
agentic-science
pec2
academic-writing
accuracy
ai-scientist
alphafold
anthropic
claude
code-review
critical-thinking
gemini
github
gnome
google
gpt
graphcast
hypothesis
llm
materials-science
metrics
moc
multi-agent
notebooklm
openai
pec
pipeline
plasma-physics
project-management
protein-folding
red-teaming
reinforcement-learning
research
research-methodology
research-tools
scientific-knowledge
smart-pmo
weather-prediction
07 abr 2026
Bibliography Supplement — PEC2: AI and New Scientific Knowledge
bibliography
pec2
ai-science
agentic-science
apa7
verified
puma
academic-writing
ai-scientist
alphafold
automation
citation
gemini
gnome
google
gpt
graphcast
index
literature-review
materials-science
openai
pec
plasma-physics
protein-folding
quantitative-research
reasoning
reinforcement-learning
research
scientific-knowledge
slr
weather-prediction
07 abr 2026
MOC — AI and New Scientific Knowledge Generation
moc
ai-science
agentic-science
knowledge-generation
pec2
puma
ai-scientist
alphafold
anthropic
automation
bibliography
citation
claude
critical-thinking
effort-estimation
gemini
gnome
google
gpt
graphcast
issue-triage
jira
llm
materials-science
notebooklm
openai
orchestration
pec
pipeline
plasma-physics
project-management
protein-folding
red-teaming
reinforcement-learning
research-tools
scientific-knowledge
smart-pmo
story-points
triage
weather-prediction
zettelkasten
06 abr 2026
A Multi-Agent Reinforcement Learning Scheduling Algorithm Integrating State Graph and Task Graph Structural Modeling for Ride-Sharing Dispatching
literature
scheduling
multi-agent
reinforcement-learning
state-graph
task-graph
academic-writing
backlog
bibliography
citation
dataset
graph-rag
keshav
knowledge-graph
literature-note
llm
mas
moc
neural-network
project-management
reading-method
research
smart-pmo
sprint
06 abr 2026
📖 Glossary Supplement v2 — Extended Technical Terms
glossary
reference
definitions
supplement
academic-writing
accuracy
agentscope
ai-tools
aiops
ami
anthropic
api
architecture
auc
autogen
backlog
baseline
benchmark
bias
chain-of-thought
claude
cot
crewai
data-formats
dataset
devops
drca
effect-size
effort-estimation
egi
embeddings
ethics
evaluation
few-shot
fine-tuning
github
gpt
human-in-the-loop
hypothesis
ict
iipr
issue-triage
jira
json
keshav
langchain
langgraph
literature-review
llama
llm
lm-studio
local-llm
mas
memory
meta
metrics
mistral
mit-ai-lab
multi-agent
nlp
non-parametric
ollama
one-shot
openai
orchestration
perplexity
pipeline
planning
precision-recall
project-management
prompting
python
rag
rcoif
react
reading-method
reasoning
reasoning-action
reinforcement-learning
research
research-methodology
rest-api
retrieval
security
slr
software-engineering
sprint
statistics
story-points
supervised-learning
swarm-intelligence
swe-bench
tawos
tool-use
transformer
tree-of-thoughts
triage
validity
vector-db
wilcoxon
wp316
zero-shot
06 abr 2026
📚 Bibliography Supplement v3 — Verified New References
bibliography
apa7
references
supplement
verified
academic-writing
agentscope
agile
aiops
aiopslabs
architecture
autogen
automation
benchmark
chatdev
citation
devops
gaia
github
langgraph
llm
mas
masai
mcp
memgpt
metagpt
multi-agent
openhands
orchestration
planning
project-management
protocol
react
reasoning
reasoning-action
reinforcement-learning
research
root-cause-analysis
scheduling
security
swe-bench
tool-use
tree-of-thoughts
workflow