LN: Yao et al. (2023) — Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Bibliographic Reference

Citation: Yao, S., Yu, D., Zhao, J., et al. (2023). Tree of thoughts: Deliberate problem solving with large language models. arXiv:2305.10601. NeurIPS 2023. https://arxiv.org/abs/2305.10601


Pass 1 — Bird’s Eye View (5 Cs)

CAssessment
CategoryMethod proposal + empirical evaluation
ContextExtends Chain-of-Thought beyond linear sequences; inspired by BFS/DFS search
CorrectnessEvaluated on Game of 24, Creative Writing, Mini Crosswords. Strong results on 24 game (74% vs 4% CoT).
Contributions(1) Tree search over reasoning steps; (2) LLM evaluates intermediate steps; (3) BFS/DFS/beam search variants; (4) 74% on Game of 24 vs. 4% for standard CoT
ClarityExcellent. Clear algorithm descriptions.

Relevance: ⭐⭐⭐

ToT is relevant for PUMA’s complex backlog prioritisation (Stage 3) where multiple alternative orderings need to be evaluated. Too expensive for high-volume triage (Stage 1).


PUMA Connection

ToT could enhance Stage 3 (backlog prioritisation): the agent explores multiple orderings before committing to one. However, the computational cost (5–10× more tokens than CoT) makes it impractical for bulk triage. Relevant as a future-work option and for the prompting strategies background (Ch.2).

Connects to: PN-CoT-FewShot-Prompting

MOCs