Foundational Architecture
- LN-Vaswani-2017-AttentionIsAllYouNeed — Attention Is All You Need: Transformer architecture (NeurIPS 2017)
- LN-Fedus-2022-SwitchTransformers — Switch Transformers: Mixture-of-Experts sparse routing (JMLR 2022)
- LN-Wei-2022-ChainOfThought — Chain-of-Thought prompting: reasoning chains at scale (NeurIPS 2022)