PUMA Vault
Search
Buscar
Modo oscuro
Modo claro
Explorador
Etiqueta: moe
3 artículos con esta etiqueta.
01 may 2026
Transformer Architecture and Mixture of Experts — Technical Reference for PUMA Models
permanent
transformer
attention
moe
mixture-of-experts
architecture
deepseek
mixtral
llama
scaling
puma-core
research
llm
models
technical-reference
self-attention
feed-forward
positional-encoding
14 abr 2026
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
literature
mixture-of-experts
moe
sparse-model
switch-transformer
scaling
deepseek
mixtral
transformer
architecture
efficiency
puma-core
literature-note
moc
13 abr 2026
LLM Models Used in PUMA — Technical Reference
permanent
llm
models
llama
mistral
phi
gemma
deepseek
gpt4o
claude
qwen
moe
quantization
ollama
puma-core
research
benchmark
effort-estimation
issue-triage
agents
architecture