PUMA Vault
Search
Buscar
Modo oscuro
Modo claro
Explorador
Etiqueta: mixtral
2 artículos con esta etiqueta.
01 may 2026
Transformer Architecture and Mixture of Experts — Technical Reference for PUMA Models
permanent
transformer
attention
moe
mixture-of-experts
architecture
deepseek
mixtral
llama
scaling
puma-core
research
llm
models
technical-reference
self-attention
feed-forward
positional-encoding
14 abr 2026
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
literature
mixture-of-experts
moe
sparse-model
switch-transformer
scaling
deepseek
mixtral
transformer
architecture
efficiency
puma-core
literature-note
moc