PUMA Vault

Tag: self-attention

2 articles with this tag.

  • 01 May 2026

    Transformer Architecture and Mixture of Experts — Technical Reference for PUMA Models

    • permanent
    • transformer
    • attention
    • moe
    • mixture-of-experts
    • architecture
    • deepseek
    • mixtral
    • llama
    • scaling
    • puma-core
    • research
    • llm
    • models
    • technical-reference
    • self-attention
    • feed-forward
    • positional-encoding
  • 14 Apr 2026

    Attention Is All You Need

    • literature
    • transformer
    • attention-mechanism
    • self-attention
    • multi-head-attention
    • positional-encoding
    • encoder-decoder
    • nlp
    • foundational
    • puma-core
    • architecture
    • llm
    • gpt
    • bert
    • literature-note
    • moc
