PUMA Vault

Tag: models

3 articles with this tag.

  • 01 May 2026

    RLHF and Constitutional AI — LLM Alignment Training Paradigms

    • permanent
    • rlhf
    • constitutional-ai
    • rlaif
    • ppo
    • reward-model
    • alignment
    • ai-safety
    • fine-tuning
    • sft
    • anthropic
    • openai
    • puma-core
    • research
    • training
    • ethics
    • llm
    • models
    • hitl
  • 01 May 2026

    Transformer Architecture and Mixture of Experts — Technical Reference for PUMA Models

    • permanent
    • transformer
    • attention
    • moe
    • mixture-of-experts
    • architecture
    • deepseek
    • mixtral
    • llama
    • scaling
    • puma-core
    • research
    • llm
    • models
    • technical-reference
    • self-attention
    • feed-forward
    • positional-encoding
  • 13 Apr 2026

    LLM Models Used in PUMA — Technical Reference

    • permanent
    • llm
    • models
    • llama
    • mistral
    • phi
    • gemma
    • deepseek
    • gpt4o
    • claude
    • qwen
    • moe
    • quantization
    • ollama
    • puma-core
    • research
    • benchmark
    • effort-estimation
    • issue-triage
    • agents
    • architecture
