Matto

Backlinks

  • Mixture of Experts
  • Top-k Gating

Recent Notes

  • AGENTS

    Feb 05, 2026

    • CLAUDE

      Feb 05, 2026

      • About Me

        Feb 03, 2026

        Home

        ❯

        The thumb drive

        ❯

        Sparsely Gated Mixture of Experts

        Sparsely-Gated Mixture of Experts

        Feb 03, 20261 min read

        Mixture of experts, but with conditional computation (AKA don’t use all the experts).

        Using the output of the gate network, some common approaches are

        • Top-k gating
        • Noisy top-k gating

        • machine-learning

        Graph View

        Created with Quartz v4.5.2 © 2026