Matto

Backlinks

  • Mixture of Experts
  • Top-k Gating

      Sparsely-Gated Mixture of Experts

      Dec 23, 2025

      Mixture of experts, but with conditional computation: for each input, only a small subset of the experts is evaluated instead of all of them.

      The gate network's output decides which experts to run. Some common approaches are

      • Top-k gating
      • Noisy top-k gating
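
To make the two approaches concrete, here is a minimal NumPy sketch, not a reference implementation. The function names, `w_gate`, `w_noise`, and `rng` are hypothetical names chosen for illustration. The noisy variant assumes the commonly cited formulation: Gaussian noise, scaled by a softplus of a learned projection, is added to the gate logits before the top-k selection.

```python
import numpy as np

def top_k_gating(logits, k):
    """Softmax over the k largest logits per row; every other expert gets weight 0."""
    logits = np.asarray(logits, dtype=float)
    # Indices of the k largest logits in each row (order inside the top k is arbitrary).
    top_idx = np.argpartition(logits, -k, axis=-1)[..., -k:]
    # Start from -inf everywhere, then copy back only the top-k logits.
    masked = np.full_like(logits, -np.inf)
    top_vals = np.take_along_axis(logits, top_idx, axis=-1)
    np.put_along_axis(masked, top_idx, top_vals, axis=-1)
    # Numerically stable softmax; exp(-inf) = 0 zeroes out the unselected experts.
    masked -= masked.max(axis=-1, keepdims=True)
    weights = np.exp(masked)
    return weights / weights.sum(axis=-1, keepdims=True)

def noisy_top_k_gating(x, w_gate, w_noise, k, rng):
    """Top-k gating on logits perturbed by input-dependent Gaussian noise (assumed form)."""
    clean_logits = x @ w_gate
    # softplus(z) = log(1 + exp(z)) keeps the per-expert noise scale positive.
    noise_scale = np.logaddexp(0.0, x @ w_noise)
    noisy_logits = clean_logits + rng.standard_normal(clean_logits.shape) * noise_scale
    return top_k_gating(noisy_logits, k)

# One input, four experts, keep the best two.
gates = top_k_gating([[2.0, 1.0, 0.1, -1.0]], k=2)
print(gates.round(3))  # only experts 0 and 1 receive non-zero weight
```

Only the experts with a non-zero gate value need to be evaluated, which is where the compute saving comes from. In training, the noise in the second variant is usually there to spread load across experts; at inference it is often switched off, though that choice is an assumption here rather than something this note states.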

      • machine-learning
