Matto

Top-k Gating

Feb 03, 2026 · 1 min read

A gating approach for sparsely-gated Mixture of Experts (MoE) that routes each token to only the top-k experts.

“Top-k” refers to the k experts with the highest gating scores, as computed by the gating function.
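
To make the selection concrete, here is a minimal sketch in plain Python. The function name `top_k_gate` and the example scores are hypothetical. Renormalizing the selected scores with a softmax is an assumption made for illustration, since the note only says that the k highest-scoring experts are chosen.

```python
import math

def top_k_gate(scores, k):
    """Return sparse gate weights: softmax over the k highest scores, 0 elsewhere.

    `scores` holds one gating score per expert for a single token.
    The softmax renormalization is an assumption, not something the note specifies.
    """
    if not 1 <= k <= len(scores):
        raise ValueError("k must be between 1 and the number of experts")
    # Indices of the k largest scores (ties go to the lower expert index).
    top = sorted(range(len(scores)), key=lambda i: (-scores[i], i))[:k]
    # Softmax restricted to the selected experts; subtract the max for stability.
    m = max(scores[i] for i in top)
    exps = {i: math.exp(scores[i] - m) for i in top}
    total = sum(exps.values())
    # Experts outside the top-k get weight 0, so the token is never routed to them.
    return [exps[i] / total if i in exps else 0.0 for i in range(len(scores))]

# Four experts, route to the best two.
gates = top_k_gate([1.0, 3.0, 0.5, 3.0], k=2)
```

Only the experts with a nonzero gate need to be evaluated for this token, which is where the sparsity of the approach comes from.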

machine-learning
