An atomic unit of input data to LLMs. In today’s models, tokens are typically subwords (e.g. a short word, or a chunk of a longer word), but they can also be finer-grained, such as individual bytes or characters.

Tokens are produced as a sequence by passing input text through a tokenizer, which maps the text to token IDs from a fixed vocabulary.
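As a minimal sketch, here is the simplest possible byte-level tokenizer: each UTF-8 byte of the input becomes one token ID (the function names are illustrative, and real subword tokenizers additionally merge frequent byte sequences into larger units):

```python
def byte_tokenize(text: str) -> list[int]:
    """Map text to a sequence of byte-level token IDs (0-255)."""
    return list(text.encode("utf-8"))

def byte_detokenize(token_ids: list[int]) -> str:
    """Invert byte_tokenize: reassemble bytes and decode back to text."""
    return bytes(token_ids).decode("utf-8")

ids = byte_tokenize("Hi!")
print(ids)                    # [72, 105, 33]
print(byte_detokenize(ids))   # Hi!
```

A byte-level vocabulary is tiny (256 IDs) but produces long sequences; subword tokenizers trade a larger vocabulary for shorter sequences.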