Shrinking a neural network's weights down to lower-precision numbers (float16 → int8 → int4) so the model runs on cheaper hardware with minimal quality loss. Essential for on-device LLMs.
"Quantized the 70B model down to 4-bit. Runs on my MacBook."