Time between sending a request and receiving a response. For LLMs this is typically 500 ms–15 s, depending on model size and output length; with streaming it splits into time-to-first-token (TTFT) and total generation time. In agentic loops, where many calls are chained, latencies compound, so every millisecond matters.
"Latency is the UX story of 2026. Streaming helps; reasoning models hurt."
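The TTFT-vs-total distinction above can be sketched in a few lines. The streaming source here is a simulated generator (a stand-in, not any real LLM API), with made-up delay values chosen only for illustration:

```python
import time

def fake_stream(n_tokens=5, first_delay=0.05, per_token=0.01):
    """Simulated streaming response; NOT a real LLM client.
    First token arrives after `first_delay`, later ones every `per_token`."""
    time.sleep(first_delay)
    yield "Hello"
    for _ in range(n_tokens - 1):
        time.sleep(per_token)
        yield " token"

start = time.monotonic()
ttft = None
tokens = []
for tok in fake_stream():
    if ttft is None:
        # Time to first token: what a streaming UI can show immediately.
        ttft = time.monotonic() - start
    tokens.append(tok)
# Total latency: what a non-streaming caller would wait for.
total = time.monotonic() - start

print(f"TTFT:  {ttft * 1000:.0f} ms")
print(f"Total: {total * 1000:.0f} ms")
```

Streaming improves perceived latency because the user sees output at TTFT rather than at total time; reasoning models hurt because hidden reasoning tokens push TTFT out without producing visible output.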