Throughput (aka. bandwidth)

Throughput is a measure of the amount of work done, or the amount of data processed, in a given amount of time. It is also known as bandwidth and is typically expressed in terms of requests per second (RPS) or transactions per second (TPS).

As a quality attribute of distributed software, trade-offs may be made with latency. For example, to increase throughput you might [batch] requests together, process data in [parallel], or [spread load] across multiple servers. But these strategies can have the effect of increasing the time individual requests spend waiting in queues, thereby increasing latency.

latency vs throughput