The cost of training today’s large-scale foundation models is often reduced to a single number: the price of a GPU hour. It's ...