Batch size has a significant impact on both latency and cost in AI model training and inference. Estimating inference time ...