You can set the maximum number of threads that can be active at one time on the Performance configuration page. When this maximum is reached, the server holds new requests until a running request finishes and a thread becomes available.
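The queuing behavior described above can be illustrated with a small sketch. It does not show how the server itself is implemented; it uses Python's standard thread pool as a stand-in, and the names `MAX_ACTIVE_THREADS`, `run_requests`, and `handle_request` are hypothetical. Requests beyond the limit wait in the pool's queue until a worker thread is free.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor

# Hypothetical limit, standing in for the value set on the configuration page.
MAX_ACTIVE_THREADS = 2


def run_requests(request_count, max_threads=MAX_ACTIVE_THREADS):
    """Process request_count simulated requests with at most max_threads active.

    Returns the list of handled request ids and the peak number of
    requests that were being processed at the same moment.
    """
    lock = threading.Lock()
    state = {"active": 0, "peak": 0}

    def handle_request(request_id):
        # Record that one more request is being processed.
        with lock:
            state["active"] += 1
            state["peak"] = max(state["peak"], state["active"])
        time.sleep(0.02)  # simulated work
        with lock:
            state["active"] -= 1
        return request_id

    # The pool never runs more than max_threads workers; extra requests
    # are held in its internal queue until a worker finishes.
    with ThreadPoolExecutor(max_workers=max_threads) as pool:
        results = list(pool.map(handle_request, range(request_count)))
    return results, state["peak"]
```

Calling `run_requests(6)` handles all six simulated requests, but the reported peak never exceeds the configured limit of two.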
Persistent connections allow the server to accept multiple requests and send the responses over the same TCP/IP connection, rather than opening a new connection for each request.
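As a rough illustration of a persistent connection, the sketch below starts a throwaway HTTP/1.1 server on the local machine using Python's standard library (not the server this guide documents) and sends several requests through a single client connection. The names `PortEchoHandler` and `fetch_over_one_connection` are hypothetical. Each response body is the client port the server saw, so identical values indicate that the same TCP connection was reused.

```python
import http.client
import threading
from http.server import BaseHTTPRequestHandler, ThreadingHTTPServer


class PortEchoHandler(BaseHTTPRequestHandler):
    # HTTP/1.1 makes connections persistent by default in this handler.
    protocol_version = "HTTP/1.1"

    def do_GET(self):
        # Reply with the client's source port as seen by the server.
        body = str(self.client_address[1]).encode()
        self.send_response(200)
        self.send_header("Content-Type", "text/plain")
        # Content-Length is required so the client knows where the
        # response ends without the connection being closed.
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, format, *args):
        pass  # keep the example quiet


def fetch_over_one_connection(paths):
    """Request each path over one client connection; return the ports seen."""
    server = ThreadingHTTPServer(("127.0.0.1", 0), PortEchoHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    try:
        conn = http.client.HTTPConnection(
            "127.0.0.1", server.server_address[1], timeout=5
        )
        ports = []
        for path in paths:
            conn.request("GET", path)
            response = conn.getresponse()
            # The body must be read fully before the next request is sent.
            ports.append(response.read().decode())
        conn.close()
        return ports
    finally:
        server.shutdown()
        server.server_close()
```

If the connection were closed after each response, the client would reconnect from a new source port and the returned values would differ.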