To accept and process requests from a large number of IoT devices, the server of an IoT platform must run many threads. Improving server performance under constrained processor, memory, and other resources is difficult. In this paper, we propose a thread scheduling model that abstracts server resources and uses queues to handle resource contention, which avoids the frequent processor context switching that traditional thread scheduling incurs on the server. Combined with a thread weighting mechanism, the model enables the server to handle more requests from IoT devices with the same overhead. Finally, we carry out tests to validate the model.
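As a rough illustration of the general idea of queue-based, weighted scheduling, the following minimal sketch routes requests through a shared weighted queue that is drained by a small, fixed pool of worker threads, instead of opening one thread per request. It is not the model proposed in this paper: the names `WeightedTaskQueue` and `run_workers`, the integer weights, and the rule that higher weights run first are all assumptions made for the example.

```python
import heapq
import itertools
import threading


class WeightedTaskQueue:
    """Hypothetical thread-safe queue that hands out the highest-weight task first."""

    def __init__(self):
        self._heap = []
        self._lock = threading.Lock()
        self._counter = itertools.count()  # tie-breaker: equal weights stay FIFO

    def submit(self, weight, task):
        with self._lock:
            # heapq is a min-heap, so the weight is negated to pop the largest first.
            heapq.heappush(self._heap, (-weight, next(self._counter), task))

    def pop(self):
        """Return the next task, or None when the queue is empty."""
        with self._lock:
            if not self._heap:
                return None
            return heapq.heappop(self._heap)[2]


def run_workers(queue, num_workers):
    """Drain the queue with a fixed pool of threads; results are in completion order."""
    results = []
    results_lock = threading.Lock()

    def worker():
        while True:
            task = queue.pop()
            if task is None:  # nothing left to do, so this worker exits
                return
            value = task()
            with results_lock:
                results.append(value)

    threads = [threading.Thread(target=worker) for _ in range(num_workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return results


if __name__ == "__main__":
    q = WeightedTaskQueue()
    q.submit(1, lambda: "telemetry")
    q.submit(5, lambda: "alarm")
    q.submit(3, lambda: "config")
    print(run_workers(q, num_workers=1))
```

Because the number of threads is fixed by `num_workers` rather than by the number of pending requests, contention is absorbed by the queue. The weights only decide the order in which waiting requests are picked up.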