As more and more compute-intensive and delay-sensitive applications are deployed on smart mobile devices, mobile edge computing is considered an effective way to solve the limited computing ability of smart mobile devices (SMDs). At present, latency has become the most critical indicator of the quality of service (QoS), and more and more studies focus on this aspect. Unlike previous work, our work fully takes into account the limited storage and computing ability of edge servers. To effectively reduce the delay of SMDs and improve QoS, we propose a Delay Control Strategy Joint Service Caching and Task Offloading (DCS-OCTO) in a three-tier mobile edge computing (MEC) system consist of multiuser , multiedge server and remote cloud servers. Some of the key challenges include service heterogeneity, unknown system dynamics, spatial demand coupling, and decentralized coordination. In particular, a very compelling but rarely studied issue is the dynamic service caching in the three-tier MEC system. The DCS-OCTO strategy is proposed based on Lyapunov optimization and Gibbs sampling. It works online without requiring prior information and achieves provable near-optimal performance. Finally, simulation results show that the strategy effectively reduces the overall system delay while ensuring low energy consumption.