Proxy caching is an effective technique for improving quality of service (QoS) over the Internet. However, existing cache replacement methods cannot support it effectively when the number of requests grows rapidly and without bound. This paper proposes a new caching architecture model, the Web Usage Pattern-Based Caching Architecture (WUPCA). WUPCA implements a behavior-based learning mechanism that applies recommender-system concepts to the browsing process. This learning mechanism groups caches so that they better exploit browsing characteristics and improve the performance of Internet services. Experiments indicate that the proposed technique substantially outperforms the traditional approach on quantitative metrics such as hit rate, byte hit rate, and average response time of accessed websites. For example, WUPCA improves the hit rate and the average response time by approximately 30% and increases the byte hit rate by more than 52%.
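
For reference, the three metrics named above are conventionally computed as the fraction of requests served from the cache (hit rate), the fraction of requested bytes served from the cache (byte hit rate), and the mean latency over all requests (average response time). The following is a minimal sketch under those standard definitions. The `Request` record, the `cache_metrics` helper, and the sample log are hypothetical names and values chosen for illustration; they are not taken from WUPCA or its experiments.

```python
from typing import Iterable, NamedTuple, Tuple


class Request(NamedTuple):
    """One proxy log entry (hypothetical layout, for illustration only)."""
    url: str
    size_bytes: int     # size of the requested object
    hit: bool           # True if the proxy served it from cache
    response_ms: float  # observed response time in milliseconds


def cache_metrics(log: Iterable[Request]) -> Tuple[float, float, float]:
    """Return (hit_rate, byte_hit_rate, avg_response_ms) for a request log."""
    entries = list(log)
    if not entries:
        raise ValueError("request log is empty")

    total_bytes = sum(r.size_bytes for r in entries)
    hit_count = sum(1 for r in entries if r.hit)
    hit_bytes = sum(r.size_bytes for r in entries if r.hit)

    hit_rate = hit_count / len(entries)
    # Guard against a log that contains only zero-byte objects.
    byte_hit_rate = hit_bytes / total_bytes if total_bytes else 0.0
    avg_response_ms = sum(r.response_ms for r in entries) / len(entries)
    return hit_rate, byte_hit_rate, avg_response_ms


# Made-up sample: two small objects hit the cache, two large ones miss.
sample_log = [
    Request("http://example.org/a", 100, True, 10.0),
    Request("http://example.org/b", 300, False, 90.0),
    Request("http://example.org/a", 100, True, 10.0),
    Request("http://example.org/c", 500, False, 90.0),
]

hit_rate, byte_hit_rate, avg_ms = cache_metrics(sample_log)
print(f"hit rate={hit_rate:.2f}, byte hit rate={byte_hit_rate:.2f}, avg={avg_ms:.1f} ms")
```

In this made-up log, half of the requests hit the cache, but only a fifth of the bytes do, because the hits are small objects. This is why hit rate and byte hit rate are reported separately and can change by different amounts.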