SummaryWeb caching and replication are efficient techniques for reducing Web traffic, user access latency, and server load. In this paper, we first propose an improved GreedyDual* (GD*) cache replacement algorithm, which considers update frequency as a factor in its utility function. Second, we present a group-based method for dynamically selecting distribution strategies for web documents according to access patterns. The documents are divided into groups according to access patterns and the documents in each group are assigned to the same distribution strategy. Our group-based model combines performance metrics with the different weights assigned to each of them. Finally, we use both trace data and statistical data to simulate our methods. The experimental results show that our improved GD*algorithm can outperform the existing GD* algorithm over the performance metrics considered, and our group-based method for document distribution strategy selection can improve several performance metrics, while keeping others almost the same.