These days, large-scale graph processing becomes more and more important. Pregel, inspired by Bulk Synchronous Parallel, is one of the highly used systems to process large-scale graph problems. In Pregel, each vertex executes a function and waits for a superstep to communicate its data to other vertices. Superstep is a very time-consuming operation, used by Pregel, to synchronize distributed computations in a cluster of computers. However, it may become a bottleneck when the number of communications increases in a graph with million vertices. Superstep works like a barrier in Pregel that increases the side effect of skew problem in distributed computing environment. ExPregel is a Pregel-like model that is designed to reduce the number of communication messages between two vertices resided on two different computational nodes. We have proven that ExPregel reduces the number of exchanged messages as well as the number of supersteps for all graph topologies. Enhancing parallelism in our new computational model is another important feature that manifolds the speed of graph analysis programs. More interestingly, ExPregel uses the same model of programming as Pregel. Our experiments on large-scale real-world graphs show that ExPregel can reduce network traffic as well as number of supersteps from 45% to 96%. Runtime speed up in the proposed model varies from 1.2× to 30×.shared-memory fashion [8,9]. This approach, to some extent, tries to overcome the problems of the previous approach. However, it has scalability and fault-tolerance problems. Adopting graphics processing units to accelerate various graph-processing tasks forms another approach [10,11]. Sampling approach was used in [12,13] to overcome the problem of scalability in massive data. They divided the input graph into various sub-graphs and then estimated the property of the main graph according to the properties of the smaller sub-graphs. One of the main problems of the sampling approach is the large difference between the real solution and the estimated one.In contrast to these approaches, distributed-memory approach uses a commodity of computers, and it is a general solution to scalability, performance, and availability problems. It can be particularly used to solve massive graph problems. In [14,15], distributed frameworks were used to shrink processing overheads among computational nodes. In particular, MapReduce has emerged as an enabling technology for big data processing [14,16,17]. While MapReduce simplifies implementation of large-scale data-processing systems, it does not naturally and efficiently support many important graph-processing algorithms and may lead to inefficient solutions.In 2010, Google proposed a computational model so-called Pregel [1] dedicated to large-scale graph processing. It is inspired by Valiant's Bulk Synchronous Parallel (BSP) model and facilitates implementing distributed graph algorithms. A program in Pregel consists of a sequence of iterations, called superstep. During a superstep, Pregel invokes a user-defined function ...