With the emerging of the large-scale graph data, Pregel-like graph parallel processing systems have been an essential tool to efficiently process the graph data. The first step to use the Pregel-like systems is to partition the graph into multiple blocks and distribute them on multiple machines. The partition strategy plays a significant role in determining the performance because a good partition could both ensure load balance and optimize network communication overhead, and vice versa. However, existing partition strategies fail to meet the requirements because they suffer from the following drawbacks: (1) they ignore the application features and (2) they ignore the multi-application feature in productive environment. To overcome those drawbacks, we proposed the superblock partition strategy, which utilizes the atomic blocks generated by pre-processing of the original graph and could be constructed and re-constructed dynamically according to the submitted applications in real time. The hash-based and clustering-based pre-partition methods are covered in details. The application feature extraction method and heuristic superblock partition algorithm are proposed to construct the superblocks. Experimental results show that the superblock partition strategy could boost the graph processing performance and its partition efficiency also outperforms the hash-based and topology optimal partition strategy. e3849 6 1 of 7 1 [17-19]. The original graph in Figure 1(a) is partitioned into blocks with different partition strategies. The most used partition strategy in existing distributed system is hash based, shown in 1b, where only the hash value of one vertex is considered. The hash-based strategy ensures an even distribution of vertices, which assists to balance the computation load. The hash partition strategy randomly distributes the vertices and causes the neighboring vertices to reside in different blocks, and it may cause network traffic overhead. Some research work, such as [20], is dedicated to solve the problem and limit the inter-machine communication by assigning the topologically connected vertices to the same block. The partition strategy in Figure 1(c) concerns more about network optimization. The connection between each blocks is minimized, which reduces the potential network communication because the vertex sends messages along edges. Compared with Figure 1(c) and d ensures a better load balance because the vertex in each block is of the same number and the worker loading those blocks have the same number of vertices to compute, which is regarded as the topology optimal partition strategy.Although existing partition strategy has partially solved the problems of distributing vertices among multiple workers according to the hash value or topology information, there still exists the following two drawbacks.Firstly, existing partition strategies ignore the application features. State-of-the-art partition strategies focus mainly on hash value or topology information but ignore the application features, which me...