Program behavior analysis is the foundation of computer architecture research. Therefore it is vital to be able to extract the representative behaviors of programs in an efficient manner. Representative behaviors of programs are usually extracted through the SimPoint methodology. However, generating BBV (Basic Block Vector) profiles for SimPoint is usually quite slow. This paper evaluates the effectiveness of accelerating BBV profile generation with dynamic binary translation technique. First, A general framework for BBV profile generation using dynamic binary translation is presented. Then several optimization techniques and accuracy enhancements are proposed. Based on the framework and the optimizations, a highly efficient BBV profile generator, QPoint, is presented. The performance, overhead and accuracy of QPoint is evaluated using the SPEC2006 benchmark set. Experimental results show that the optimization method proposed can improve the performance by up to 147%, on average 56%. The speed of the optimized QPoint is up to 40x, and on average 10.5x compared with a functional simulation based BBV profile generator. The overhead incurred by BBV profile gathering is less than 4% which is the lowest among existing tools. The accuracy of QPoint is also validated against a functional simulation based tool.Compared with existing tools, the proposed QPoint tool has two main advantages. First, the performance of QPoint is tremendous, with a speed of up to 292 MIPS, on average 109 MIPS, on an ordinary PC. Second, QPoint supports most architectures, including x86/x86 64, ARM, POWER, SPARC, MIPS et al., and can be used to generate cross-platform BBV profiles.