This paper presents an architecture-friendly k-means clustering algorithm referred to as SIVF for a large-scale and high-dimensional sparse data set. Algorithm efficiency on time is often measured by the number of costly operations such as similarity calculations. In practice, it depends greatly on how an algorithm adapts to an architecture of the computer system which the algorithm is executed on. Our proposed SIVF is carefully designed so as to operate at high speed and suppress memory usage on modern CPU architectures. It exploits a structured inverted-file for a mean set with an invariant centroid-pair based filter (ICP) to decrease the number of similarity calculations in an architecture-friendly manner. The structured inverted-file with the ICP effectively reduce instructions as well as similarity calculations, suppressing pipeline hazards that may cause pipeline stalls. We demonstrate in our experiments on real large-scale document data sets that SIVF operates at higher speed and with lower memory consumption than existing algorithms. Our performance analysis reveals that SIVF works at high speed by suppressing performance degradation factors of the number of instructions, cache misses, and branch mispredictionsrather than less similarity calculations.