A novel noncontact vision sensor for simultaneous measurement of structural displacements at multiple points using one camera is developed based on two advanced template matching techniques: the upsampled cross correlation (UCC) and the orientation code matching (OCM). While existing studies on vision sensors are mostly focused on the time-domain performance evaluation, this study investigates the performance in both time and frequency domains through a shaking table test of a three-story frame structure, in which the displacements at all the floors are measured by using one camera to track either high-contrast artificial targets or low-contrast natural targets on the structural surface such as bolts and nuts. Excellent agreements are observed between the displacements measured by the single camera and those measured by high-performance laser displacement sensors. The results of structural modal analysis based on the measurements by the vision sensor and reference accelerometers also agree well. Moreover, the identified modal parameters are used to update the finite element model of the structure, demonstrating the potential of the vision sensor for structural health monitoring applications. This study further examines the robustness of the proposed vision sensor against ill environmental conditions such as dim light, background image disturbance, and partial template occlusion, which is important for future implementation in the field. Significant advantages of the proposed vision sensor include its low cost (a single camera to remotely measure structural displacements at multiple points without installing artificial targets) and flexibility to extract structural displacements at any point from a single measurement.