Pedestrian detection is an important but challenging component of an Intelligent Transportation System. In this paper, we describe a pedestrian detection system based on a monocular vision with a Far-Infrared camera (FIR). We propose an original feature representation, called Intensity Self Similarity (ISS) , adapted to pedestrian detection in FIR images. The ISS representation is based on the relative intensity self similarity within a pedestrian region of interest (ROI) hypothesis. Our system consists of two components. The first component generates pedestrian ROI hypothesis by exploiting the specific characteristics of FIR images, where pedestrian shapes may vary in large scale, but heads appear usually as light regions. Pedestrian ROI are detected, with high recall rate, due to a Hierarchical Codebook (HC) of Speeded-Up Robust Features (SURF) located in light head regions. The second component consists of pedestrian hypothesis validation, by using a pedestrian full-body classification based on the ISS representation, with Support Vector Machine (SVM). For classification, we retained two feature descriptors: the Histogram of Oriented Gradients (HOG) descriptor and the original ISS feature representation that we proposed for FIR images. The early fusion of these two features enhances significantly the system precision, attaining an F-measure for the pedestrian class of 97.7%. Moreover, this feature fusion outperforms the state-of-the-art SURF descriptor proposed previously. The experimental evaluation shows that our pedestrian detector is also robust, since it performs well in detecting pedestrians even in large scale and crowded real-world scenes.