Abstract. A fusion method is proposed to keep a correct number of humans from all humans detected by the robot operating system based Perception Sensor Network (PSN) which includes multiple partially overlapped Field of View (FOV) Kinects. To this end, the fusion rules are based on the parallel and orthogonal congurations of Kinects in PSN system. For the parallel conguration, the system will decide whether the detected humans staying in FOV of single Kinect or in overlapped FOV of multiple Kinects by evaluating the angles formed between their locations and Kinect original point on top view (x, z plane) of 3D coordination. Then, basing on the angles, the PSN system will keep the person stay in only one FOV or keep the one with biggest ROI if they stay in overlapped FOV of Kinects. In the case of Kinects with orthogonal conguration, 3D Euclidian distances between detected humans are used to determine the group of humans supported to be same human but detected by dierent Kinects. Then the system, keep the human with a bigger Region of Interest (ROI) among this group. The experimental results demonstrate the outperforming of the proposed method in various scenarios.