Nowadays, eye gaze tracking of real-world people to acknowledge what they are seeing in a particular page is trending; but its complexity is also growing fast, and the accuracy is not enough. In the proposed system, the image patch of the eye region is extracted from the input image using the Viola Jones algorithm for facial feature detection. Then SqueezeNet and U-Net are combined to train the model for pixel classification of iris and pupil from the eye image patch with a training dataset that contains manually labelled iris and pupil region. After extracting the iris and pupil features, the eye gaze tracking is formulated by using 2D pupil center extracted by applying Mean-Shift algorithm and 3D eyeball center. The system achieved an accuracy of 99.93% which is best comparable to the state-of-the-art methods.