Cognitive radio networks are an emerging technology for wireless communication. As the number of wireless devices grows, spectrum becomes scarce, and the static allocation of channels in wireless networks leaves the spectrum underutilized. To use the spectrum efficiently, secondary users dynamically select free channels of primary users for packet transmission. In this work, routing performance in a cognitive radio network is improved through optimal channel selection, with the aim of maximizing throughput and reducing end-to-end delay. An Improved Q-Reinforcement learning algorithm is proposed for optimal channel selection during packet routing between source and destination. The proposed approach is simulated in Network Simulator 2 (NS2) with the Cognitive Radio Cognitive Network (CRCN) simulator and compared with existing routing protocols. The evaluation shows that the proposed work performs better than existing work with respect to packet delivery ratio, throughput, delay, jitter, control overhead, call blocking probability, packet dropping ratio, goodput, and normalized routing overhead.
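To illustrate the idea of learning-based channel selection, the following is a minimal sketch of stateless Q-learning with epsilon-greedy exploration. It is not the Improved Q-Reinforcement learning algorithm proposed in this work, but a textbook simplification. The function names, the per-channel idle probabilities, the 0/1 reward, and the values of `alpha` and `epsilon` are all assumptions chosen for illustration.

```python
import random


def select_channel(q_values, epsilon, rng):
    """Epsilon-greedy choice: explore a random channel with probability
    epsilon, otherwise exploit the channel with the highest Q-value."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda ch: q_values[ch])


def learn_channel_values(idle_prob, steps=5000, alpha=0.05, epsilon=0.1, seed=0):
    """Learn one Q-value per channel (hypothetical toy model).

    idle_prob[ch] is the assumed probability that the primary user leaves
    channel ch idle in a slot. The reward is 1.0 when the secondary user
    finds the chosen channel idle, and 0.0 otherwise.
    """
    rng = random.Random(seed)
    q_values = [0.0] * len(idle_prob)
    for _ in range(steps):
        ch = select_channel(q_values, epsilon, rng)
        reward = 1.0 if rng.random() < idle_prob[ch] else 0.0
        # Stateless Q-learning update: move Q toward the observed reward.
        # The discounted future-value term of full Q-learning is omitted
        # because this sketch models a single state.
        q_values[ch] += alpha * (reward - q_values[ch])
    return q_values


if __name__ == "__main__":
    q = learn_channel_values([0.1, 0.4, 0.95])
    best = max(range(len(q)), key=lambda ch: q[ch])
    print("learned Q-values:", [round(v, 2) for v in q])
    print("preferred channel:", best)
```

In this toy model each Q-value tracks the estimated chance of finding a channel idle, so a secondary user acting greedily tends to prefer the channel least occupied by primary users. A routing protocol would additionally need per-hop state and delay-aware rewards, which this sketch does not attempt.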