Due to a drastic increase of the number of wireless communication devices, these devices are forced to interfere or interact with each other. This raises the issue of possible effects this coexistence might have on the performance of each of the networks. Negative effects are a consequence of contention for network resources (such as free wireless communication frequencies) between different devices. On the other hand, a possible cooperation between co-located networks could also improve certain aspects of networking for each one of them. This paper presents a self-learning, cognitive cooperation approach for heterogeneous co-located networks. Enabling cooperation is performed by activating or deactivating services that influence the interaction between wireless devices, such as an interference avoidance service, a packet sharing service, etc. Activation of a cooperative service might have both positive and negative effects on network's performance, regarding its high level goals. Such a cooperation approach has to incorporate a reasoning mechanism, centralized or distributed, able to determine the influence of each symbiotic service on the performance of all the participating sub-networks, taking into consideration their requirements. Coupled with the concept of enabling symbiotic services, a machine learning technique known as the Least Squares Policy Iteration (LSPI), is presented in this paper as a novel network cooperation paradigm.