Hippocampal place cells have well-known spatio-temporal properties: they generally respond to a single spatial location in a small environment, and they display theta phase precession, in which the phase of spiking relative to the theta wave shifts from late to early as the animal crosses the place field. Grid cells in layer II of the medial entorhinal cortex (MEC) have similar spatio-temporal properties, except that they respond to multiple spatial locations that form a hexagonal pattern. Because the entorhinal cortex (EC) is the upstream area that projects strongly to the hippocampus, a number of EC-hippocampus learning models have been proposed to explain how the spatial receptive field properties of place cells emerge via synaptic plasticity. However, how the phase precession properties of place cells and grid cells are related has remained unclear. This study shows how theta phase precession in hippocampal place cells can emerge from MEC input as a result of synaptic plasticity, demonstrating that a learning model based on non-negative sparse coding can account for both the spatial and temporal properties of hippocampal place cells. Although both MEC grid cells and other EC spatial cells contribute to the spatial properties of hippocampal place cells, it is the MEC grid cells that predominantly determine the temporal response properties observed here.