Precise energy management in distribution power system requires highāprecision time synchronization among largeāscale deployed devices. Multiple clock sourcesābased time synchronization possesses advantages of reliability, high precision, and robustness, but still faces several challenges such as coupling between time synchronization error and delay, as well as different timescales between clock source and clock weight optimization. In this paper, a multiāclock source time synchronization model is constructed and a problem is formulated to minimize the synchronization error and delay through jointly optimizing largeātimescale clock source selection and smallātimescale weight selection. A reinforcement learningābased multiātimescale multiāclock source time synchronization algorithm named RLāM2 is proposed to solve the formulated problem from a learning perspective. Besides, a lossless switching method is proposed to address the switching problem for multiple clock sources. Simulation results demonstrate the superior performance of RLāM2 and the lossless switching method in time synchronization delay andĀ error.