“…In most cases, the starting (or fixed) learning rate lies in the range [0.01, 0.3] [21,22,25,34], while the final learning rate lies within [0.00013, 0.001] [19,21]. The number of epochs usually depends on the training data size and the available computational capacity, and ranges from 200 to 15,000 [19,21,22,24,34,35,42]. Momentum is typically set to 0.9 [22], although the optimal value may be task-specific [21,24,34].…”
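
As a rough illustration of how these hyperparameters fit together, the sketch below decays a learning rate from a start value to an end value over a fixed number of epochs and applies SGD with momentum to a toy one-dimensional quadratic. The exponential decay form, the function names (`exponential_lr`, `sgd_momentum_step`, `fit_quadratic`), the toy objective, and the specific values (start 0.1, end 0.001, 200 epochs) are assumptions chosen from within the quoted ranges; the excerpt does not say which schedule the cited works use.

```python
def exponential_lr(epoch, n_epochs, lr_start=0.1, lr_end=0.001):
    """Learning rate for a given epoch, decaying geometrically from
    lr_start (epoch 0) to lr_end (last epoch). The decay shape is an
    assumption; the quoted text only gives start and end ranges."""
    if n_epochs < 2:
        return lr_start
    frac = epoch / (n_epochs - 1)
    return lr_start * (lr_end / lr_start) ** frac


def sgd_momentum_step(w, grad, velocity, lr, momentum=0.9):
    """One classical (heavy-ball) momentum update; returns the new
    parameter and the new velocity."""
    velocity = momentum * velocity - lr * grad
    return w + velocity, velocity


def fit_quadratic(n_epochs=200, target=3.0):
    """Minimise the toy objective f(w) = (w - target)^2, using one
    gradient step per epoch (hypothetical example, not from the source)."""
    w, velocity = 0.0, 0.0
    for epoch in range(n_epochs):
        lr = exponential_lr(epoch, n_epochs)
        grad = 2.0 * (w - target)  # derivative of (w - target)^2
        w, velocity = sgd_momentum_step(w, grad, velocity, lr)
    return w


print(fit_quadratic())
```

In practice the start and end rates, the epoch count, and the momentum would be tuned per task, as the quoted passage notes; the values here only make the sketch concrete.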