Delay in communication wires causes design iterations in system-on-chip. Latency-insensitive design copes with this issue by encapsulating each core in a shell wrapper and inserting buffers in the wires to separate the design of core from that of communication wires. Scheduling-based latency-insensitive protocol is a methodology which employs shift registers for periodic clock gating of blocks instead of the shell wrappers. In many cases, the bit sequences inside the shift registers are too long and therefore consume a large area. This study presents a heuristic algorithm that optimises the bit sequences and produces them with shorter lengths compared with the existing method. The algorithm steps are described and the accuracy is validated through several synthetic benchmarks as well as real systems. Simulation results show an average 44.7% reduction on the shift registers area and the synthetic analysis in the authors proposed approach show an average 12.8% reduction on the total system area compared with the existing method.