In this paper, a novel time of arrival (TOA) estimation method is proposed based on an iterative cleaning process to extract the first path signal. The purpose is to address the challenge in dense multipath indoor environments that the power of the first path component is normally smaller than other multipath components, where the traditional matchfiltering (MF) based TOA estimator causes huge errors. Along with parameter estimation, the proposed process is trying to detect and extract the first path component by eliminating the strongest multipath component using a band-elimination filter in fractional Fourier Domain (FrFD) at each iterative procedure. To further improve the stability, a slack threshold and a strict threshold are introduced. Six simple and easily calculated termination criteria are proposed to monitor the iterative process. When the iterative 'cleaning' process is done, the outputs include the enhanced first path component and its estimated parameters. Based on these outputs, an optimal reference signal for the matchfiltering (MF) estimator can be constructed, and a more accurate TOA estimation can be conveniently obtained. The results from numerical simulations and experimental investigations verified that, for acoustic chirp signal TOA estimation, the accuracy of the proposed method is superior to those obtained by the conventional MF estimators.