“…The optimization is often trapped into local minima when FWI is performed from a very poor starting model, using the data without long offset, lowfrequency information. A number of solutions to mitigate such difficulties have been proposed, using multiscale strategies from low to high frequencies (Bunks et al, 1995), layer stripping from long to near offset (Bian and Yu, 2011), the modifications of the misfit functions based on cross-correlation (Luo and Schuster, 1991), deconvolution (Warner and Guasch, 2016) or optimal transport distance . These ideas can be explored easily within the framework of SMIwiz, but out of the scope of this work.…”