“…However, floatingpoint arithmetic, especially floating-point divide and square root, are difficult to design, and often the critical, performance limiting factors. In particularly, applications that require floating-point divide and square root include transcranial magnetic stimulation [Cret et al 2007], Molecular Dynamics (MD) simulations [Govindu et al 2005], Monte Carlo radiative heat transfer simulation · 16: 31 [Gokhale et al 2004], sparse matrix Jacobi solver , QR decomposition [Wang and Leeser 2007a], smoothed particle hydrodynamics method [Lienhart et al 2002], and gravity calculation for N-body simulation [Lienhart et al 2006], radiation dose calculation [Whitton et al 2006], optical flow algorithms for image stabilization [Etiemble et al 2005], and Least Mean Squares (LMS) and Maximum Likelihood (ML) for a space systems [Poznanovic 2004]. As a result, a floating-point library including floating-point divide and square root is very desirable.…”