Adaptive equalization is an important aspect of communication systems in various environments. It is particularly important in underwater acoustic communication systems, as the channel has a long delay spread and is subject to the effects of timevarying multipath fading and Doppler spreading.The design of the adaptation algorithm has a profound influence on the performance of the system. In this thesis, we explore this aspect of the system. The emphasis of the work presented is on applying concepts from inference and decision theory and information theory to provide an approach to deriving and analyzing adaptation algorithms. Limited work has been done so far on rigorously devising adaptation algorithms to suit a particular situation, and the aim of this thesis is to concretize such efforts and possibly to provide a mathematical basis for expanding it to other applications.We derive an algorithm for the adaptation of the coefficients of an equalizer when the receiver has limited or no information about the transmitted symbols, which we term the Soft-Decision Directed Recursive Least Squares algorithm. We will demonstrate connections between the Expectation-Maximization (EM) algorithm and the Recursive Least Squares algorithm, and show how to derive a computationally efficient, purely recursive algorithm from the optimal EM algorithm.Then, we use our understanding of Markov processes to analyze the performance of the RLS algorithm in hard-decision directed mode, as well as of the Soft-Decision Directed RLS algorithm. We demonstrate scenarios in which the adaptation procedures fail catastrophically, and discuss why this happens. The lessons from the analysis guide us on the choice of models for the adaptation procedure. We then demonstrate how to use the algorithm derived in a practical system for underwater communication using turbo equalization. As the algorithm naturally incorporates soft information into the adaptation process, it becomes easy to fit it into a turbo equalization framework. We thus provide an instance of how to use the information 3 of a turbo equalizer in an adaptation procedure, which has not been very well explored in the past. Experimental data is used to prove the value of the algorithm in a practical context.