This paper presents a new approach to finding optimal patterns for the placement of fields of sonobuoys in a complex undersea environment. We model the problem as a biobjective one, where the aim is to minimise both sensor placement time and uncertainty over target localisation. Both objectives may be important in time-critical localisation scenarios and our approach allows an operator to choose between different optimal solutions, favouring lower placement time or lower localisation uncertainty as operational circumstances require. We develop a two-phase algorithm, where an offline multiobjective evolutionary phase finds initial Pareto-non-dominated solutions to a static problem and then an online multiobjective reinforcement learning phase finds improved solutions using updated information. We find that the evolutionary algorithm improves significantly on standard grid patterns and that the reinforcement learning algorithm improves further on the evolutionary phase. The number of sonobuoys required may also be reduced.This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.