Multiblock component methods are applied to data sets in which several blocks of variables are measured on the same set of observations, with the goal of analyzing the relationships between these blocks of variables. In this article, we focus on multiblock component methods that integrate the information found in several blocks of explanatory variables in order to describe and explain one set of dependent variables. In the following, multiblock PLS and multiblock redundancy analysis are chosen as particular cases of multiblock component methods in which one set of variables is explained by a set of predictor variables that is organized into blocks. Because these multiblock techniques assume that the observations come from a homogeneous population, they provide suboptimal results when the observations actually come from different populations. A strategy to palliate this problem, presented in this article, is to use a technique such as clusterwise regression in order to identify homogeneous clusters of observations. This approach creates two new methods that provide clusters that have their own sets of regression coefficients. This combination of clustering and regres-
Stéphanie Bougeard
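The clusterwise regression idea mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' method: it swaps multiblock PLS and multiblock redundancy analysis for ordinary least squares on a single block, and the function name `clusterwise_ols`, its parameters, and the synthetic data are all hypothetical. The sketch alternates between fitting one regression per cluster and reassigning each observation to the cluster whose model fits it best.

```python
import numpy as np

def clusterwise_ols(X, y, n_clusters=2, n_iter=50, seed=0):
    """Illustrative clusterwise regression using plain OLS (an assumption;
    the abstract's methods use multiblock PLS / redundancy analysis)."""
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    Xb = np.column_stack([np.ones(n), X])      # prepend an intercept column
    labels = rng.integers(n_clusters, size=n)  # random initial partition
    coefs = np.zeros((n_clusters, Xb.shape[1]))
    for _ in range(n_iter):
        # Step 1: fit one least-squares model per cluster
        for k in range(n_clusters):
            mask = labels == k
            if mask.sum() >= Xb.shape[1]:      # skip clusters that are too small
                coefs[k], *_ = np.linalg.lstsq(Xb[mask], y[mask], rcond=None)
        # Step 2: reassign each observation to its best-fitting model
        sq_resid = (y[:, None] - Xb @ coefs.T) ** 2
        new_labels = sq_resid.argmin(axis=1)
        if np.array_equal(new_labels, labels):
            break                              # partition is stable
        labels = new_labels
    return labels, coefs

# Synthetic data: two groups that follow different linear relationships
rng = np.random.default_rng(1)
X = rng.normal(size=(200, 1))
group = np.repeat([0, 1], 100)
y = np.where(group == 0, 2.0 * X[:, 0] + 1.0, -3.0 * X[:, 0] - 1.0)
y = y + 0.1 * rng.normal(size=200)
labels, coefs = clusterwise_ols(X, y, n_clusters=2)
```

Like other alternating schemes of this kind, the sketch can stop at a local optimum, so in practice several random starts would usually be compared.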
Abstract. We processed daily ocean-color satellite observations to construct a monthly
climatology of phytoplankton pigment concentrations in the Senegalo–Mauritanian region. Our proposed new method primarily consists of associating, in well-identified clusters, similar pixels in terms of
ocean-color parameters and in situ pigment concentrations taken from a
global ocean database. The association is carried out using a new self-organizing map (2S-SOM). Its major advantage is allowing the specificity of the optical properties of the water to be taken into account
by adding specific
weights to the different ocean-color parameters and the in situ
measurements. In the retrieval phase, the pigment concentration of a pixel
is estimated by taking the pigment concentration values associated with the
2S-SOM cluster whose ocean-color satellite spectral measurements are the closest, under a given distance measure, to those of the pixel under study.
The method was validated using a cross-validation procedure. We
focused our study on the fucoxanthin concentration, which is related to the
abundance of diatoms. We showed that fucoxanthin starts to develop in
December, reaches its maximum intensity in March, when the upwelling
intensity is at its maximum, extends as far as the coast of Guinea in April, and begins
to decrease in May. The results are in agreement with previous observations
and recent in situ measurements. The method is general and can be
applied to any oceanic region.
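The retrieval phase described in this abstract amounts to a nearest-cluster lookup, which the following minimal sketch illustrates. The abstract does not specify the distance or how the 2S-SOM weights enter it, so the weighted squared Euclidean distance used here is an assumption, and the function name `retrieve_pigment`, the arrays, and the numeric values are hypothetical illustration data, not results from the study.

```python
import numpy as np

def retrieve_pigment(pixel_spectrum, cluster_spectra, cluster_pigment, weights=None):
    """Return the pigment value attached to the cluster whose reference
    spectrum is closest to the pixel (weighted squared Euclidean distance,
    an assumed choice of metric)."""
    cluster_spectra = np.asarray(cluster_spectra, dtype=float)
    pixel_spectrum = np.asarray(pixel_spectrum, dtype=float)
    if weights is None:
        weights = np.ones(cluster_spectra.shape[1])  # equal weight per parameter
    # One distance per cluster: sum of weighted squared differences
    d2 = (weights * (cluster_spectra - pixel_spectrum) ** 2).sum(axis=1)
    best = int(d2.argmin())
    return cluster_pigment[best], best

# Three hypothetical clusters described by two ocean-color parameters each
cluster_spectra = [[0.0, 0.0], [1.0, 1.0], [5.0, 5.0]]
cluster_pigment = [0.1, 0.5, 2.0]   # made-up pigment value per cluster
value, cluster_id = retrieve_pigment([0.9, 1.2], cluster_spectra, cluster_pigment)
# The pixel is closest to the second cluster, so value == 0.5 and cluster_id == 1
```

Passing a non-uniform `weights` vector changes which parameters dominate the match, which is the role the abstract attributes to the parameter-specific weights.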