a b s t r a c tKernel functions in support vector machines (SVM) are needed to assess the similarity of input samples in order to classify these samples, for instance. Besides standard kernels such as Gaussian (i.e., radial basis function, RBF) or polynomial kernels, there are also specific kernels tailored to consider structure in the data for similarity assessment. In this paper, we will capture structure in data by means of probabilistic mixture density models, for example Gaussian mixtures in the case of real-valued input spaces. From the distance measures that are inherently contained in these models, e.g., Mahalanobis distances in the case of Gaussian mixtures, we derive a new kernel, the responsibility weighted Mahalanobis (RWM) kernel. Basically, this kernel emphasizes the influence of model components from which any two samples that are compared are assumed to originate (that is, the "responsible" model components). We will see that this kernel outperforms the RBF kernel and other kernels capturing structure in data (such as the LAP kernel in Laplacian SVM) in many applications where partially labeled data are available, i.e., for semi-supervised training of SVM. Other key advantages are that the RWM kernel can easily be used with standard SVM implementations and training algorithms such as sequential minimal optimization, and heuristics known for the parametrization of RBF kernels in a C-SVM can easily be transferred to this new kernel. Properties of the RWM kernel are demonstrated with 20 benchmark data sets and an increasing percentage of labeled samples in the training data.
Today, so-called "smart" or "intelligent" systems heavily rely on machine learning techniques to adjust their behavior by means of sample data (e.g., sensor observations). But, it will be more and more complicated or even impossible to provide those data at design-time of that system. As a consequence, these systems have to learn at run-time. Moreover, these systems will have to self-organize their learning processes. They have to decide which information or knowledge source they use at which time, depending on the quality of the information or knowledge they collect, the availability of these sources, the costs of gathering the information or knowledge, etc. With this article, we propose opportunistic collaborative interactive learning (O-CIL) as a new learning principle for future, even "smarter" systems. O-CIL will enable a "lifelong" or "neverending" learning of such systems in open-ended (i.e., time-variant) environments, based on active behavior and collaboration of such systems. Not only these systems collaborate, also humans collaborate either directly or indirectly by interacting with these systems. The article characterizes O-CIL, summarizes related work, sketches research challenges, and illustrates O-CIL with some preliminary results.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.