Hiroshi Kajino scite author profile

Hiroshi Kajino

5Publications

116Citation Statements Received

11Citation Statements Given

How they've been cited

124

114

How they cite others

Affiliations

IBM Research - Tokyo, The University of Tokyo, IBM (United States)

Publications

Order By: Most citations

Convex Formulations of Learning from Crowds

Kajino

Kashima

2012

Transactions of the Japanese Society for Artificial Intelligenc

View full text Add to dashboard Cite

It has attracted considerable attention to use crowdsourcing services to collect a large amount of labeled data for machine learning, since crowdsourcing services allow one to ask the general public to label data at very low cost through the Internet. The use of crowdsourcing has introduced a new challenge in machine learning, that is, coping with low quality of crowd-generated data. There have been many recent attempts to address the quality problem of multiple labelers, however, there are two serious drawbacks in the existing approaches, that are, (i) non-convexity and (ii) task homogeneity. Most of the existing methods consider true labels as latent variables, which results in nonconvex optimization problems. Also, the existing models assume only single homogeneous tasks, while in realistic situations, clients can offer multiple tasks to crowds and crowd workers can work on different tasks in parallel. In this paper, we propose a convex optimization formulation of learning from crowds by introducing personal models of individual crowds without estimating true labels. We further extend the proposed model to multi-task learning based on the resemblance between the proposed formulation and that for an existing multi-task learning model. We also devise efficient iterative methods for solving the convex optimization problems by exploiting conditional independence structures in multiple classifiers.

show abstract

Learning from Crowds and Experts

Kajino

Tsuboi

Sato

et al. 2013

Transactions of the Japanese Society for Artificial Intelligenc

View full text Add to dashboard Cite

SummaryCrowdsourcing services are often used to collect a large amount of labeled data for machine learning. Although they provide us an easy way to get labels at very low cost in a short period, they have serious limitations. One of them is the variable quality of the crowd-generated data. There have been many attempts to increase the reliability of crowdgenerated data and the quality of classifiers obtained from such data. However, in these problem settings, relatively few researchers have tried using expert-generated data to achieve further improvements. In this paper, we apply three models that deal with the problem of learning from crowds to this problem: a latent class model, a personal classifier model, and a data-dependent error model. We evaluate these methods against two baseline methods on a real data set to demonstrate the effectiveness of combining crowd-generated data and expert-generated data.

show abstract

Cogra: Concept-Drift-Aware Stochastic Gradient Descent for Time-Series Forecasting

Miyaguchi

Kajino

2019

AAAI

View full text Add to dashboard Cite

We approach the time-series forecasting problem in the presence of concept drift by automatic learning rate tuning of stochastic gradient descent (SGD). The SGD-based approach is preferable to other concept drift algorithms in that it can be applied to any model and it can keep learning efficiently whilst predicting online. Among a number of SGD algorithms, the variance-based SGD (vSGD) can successfully handle concept drift by automatic learning rate tuning, which is reduced to an adaptive mean estimation problem. However, its performance is still limited because of its heuristic mean estimator. In this paper, we present a concept-drift-aware stochastic gradient descent (Cogra), equipped with more theoretically-sound mean estimator called sequential mean tracker (SMT). Our key contribution is that we define a goodness criterion for the mean estimators; SMT is designed to be optimal according to this criterion. As a result of comprehensive experiments, we find that (i) our SMT can estimate the mean better than vSGD's estimator in the presence of concept drift, and (ii) in terms of predictive performance, Cogra reduces the predictive loss by 16-67% for real-world datasets, indicating that SMT improves the prediction accuracy significantly.

show abstract

Preserving worker privacy in crowdsourcing

Kajino

Arai

Kashima

2014

Data Min Knowl Disc

View full text Add to dashboard Cite

Active Learning for Multi-relational Data Construction

Kajino

Kishimoto

Botea

et al. 2015

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Hiroshi Kajino

Convex Formulations of Learning from Crowds

Learning from Crowds and Experts

Cogra: Concept-Drift-Aware Stochastic Gradient Descent for Time-Series Forecasting

Preserving worker privacy in crowdsourcing

Active Learning for Multi-relational Data Construction

Contact Info

Product

Resources

About