Object class detection

Zhang, Xin; Yang, Yee‐Hong; Zhao, Han; Wang, Hui; Gao, Chao

doi:10.1145/2522968.2522978

Cited by 99 publications

(60 citation statements)

References 148 publications


Section: Introduction (mentioning)

confidence: 99%

“…To address these two challenges, [3] argues that there are three aspects involved, namely modelling appearance, localization strategies and supervised classification. The focus of the researchers was trying to develop techniques and algorithms in those three aspects in order to improve the visual object recognition system performance.…”

Section: Introduction (mentioning)

confidence: 99%

“…The focus of the researchers was trying to develop techniques and algorithms in those three aspects in order to improve the visual object recognition system performance. Among these three aspects, modelling appearance is the most important aspect [3]. Appearance modelling is focused on the selection of features that can handle various types of intra-class variations and can capture the discriminative aspects of the different categories.…”

Section: Introduction (mentioning)

confidence: 99%

“…While the challenges belonging to the second group include very large objects of different categories, high-dimensional descriptors and difficulties in obtaining labelled training samples without any ambiguity etc. [3]. To address these two challenges, [3] argues that there are three aspects involved, namely modelling appearance, localization strategies and supervised classification. The focus of the researchers was trying to develop techniques and algorithms in those three aspects in order to improve the visual object recognition system performance.…”


Review of Local Descriptor in RGB-D Object Recognition

2014


The emergence of an RGB-D (Red-Green-Blue-Depth)…

Keywords: RGB-D images, local descriptor, object recognition, depth images

Introduction

Object recognition is an important problem in computer science that has attracted the interest of researchers in computer vision, machine learning and robotics [1]. The core of building object recognition systems is extracting meaningful representations (features) from high-dimensional observations such as images, videos and 3D point clouds [2]. Satisfactory results have been achieved using a variety of methods, applications and standard benchmark datasets. Nevertheless, recognizing everyday objects in a scene image is still an open problem. The major challenges in a visual object recognition system fall into two groups: those related to system robustness, and those related to computational complexity and scalability. The first group covers handling intra-class variations in appearance (objects of the same category that look different) and inter-class variations. Instances of the same object category can produce different images because of variations in illumination, object pose, camera viewpoint, partial occlusion and background clutter. The second group includes very large objects of different categories, high-dimensional descriptors and the difficulty of obtaining unambiguously labelled training samples [3].

To address these two challenges, [3] argues that three aspects are involved: modelling appearance, localization strategies and supervised classification. Researchers have focused on developing techniques and algorithms for these three aspects in order to improve the performance of visual object recognition systems. Among the three, modelling appearance is the most important [3]. Appearance modelling concerns the selection of features that can handle various types of intra-class variation and capture the discriminative aspects of the different categories. Furthermore, [4] stated that "the next step in the evolution of object recognition algorithm will require radical and bold steps forward in terms of the object representations, as well as the learning and inference algorithm used".

The emergence of relatively cheap RGB-D sensors (Microsoft Kinect, Asus Xtion and PrimeSense) promises to improve object recognition performance. The sensor provides a depth value for each pixel, so the image information is richer. An RGB-D sensor has an RGB camera and an infrared camera and projector, so it can capture colour images together with the depth of each pixel. These two factors are very helpful for the image processing field, which has always depended on the colour channels of the image [5], [6]. By using the depth channel for foreground segmentation or complementary information on


“…Thus, this can be used to categorize the scene accordingly. Object recognition in an image is the fundamental challenge [6] in computer vision. In this work, we [are] interested in recognizing scene categories using the image taken by normal rectilinear camera lens.…”

mentioning

confidence: 99%

A Performance Evaluation of Content-Based Image Retrieval for Scene Categorization

2017

Aug. 8-9, 2017, Singapore. RTET-17, CFBHS-17, IECMAE-17, AESTWM-17, LHHSS-17 & LEBCSR-17


Abstract: Content-based image retrieval remains a critical problem in computer vision. In this paper, we study the performance of various content-based image retrieval techniques for recognizing objects and scenes. We conduct a comparative survey of the state-of-the-art bag of words (BOW) framework against other methods to help researchers understand the options and select the most suitable technique. We carried out experiments with the BOW method on three publicly available datasets: Caltech 101, Caltech 256 and the 15-Scene Category dataset. We also evaluate the effect of the number of clusters on computational time and accuracy. In addition, we analyze the significance of the different feature extraction methods applied in BOW for its performance. In conclusion, we discuss several key potential research topics in content-based image retrieval.


Machine learning from crowds: A systematic review of its applications

Rodrigo; Aledo; Gámez

2018

WIREs Data Min & Knowl


Crowdsourcing opens the door to solving a wide variety of problems that previously were unfeasible in the field of machine learning, allowing us to obtain relatively low cost labeled data in a small amount of time. However, due to the uncertain quality of labelers, the data to deal with are sometimes unreliable, forcing practitioners to collect information redundantly, which poses new challenges in the field. Despite these difficulties, many applications of machine learning using crowdsourced data have recently been published that achieved state of the art results in relevant problems. We have analyzed these applications following a systematic methodology, classifying them into different fields of study, highlighting several of their characteristics and showing the recent interest in the use of crowdsourcing for machine learning. We also identify several exciting research lines based on the problems that remain unsolved to foster future research in this field.
