Convolutional spiking neural network model for robust face detection

Matsugu, Masakazu; Mori, Katsuhiko; Ishii, M.; Mitarai, Yusuke

doi:10.1109/iconip.2002.1198140

Cited by 34 publications

(26 citation statements)

References 12 publications


“…CoNN (LeCun and Bengio, 1995) as well as Neocognitrons (Fukushima, 1980) have been used for face detection (Matsugu et al, 2002; Osadchy et al, 2004) and recognition (Lawrence et al, 1995). Proposed architecture in Figure 1 comes with the property of robustness in object recognition such as translation and deformation invariance as in well-known neocognitrons, which also have similar architecture.…”

Section: Modified Convolutional Neural Network (MCoNN); type: mentioning

confidence: 99%

“…First, it has only FD modules in the bottom and top layers. The intermediate features detected in FD2 constitute a set of figural alphabets (Matsugu et al, 2002; Matsugu & Cardon, 2004). Local features in FD1 are used as bases of figural alphabets, which are used for eye or mouth detection.…”

Section: Modified Convolutional Neural Network (MCoNN); type: mentioning

confidence: 99%

“…The training proceeds as follows. As in (Matsugu et al, 2002), training of the MCoNN is performed module by module using fragment images as positive data extracted from publicly available database (e.g., Softpia Japan) of more than 100 persons. Other irrelevant fragment images extracted from background images are used as negative samples.…”

Section: Modified Convolutional Neural Network (MCoNN); type: mentioning

confidence: 99%

“…In this chapter, inspired by cortical processing, we will address the problem of efficient selection and economical use of visual features for face recognition (FR) as well as facial expression recognition (FER). We demonstrate that by training our previously proposed (Matsugu et al, 2002) hierarchical neural network architecture (modified convolutional neural networks: MCoNN) for face detection (FD), higher order visual function such as FR and FER can be organized for shared use of such local features. The MCoNN is different from those previously proposed networks in that training is done layer by layer for intermediate as well as global features with resulting receptive field size of neurons being larger for higher layers.…”

Section: Introduction; type: mentioning

confidence: 99%


Selection and Efficient Use of Local Features for Face and Facial Expression Recognition in a Cortical Architecture

Matsugu, 2007

Face Recognition



“…Convolutional neural networks with a hierarchical structure, which imitate the vision nerve system in the brain, have such functions [1][2][3].…”

Section: Introduction; type: mentioning

confidence: 99%

A Convolutional Neural Network VLSI for Image Recognition Using Merged/Mixed Analog-Digital Architecture

Korekado, Matsukawa, Nakamura, et al., 2003

Lecture Notes in Computer Science


Abstract. Hierarchical convolutional neural networks are a well-known robust image-recognition model. In order to apply this model to robot vision or various intelligent vision systems, its VLSI implementation with high performance and low power consumption is required. This paper proposes a convolutional network VLSI architecture using a hybrid approach composed of pulse-width modulation (PWM) and digital circuits. We call this approach merged/mixed analog-digital architecture. The VLSI includes PWM neuron circuits, PWM/digital converters, digital adder-subtracters, and digital memory. We have designed and fabricated a VLSI chip by using a 0.35 μm CMOS process. The VLSI chip can perform 6-bit precision convolution calculations for an image of 100×100 pixels with a receptive field area of up to 20×20 pixels within 5 ms, which means a performance of 2 GOPS. Power consumption of PWM neuron circuits is estimated to be 20 mW. We have verified successful operations using a fabricated VLSI chip.
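
The throughput figure in the abstract can be sanity-checked with a rough operation count. The sketch below is only a back-of-the-envelope estimate: it assumes one output per input pixel and counts each multiply-accumulate as two operations, neither of which the abstract states, and the function names are hypothetical. Under those assumptions the estimate comes out near 1.6 GOPS, the same order as the quoted 2 GOPS; the gap may simply reflect a different counting convention.

```python
# Rough throughput estimate for the convolution figures quoted in the abstract.
# Assumptions (not from the source): one output value per input pixel, and
# one multiply plus one add counted as two operations.

def conv_ops(image_side: int, field_side: int, ops_per_mac: int = 2) -> int:
    """Operation count for one 2-D convolution pass over a square image."""
    outputs = image_side * image_side          # one output per pixel (assumed)
    macs_per_output = field_side * field_side  # one MAC per receptive-field tap
    return outputs * macs_per_output * ops_per_mac


def throughput_gops(total_ops: int, seconds: float) -> float:
    """Convert an operation count and a duration into giga-operations per second."""
    return total_ops / seconds / 1e9


total = conv_ops(image_side=100, field_side=20)   # 100x100 image, 20x20 field
gops = throughput_gops(total, seconds=5e-3)       # completed within 5 ms
print(f"{total} operations, about {gops:.1f} GOPS")
```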


A Convolutional Neural Network VLSI Architecture Using Sorting Model for Reducing Multiply-and-Accumulation Operations

Nakamura, Matsukawa, Matsugu, et al., 2005

Lecture Notes in Computer Science

