University of Wollongong
Browse

A Pyramidal Neural Network For Visual Pattern Recognition

Download (1.19 MB)
journal contribution
posted on 2024-11-15, 10:44 authored by Son Lam PhungSon Lam Phung, Abdesselam BouzerdoumAbdesselam Bouzerdoum
In this paper, we propose a new neural architecture for classification of visual patterns that is motivated by the two concepts of image pyramids and local receptive fields. The new architecture, called pyramidal neural network (PyraNet), has a hierarchical structure with two types of processing layers: Pyramidal layers and one-dimensional (1-D) layers. In the new network, nonlinear two-dimensional (2-D) neurons are trained to perform both image feature extraction and dimensionality reduction. We present and analyze five training methods for PyraNet [gradient descent (GD), gradient descent with momentum, resilient backpropagation (RPROP), Polak-Ribiere conjugate gradient (CG), and Levenberg-Marquadrt (LM)] and two choices of error functions [mean-square-error (mse) and cross-entropy (CE)]. In this paper, we apply PyraNet to determine gender from a facial image, and compare its performance on the standard facial recognition technology (FERET) database with three classifiers: The convolutional neural network (NN), the k-nearest neighbor (k-NN), and the support vector machine (SVM).

History

Citation

This article was originally published as: Phung, SL & Bouzerdoum, A, A Pyramidal Neural Network For Visual Pattern Recognition, IEEE Transactions on Neural Networks, March 2007, 18(2), 329-343. Copyright IEEE 2007.

Journal title

IEEE Transactions on Neural Networks

Volume

18

Issue

2

Pagination

329-343

Language

English

RIS ID

16392

Usage metrics

    Categories

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC