pattern recognition, part 1 · pdf file... deep learning has boosted perfor- ... research...

3
6 1541-1672/16/$33.00 © 2016 IEEE IEEE INTELLIGENT SYSTEMS Published by the IEEE Computer Society Cheng-Lin Liu, Chinese Academy of Sciences Brian Lovell, University of Queensland, Australia Dacheng Tao, University of Technology Sydney, Australia Massimo Tistarelli, University of Sassari, Italy P attern recognition (PR) is one of the key capabilities in human intelligence. Machines equipped with a PR unit can sense environment, thus PR is often integrated in intelligent systems to acquire information and assist in decisions and human-machine interaction. GUEST EDITORS’ INTRODUCTION PR’s scope includes pattern classification (statistical and structural PR, neural net- works, kernel machines, ensemble learning, and multiview learning), clustering, feature extraction and selection, data preprocessing (such as image enhancement and segmenta- tion), visual object recognition, video analy- sis, applications in document analysis, bio- metrics, medical imaging, remote sensing image analysis, multimedia, video surveil- lance, and intelligent transportation. Both applications of and methods for PR have seen tremendous advances in recent years—for example, deep learning has boosted perfor- mance in many practical applications such as handwriting recognition, large-scale image recognition, object detection, and facial and speech recognition, just to cite a few. This special issue reports the state of the art in PR theory, algorithms, and applications. As guest editors, we had a wealth of choices among the 38 full submissions that the call for Pattern Recognition, Part 1

Upload: lycong

Post on 30-Mar-2018

226 views

Category:

Documents


3 download

TRANSCRIPT

6 1541-1672/16/$33.00 © 2016 IEEE iEEE iNTElliGENT SYSTEMSPublished by the IEEE Computer Society

n A t u r A L L A n G u A G e P r o c e s s i n G

Cheng-Lin Liu, Chinese Academy of Sciences

Brian Lovell, University of Queensland, Australia

Dacheng Tao, University of Technology Sydney, Australia

Massimo Tistarelli, University of Sassari, Italy

Pattern recognition (PR) is one of the key capabilities in human intelligence.

Machines equipped with a PR unit can sense environment, thus PR is often

integrated in intelligent systems to acquire information and assist in decisions and

human-machine interaction.

G u e s t e d i t o r s ’ i n t r o d u c t i o n

PR’s scope includes pattern classifi cation (statistical and structural PR, neural net-works, kernel machines, ensemble learning, and multiview learning), clustering, feature extraction and selection, data preprocessing (such as image enhancement and segmenta-tion), visual object recognition, video analy-sis, applications in document analysis, bio-metrics, medical imaging, remote sensing image analysis, multimedia, video surveil-lance, and intelligent transportation. Both

applications of and methods for PR have seen tremendous advances in recent years—for example, deep learning has boosted perfor-mance in many practical applications such as handwriting recognition, large-scale image recognition, object detection, and facial and speech recognition, just to cite a few.

This special issue reports the state of the art in PR theory, algorithms, and applications. As guest editors, we had a wealth of choices among the 38 full submissions that the call for

Pattern Recognition, Part 1

March/april 2016 www.computer.org/intelligent 7

papers attracted. All submitted articles underwent a strict peer review process, with each one assigned to at least two reviewers and most undergoing a second-round review. Because so many of the submissions were so good, we’re splitting this topic across two issues, with the first seven articles focusing on basic PR and image analysis issues, along with related applications. The May/June issue will look more closely at computer vision techniques and applications.

In “Cognitive Mechanisms Under-lying Enhanced Human and Com-puter Classification of Reduced Dimensionality, Information Rich (RDIR) Representations of Images,” Kaveri Thakoor proposes a cognitive mechanism-inspired approach for image feature presentation called reduced dimensionality, information rich (RDIR) representation. They’re generated by processing the original image with an algorithm that captures prominent orientation information in the scene, inspired by the way humans capture the gist of a scene upon observing it for 200 milliseconds or less, with a principal components analysis applied to the gist result. A possible cognitive mechanism is proposed for the enhanced recognition accuracy observed with RDIR rep-resentations based on the higher decorrelation of image pairs depicted in RDIR format compared to that of their downsampled counterparts.

In “A Stochastic Approach for Finding Optimal Context in Contextual Pattern Analysis Task,” Utpal Garain focuses on contextual pattern analysis tasks using random field models. An underlying random field is represented by a set of parameters that capture spatial dependence, and a Bayesian approach is followed to develop a decision rule for choosing appropriate context. The relevance of this approach

is explored for three pattern analysis tasks in which local context plays an important role: handwriting analysis, image compression, and word sense disambiguation.

In “Abnormal Event Detection via Compact Low-Rank Sparse Learn-ing,” Zhong Zhang, Xing Mei, and Baihua Xiao propose a method called compact low-rank sparse rep-resentation (CLSR) for abnormal event detection in video. The method adds compact regularization to the sparse representation model for considering the relationship of coefficient vectors. The low-rank property is exploited to capture the dictionary’s underlying structure. Experiments on three challenging databases demonstrate the superiority of the method in comparison to the current state of the art.

In “Churn Prediction in Customer Relationship Management via GMDH-Based Multiple Classifiers Ensemble,” Jin Xiao and his colleagues address the problem of churn prediction in customer relationship management. Specifically, they propose a novel multiple classifiers ensemble selection model, based on the group method of data handling (GMDH). Experimental results show that this proposed method can perform classification with imbalanced distributions better than existing ensemble methods such as bagging and boosting.

In “DeepWriterID: An End-to-end Online Text-independent Writer Iden-tification System,” Weixin Yang, Lianwen Jin, and Manfei Liu introduce an end-to-end writer identification system called DeepWriterID that employs a deep convolutional neural network (CNN). A key feature of the system is the so-called DropSegment, designed to achieve data augmentation and to improve CNN applicability. Experiments on an online handwriting database achieved

IEEE Computer SocietyPublications Office

10662 Los Vaqueros Circle, PO Box 3014

Los Alamitos, CA 90720-1314

Lead Editor Bonnie Wylie

[email protected]

Editorial Management Jenny Stout

Publications Coordinator [email protected]

Director, Products & Services Evan Butterfield

Senior Manager, Editorial Services Robin Baldwin

Digital Library Marketing Manager Georgann Carter

Senior Business Development Manager Sandra Brown

Senior Advertising Coordinator Marian Anderson

[email protected]

Submissions: For detailed instructions and formatting, see the author guidelines at www.computer.org/intelligent/author. htm or log onto IEEE Intelligent Systems’ author center at Manuscript Central (www.computer.org/mc/intelligent/ author.htm). Visit www.computer.org/intelligent for editorial guidelines.

Editorial: Unless otherwise stated, bylined articles, as well as product and service descriptions, reflect the author’s or firm’s opinion. Inclusion in IEEE Intelligent Systems does not necessarily constitute endorsement by the IEEE or the IEEE Computer Society. All submissions are subject to editing for style, clarity, and length.

IEE

E

8 www.computer.org/intelligent IEEE INTELLIGENT SYSTEMS

G u e s t e d i t o r s ’ i n t r o d u c t i o n

identification rates that outperformed the current state of the art.

In “Monocular Depth Ordering Rea-soning with Occlusion Edge Detec-tion and Couple Layers Inference,” Anlong Ming and colleagues present a novel depth-ordering reasoning approach for image analysis. The main contri-bution lies in two aspects: an occlu-sion edge detection method for gener-ating precise same-layer relationship judgment and producing reliable region proposals, and an inference method for inferring the final depth order. In the global layer, the inference is executed by finding a valid path on a simple and effective directed graph model.

Finally, in “Brain MR Image Tumor Segmentation with 3D Intracranial Structure Deformation Features,” Shang-Ling Jui and colleagues aim to improve brain tumor segmentation ac-curacy by using an improved feature extraction algorithm to exploit the cor-relation between intracranial structure deformation and the compression from brain tumor growth. The component is capable of measuring lateral ventricular deformation in volumetric magnetic res-onance images. It was evaluated qualita-tively and quantitatively with promising results on 11 datasets comprising real patient and simulated images.

We thank all the authors who submitted their invaluable

works to the special issue and all the re-viewers for their insightful comments in reviewing the submitted papers. We also thank Daniel Zeng, the editor-in-chief of IEEE Intelligent Systems for giv-ing us the opportunity of guest editing this special issue and for his continuous support in the entire process.

T h e A u T h o r sCheng-Lin Liu is a professor in and the director of the National Laboratory of Pattern Recognition, Institute of Automation of Chinese Academy of Sciences (CASIA). His research interests include pattern recognition, image processing, neural networks, machine learning, and especially applications to character recognition and document analysis. Liu received a PhD in pattern recognition and intelligent control from CASIA. He is a Fellow of IEEE and the IAPR. Contact him at [email protected].

Brian Lovell is director of the Advanced Surveillance Group in the School of ITEE, University of Queensland, Australia. His interests include face recognition, biometrics, nonlinear manifold learning, and pattern recognition. Lovell is a Fellow of the IAPR, senior member of IEEE, and voting member for Australia on the governing board of the IAPR. Contact him at [email protected].

Dacheng Tao is a professor of computer science in the Centre for Quantum Computation & Intelligent Systems and on the Faculty of Engineering and Information Technology at the University of Technology, Sydney. His research interests span computer vision, data science, image processing, machine learning, neural networks, and video surveillance. Tao is a Fellow of IEEE, the OSA, the IAPR, the SPIE, the BCS, and the IET. Contact him at [email protected].

Massimo Tistarelli is a professor of computer science and director of the Computer Vision Laboratory at the University of Sassari, Italy. His research interests cover biological and artificial vision (particularly in the area of recognition, 3D reconstruction, and dynamic scene analysis), pattern recognition, biometrics, visual sensors, robotic navigation, and visuo-motor coordination. Tistarelli is scientific director of the Italian Platform for Biometric Technologies, vice president of the IAPR, member of the IEEE Biometrics Professional Certification Committee, fellow member of the IAPR, and a senior member of IEEE. Contact him at [email protected].

Selected CS articles and columns are also available for free at http://

ComputingNow.computer.org.