
Disconnected Recurrent Neural Networks for Text Categorization

Baoxin Wang
Joint Lab of HIT and iFLYTEK, iFLYTEK Research, Beijing, China

INTRODUCTION

An RNN can model an entire sequence and capture long-term dependencies, but it does not do well at extracting key patterns. In contrast, a convolutional neural network (CNN) is good at extracting local and position-invariant features. We present a novel model named disconnected recurrent neural network (DRNN), which incorporates position-invariance into the RNN by constraining the distance of information flow. DRNN achieves the best performance on several benchmark datasets for text categorization.

MOTIVATION

Case 1: One of the seven great unsolved mysteries of mathematics may have been cracked by a reclusive Russian.
Case 2: A reclusive Russian may have cracked one of the seven great unsolved mysteries of mathematics.

The key phrase that determines the category is "unsolved mysteries of mathematics", which a CNN can extract well thanks to position-invariance. An RNN can model the whole sequence and capture long-term dependencies, yet it does not handle this case well, because its representation of the key phrase depends on all the preceding words and therefore changes as the key phrase moves.

DRNN is proposed to deal with such issues while eliminating the burden of modeling the entire sentence.

ACKNOWLEDGMENTS & CONTACT

This work was supported by the National Key Research and Development Program of China (No.2016YFC0800806). To get in touch with our lab (HFL), please scan the QR-code on the right. E-mail: [email protected]

THE MODEL

DRNN limits the distance of information flow in the RNN. An important difference from an RNN is that the state of our model at each step depends only on the previous k-1 words rather than on all the previous words. DRNN can be regarded as a special 1D CNN that replaces the convolution filters with recurrent units.
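The windowed recurrence described in THE MODEL can be illustrated with a minimal NumPy sketch. It is not the poster's implementation. The plain tanh cell, the zero initial state for every window, the truncated windows at the start of the sentence, the weight scaling, and all function names (rnn_cell, drnn_states, rnn_states) are assumptions made for illustration. The sketch places the same k-word phrase at the start and at the end of a sentence and compares the state at the last word of the phrase.

```python
import numpy as np

def rnn_cell(x, h, W_x, W_h, b):
    """One step of a plain tanh recurrent unit (an assumed stand-in cell)."""
    return np.tanh(x @ W_x + h @ W_h + b)

def drnn_states(X, k, W_x, W_h, b):
    """Return one state per position, each computed from at most k inputs.

    X has shape (seq_len, input_dim). The state at position t is obtained by
    running the cell from a zero state over X[t-k+1 : t+1], so it cannot
    depend on anything further back than the previous k-1 words.
    """
    seq_len = X.shape[0]
    hidden_dim = W_h.shape[0]
    states = np.zeros((seq_len, hidden_dim))
    for t in range(seq_len):
        h = np.zeros(hidden_dim)  # information flow is cut at the window edge
        for x in X[max(0, t - k + 1): t + 1]:
            h = rnn_cell(x, h, W_x, W_h, b)
        states[t] = h
    return states

def rnn_states(X, W_x, W_h, b):
    """Ordinary RNN for comparison: each state depends on all earlier inputs."""
    h = np.zeros(W_h.shape[0])
    out = []
    for x in X:
        h = rnn_cell(x, h, W_x, W_h, b)
        out.append(h)
    return np.stack(out)

rng = np.random.default_rng(0)
input_dim, hidden_dim, k = 4, 3, 3
W_x = 0.5 * rng.normal(size=(input_dim, hidden_dim))
W_h = 0.5 * rng.normal(size=(hidden_dim, hidden_dim))
b = np.zeros(hidden_dim)

phrase = rng.normal(size=(k, input_dim))   # stand-in for a k-word key phrase
filler = rng.normal(size=(5, input_dim))   # stand-in for unrelated words

early = np.vstack([phrase, filler])        # key phrase at the start
late = np.vstack([filler, phrase])         # key phrase at the end

# DRNN: the state at the last word of the phrase sees only the phrase itself.
drnn_same = np.allclose(drnn_states(early, k, W_x, W_h, b)[k - 1],
                        drnn_states(late, k, W_x, W_h, b)[-1])
# Plain RNN: the state also carries the preceding filler words.
rnn_same = np.allclose(rnn_states(early, W_x, W_h, b)[k - 1],
                       rnn_states(late, W_x, W_h, b)[-1])
print("DRNN state unchanged when the phrase moves:", drnn_same)
print("RNN state unchanged when the phrase moves:", rnn_same)
```

drnn_states returns one vector per position, much like the feature map of a 1D convolution with window size k, which is the sense in which the model resembles a CNN whose filters are replaced by recurrent units. The nested loop recomputes every window from scratch and is written for clarity rather than speed.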
Figure: Model architecture
Figure: Dropout is applied to the hidden states as well.

EXPERIMENTS

We apply DRNN to text categorization and conduct experiments on several datasets. The table lists the error rates (%) on seven datasets:

The comparison among models:

Effects of recurrent units and pooling methods:

Results of different window sizes:

The type of task has a great impact on the optimal window size, while the size of the training data seems to have little effect on it.

Table: Error rates on three datasets compared to RNN and CNN
Figure: The influence of window size on DRNN compared to CNN


