deep relevance ranking using enhanced document-query ...nlp.cs.aueb.gr/pubs/emnlp2018preso.pdf ·...

16
Deep Relevance Ranking Using Enhanced Document-Query Interactions Ryan McDonald, George Brokos and Ion Androutsopoulos https://github.com/nlpaueb

Upload: others

Post on 07-Oct-2020

8 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Deep Relevance Ranking Using Enhanced Document-Query ...nlp.cs.aueb.gr/pubs/EMNLP2018Preso.pdf · End-to-end training coupled with traditional IR signals End-to-end architectures

Deep Relevance Ranking Using Enhanced Document-Query InteractionsRyan McDonald, George Brokos and Ion Androutsopoulos

https://github.com/nlpaueb

Page 2: Deep Relevance Ranking Using Enhanced Document-Query ...nlp.cs.aueb.gr/pubs/EMNLP2018Preso.pdf · End-to-end training coupled with traditional IR signals End-to-end architectures

Ad-hoc Retrieval - Relevance Ranking

(NL) Query

DocumentRanking

Document Collection

RankSelect top-N

Page 3: Deep Relevance Ranking Using Enhanced Document-Query ...nlp.cs.aueb.gr/pubs/EMNLP2018Preso.pdf · End-to-end training coupled with traditional IR signals End-to-end architectures

(Deep) Ad-hoc Retrieval / Relevance Ranking● Relevance-based

Query-Doc term similarity matrices

● Interaction-based○ DeepMatch (Lu and Li 2013)○ ARC-II (Hu et al. 2014)○ MatchPyramid (Pang et al. 2016)○ DRMM (Guo et al. 2016)○ PACRR (Hui et al. 2017)○ DeepRank (Pang et al. 2017)

RelevanceScore

Query Document

Page 4: Deep Relevance Ranking Using Enhanced Document-Query ...nlp.cs.aueb.gr/pubs/EMNLP2018Preso.pdf · End-to-end training coupled with traditional IR signals End-to-end architectures

Deep Relevance Matching Model (DRMM)Guo et al 2016

Term Score Aggregation

Relevance Score

Term ScoreDense Layers

Document-terms

q1Query-terms q2 q3

Doc-awareQuery-term Encodings

Doc-Query Interaction

Fixed-width

Term Gates q3idf

Page 5: Deep Relevance Ranking Using Enhanced Document-Query ...nlp.cs.aueb.gr/pubs/EMNLP2018Preso.pdf · End-to-end training coupled with traditional IR signals End-to-end architectures

Deep Relevance Matching Model (DRMM)Guo et al 2016 Relevance Score

Query

Document

Cosine SimilarityHistograms2 3

1A B C

B C B A A B

Page 6: Deep Relevance Ranking Using Enhanced Document-Query ...nlp.cs.aueb.gr/pubs/EMNLP2018Preso.pdf · End-to-end training coupled with traditional IR signals End-to-end architectures

Query

Document

Element-wise Function(e.g., Hadamard)

Attention-BasedELement-wise(ABEL-DRMM)

Attention-baseddoc-encoding

DifferentiableEnd-to-end

Training

Page 7: Deep Relevance Ranking Using Enhanced Document-Query ...nlp.cs.aueb.gr/pubs/EMNLP2018Preso.pdf · End-to-end training coupled with traditional IR signals End-to-end architectures

Query

Document

Attention-baseddoc-encoding

Element-wise Function(e.g., Hadamard)

Attention-BasedELement-wise(ABEL-DRMM)

Context-sensitiveencoder

DifferentiableEnd-to-end

Training

Page 8: Deep Relevance Ranking Using Enhanced Document-Query ...nlp.cs.aueb.gr/pubs/EMNLP2018Preso.pdf · End-to-end training coupled with traditional IR signals End-to-end architectures

Exact Match

POoled-SImilariTy(POSIT-DRMM)

MULTI-VIEW(also applied to ABEL-DRMM)

Query

Document

k-max max, avg

POOLINGContext Sensitive (BiLSTM)

Context Insensitive (w2v) concatenate

Page 9: Deep Relevance Ranking Using Enhanced Document-Query ...nlp.cs.aueb.gr/pubs/EMNLP2018Preso.pdf · End-to-end training coupled with traditional IR signals End-to-end architectures

There is more going on … (read the paper)

● Re-ranked top N documents from traditional index using BM25 score

● All models use final linear layer○ Severyn and Moschitti 2015○ Mohan et al. 2017

● Pairwise training with negative sampling

Deep Relevance

Module

Relevance Score Extra FeaturesNormalized BM25;Unigram overlap;Bigram overlap;Tf-idf weighted

overlap;...

Final Relevance Score

From top N retrieved documents

Page 10: Deep Relevance Ranking Using Enhanced Document-Query ...nlp.cs.aueb.gr/pubs/EMNLP2018Preso.pdf · End-to-end training coupled with traditional IR signals End-to-end architectures

Experimental Datasets

BioASQTsatsaronis et al. (2015)

TREC RobustVoorhees (2004)

Queries 2,251 250

Train / dev / test Y1-4 / Y5b1 / Y5b2-5 5X-CV 60/20/20

Average query length 9.16 2.74

Collection size (documents) 17.7M 528K

Average document length 196.6 476.5

Vocabulary size 15.2M 1.4M

Average relevant documents 12.0 69.6

Page 11: Deep Relevance Ranking Using Enhanced Document-Query ...nlp.cs.aueb.gr/pubs/EMNLP2018Preso.pdf · End-to-end training coupled with traditional IR signals End-to-end architectures

Trad IR/ML DL Baselines Our Models

BioASQ(Year 5)

Stat. Sig.Stat. Sig.

Page 12: Deep Relevance Ranking Using Enhanced Document-Query ...nlp.cs.aueb.gr/pubs/EMNLP2018Preso.pdf · End-to-end training coupled with traditional IR signals End-to-end architectures

BioASQ Document Ranking (Year 6)

USTB: best system in years 3, 4, 5

Page 13: Deep Relevance Ranking Using Enhanced Document-Query ...nlp.cs.aueb.gr/pubs/EMNLP2018Preso.pdf · End-to-end training coupled with traditional IR signals End-to-end architectures

POSIT Ablation Study (BioASQ dev)VIEW / MATCH-TYPEC.I. = context-insensitive (word2vec)C.S. = context-sensitive (BiLSTM)E.M. = exact match

Page 14: Deep Relevance Ranking Using Enhanced Document-Query ...nlp.cs.aueb.gr/pubs/EMNLP2018Preso.pdf · End-to-end training coupled with traditional IR signals End-to-end architectures

Trad IR/ML** DL Baselines Our Models

TRECRobust2004

Stat. Sig.Stat. Sig.

** Lin, Yang, and Lu.SIGIR Forum, December 2018, forthcoming.

Page 15: Deep Relevance Ranking Using Enhanced Document-Query ...nlp.cs.aueb.gr/pubs/EMNLP2018Preso.pdf · End-to-end training coupled with traditional IR signals End-to-end architectures

Summary● Simple architectures can be effective for document ranking

○ Especially on NL query data sets ○ End-to-end training coupled with traditional IR signals○ End-to-end architectures enables context-sensitive encoders

● POSIT-DRMM vs. ABEL-DRMM○ Modeling best and top-k-average match improves accuracy

● Multi-view models promotes exact match in addition to vector match

Page 16: Deep Relevance Ranking Using Enhanced Document-Query ...nlp.cs.aueb.gr/pubs/EMNLP2018Preso.pdf · End-to-end training coupled with traditional IR signals End-to-end architectures

Thanks!

https://github.com/nlpaueb/deep-relevance-ranking