Discovering Key Concepts in Verbose Queries
Michael Bendersky and W. Bruce Croft, University of Massachusetts. SIGIR 2008.
![Page 1: Discovering Key Concepts in Verbose Queries](https://reader035.vdocuments.site/reader035/viewer/2022062222/56815029550346895dbe1853/html5/thumbnails/1.jpg)
Discovering Key Concepts in Verbose Queries
Michael Bendersky and W. Bruce Croft
University of Massachusetts
SIGIR 2008
![Page 2: Discovering Key Concepts in Verbose Queries](https://reader035.vdocuments.site/reader035/viewer/2022062222/56815029550346895dbe1853/html5/thumbnails/2.jpg)
Objective
• “Discovering Key Concepts in Verbose Queries”
![Page 3: Discovering Key Concepts in Verbose Queries](https://reader035.vdocuments.site/reader035/viewer/2022062222/56815029550346895dbe1853/html5/thumbnails/3.jpg)
Objective
• “Discovering Key Concepts in Verbose Queries”
• <num> Number 829
<title> Spanish Civil War support
<desc> Provide information on all kinds of material international support provided to either side in the Spanish Civil War
![Page 5: Discovering Key Concepts in Verbose Queries](https://reader035.vdocuments.site/reader035/viewer/2022062222/56815029550346895dbe1853/html5/thumbnails/5.jpg)
Objective
• “Discovering Key Concepts in Verbose Queries”
• Use of key concepts?
![Page 6: Discovering Key Concepts in Verbose Queries](https://reader035.vdocuments.site/reader035/viewer/2022062222/56815029550346895dbe1853/html5/thumbnails/6.jpg)
Objective
• “Discovering Key Concepts in Verbose Queries”
• Use of key concepts?
• Combine with current IR model
![Page 7: Discovering Key Concepts in Verbose Queries](https://reader035.vdocuments.site/reader035/viewer/2022062222/56815029550346895dbe1853/html5/thumbnails/7.jpg)
Retrieval Model
• Conventional Language Model:
$$\mathrm{score}(q,d) = p(q \mid d) = \frac{p(q,d)}{p(d)}$$
![Page 8: Discovering Key Concepts in Verbose Queries](https://reader035.vdocuments.site/reader035/viewer/2022062222/56815029550346895dbe1853/html5/thumbnails/8.jpg)
Retrieval Model
• Conventional Language Model:
$$\mathrm{score}(q,d) = p(q \mid d) = \frac{p(q,d)}{p(d)}$$
• New Model:
$$\mathrm{score}(q,d) = p(q \mid d) = \frac{p(q,d)}{p(d)} = \frac{\sum_{c_i} p(q,d,c_i)}{p(d)}$$
![Page 9: Discovering Key Concepts in Verbose Queries](https://reader035.vdocuments.site/reader035/viewer/2022062222/56815029550346895dbe1853/html5/thumbnails/9.jpg)
Final Retrieval Function
$$\mathrm{score}(q,d) = \lambda\, p(q \mid d) + (1-\lambda) \sum_{c_i} p(c_i \mid q)\, p(c_i \mid d)$$
![Page 10: Discovering Key Concepts in Verbose Queries](https://reader035.vdocuments.site/reader035/viewer/2022062222/56815029550346895dbe1853/html5/thumbnails/10.jpg)
Final Retrieval Function
$$\mathrm{score}(q,d) = \lambda\, p(q \mid d) + (1-\lambda) \sum_{c_i} p(c_i \mid q)\, p(c_i \mid d)$$
• Language Model: the $\lambda\, p(q \mid d)$ term
![Page 11: Discovering Key Concepts in Verbose Queries](https://reader035.vdocuments.site/reader035/viewer/2022062222/56815029550346895dbe1853/html5/thumbnails/11.jpg)
Final Retrieval Function
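$$\mathrm{score}(q,d) = \lambda\, p(q \mid d) + (1-\lambda) \sum_{c_i} p(c_i \mid q)\, p(c_i \mid d)$$
• Key Concepts: the $(1-\lambda) \sum_{c_i} p(c_i \mid q)\, p(c_i \mid d)$ term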
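The retrieval function mixes the conventional language-model score p(q|d) with a sum over concepts of p(c_i|q) · p(c_i|d). A minimal Python sketch of that interpolation follows. It is an illustration only, not the authors' implementation: the function name `interpolated_score`, the parameter name `lam` (standing for the mixing weight), and all toy probabilities are assumptions.

```python
from typing import Mapping


def interpolated_score(
    p_q_given_d: float,
    p_c_given_q: Mapping[str, float],
    p_c_given_d: Mapping[str, float],
    lam: float,
) -> float:
    """Return lam * p(q|d) + (1 - lam) * sum_i p(c_i|q) * p(c_i|d)."""
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie in [0, 1]")
    # Concepts missing from the document model contribute zero here;
    # a real system would presumably smooth p(c_i|d) instead.
    concept_term = sum(
        weight * p_c_given_d.get(concept, 0.0)
        for concept, weight in p_c_given_q.items()
    )
    return lam * p_q_given_d + (1.0 - lam) * concept_term


# Hypothetical numbers, chosen only to make the arithmetic easy to follow.
score = interpolated_score(
    p_q_given_d=0.02,
    p_c_given_q={"spanish civil war": 0.75, "material international support": 0.25},
    p_c_given_d={"spanish civil war": 0.04},
    lam=0.5,
)
print(score)
```

With `lam = 1` the function reduces to the conventional language-model score; with `lam = 0` only the key concepts matter.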
![Page 12: Discovering Key Concepts in Verbose Queries](https://reader035.vdocuments.site/reader035/viewer/2022062222/56815029550346895dbe1853/html5/thumbnails/12.jpg)
What is a Concept?
• Noun phrase in a query
![Page 13: Discovering Key Concepts in Verbose Queries](https://reader035.vdocuments.site/reader035/viewer/2022062222/56815029550346895dbe1853/html5/thumbnails/13.jpg)
What is a Concept?
• Noun phrase in a query
• <num> Number 829
<title> Spanish Civil War support
<desc> Provide information on all kinds of material international support provided to either side in the Spanish Civil War
![Page 15: Discovering Key Concepts in Verbose Queries](https://reader035.vdocuments.site/reader035/viewer/2022062222/56815029550346895dbe1853/html5/thumbnails/15.jpg)
Finding ‘Key’ Concepts
• Rank concepts by $p(c_i \mid q)$
![Page 16: Discovering Key Concepts in Verbose Queries](https://reader035.vdocuments.site/reader035/viewer/2022062222/56815029550346895dbe1853/html5/thumbnails/16.jpg)
Finding ‘Key’ Concepts
• Rank concepts by $p(c_i \mid q)$
• Compute $p(c_i \mid q)$ by frequency?
• <num> Number 829
<title> Spanish Civil War support
<desc> Provide information on all kinds of material international support provided to either side in the Spanish Civil War
![Page 17: Discovering Key Concepts in Verbose Queries](https://reader035.vdocuments.site/reader035/viewer/2022062222/56815029550346895dbe1853/html5/thumbnails/17.jpg)
Finding ‘Key’ Concepts
• Approximate $p(c_i \mid q)$ by machine learning
• $h(c_i)$ is $c_i$'s query-independent importance score
• $p(c_i \mid q) = h(c_i) / \sum_{c_i \in q} h(c_i)$
• $c_i$ → AdaBoost.M1 → $h(c_i)$
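The normalization step, dividing each concept's h score by the sum of h over all concepts in the query, can be sketched as follows. The h values below are hypothetical stand-ins for classifier output (no AdaBoost.M1 model is trained here), and the function name `concept_probabilities` is an assumption.

```python
from typing import Dict, Mapping


def concept_probabilities(h_scores: Mapping[str, float]) -> Dict[str, float]:
    """Normalize per-concept importance scores h(c_i) into p(c_i|q)."""
    total = sum(h_scores.values())
    if total <= 0:
        raise ValueError("importance scores must sum to a positive value")
    return {concept: h / total for concept, h in h_scores.items()}


# Hypothetical importance scores for the concepts of one query.
h = {
    "spanish civil war": 3.0,
    "material international support": 1.5,
    "information": 0.5,
}
probs = concept_probabilities(h)

# Ranking concepts by p(c_i|q) gives the same order as ranking by h(c_i).
ranked = sorted(probs, key=probs.get, reverse=True)
print(ranked)
```

Because the denominator is shared by every concept in the query, normalization changes the scale of the scores but not their ranking.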
![Page 18: Discovering Key Concepts in Verbose Queries](https://reader035.vdocuments.site/reader035/viewer/2022062222/56815029550346895dbe1853/html5/thumbnails/18.jpg)
Features of a Concept
• is_cap : whether the concept is capitalized
• tf : term frequency in the corpus
• idf : inverse document frequency in the corpus
• ridf : residual idf, i.e. idf modified by a Poisson model
• wig : weighted information gain; change in entropy from corpus to retrieved data
• g_tf : Google term frequency
• qp : number of times the concept appears as part of a query in MSN Live
• qe : number of times the concept appears as an exact query in MSN Live
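The corpus-only features (is_cap, tf, idf, ridf) can be sketched on a toy corpus as below. Several choices are assumptions rather than details given on the slide: is_cap is read as "every word starts with an upper-case letter", idf is the plain log2(N/df) form, and ridf uses the standard residual-idf definition (observed idf minus the idf expected under a Poisson model). The features wig, g_tf, qp and qe need retrieval runs or external query logs and are left out.

```python
import math
from typing import Dict, List


def count_occurrences(tokens: List[str], phrase: List[str]) -> int:
    """Count positions where `phrase` occurs contiguously in `tokens`."""
    n = len(phrase)
    return sum(1 for i in range(len(tokens) - n + 1) if tokens[i:i + n] == phrase)


def corpus_features(concept: str, docs: List[str]) -> Dict[str, float]:
    phrase = concept.lower().split()
    if not phrase:
        raise ValueError("concept must contain at least one word")
    per_doc = [count_occurrences(doc.lower().split(), phrase) for doc in docs]
    n_docs = len(docs)
    tf = sum(per_doc)                       # occurrences in the whole corpus
    df = sum(1 for c in per_doc if c > 0)   # documents containing the concept
    if df == 0:
        raise ValueError("concept does not occur in the corpus")
    idf = math.log2(n_docs / df)
    # idf expected if occurrences were spread by a Poisson process.
    expected_idf = -math.log2(1.0 - math.exp(-tf / n_docs))
    return {
        "is_cap": float(all(word[0].isupper() for word in concept.split())),
        "tf": float(tf),
        "idf": idf,
        "ridf": idf - expected_idf,
    }


# Hypothetical four-document corpus.
docs = [
    "the spanish civil war began in 1936",
    "foreign support in the spanish civil war and the spanish civil war aftermath",
    "a history of radio",
    "weather report",
]
feats = corpus_features("Spanish Civil War", docs)
print(feats)
```

A positive ridf indicates that the concept is concentrated in fewer documents than a Poisson model would predict from its total frequency.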
![Page 19: Discovering Key Concepts in Verbose Queries](https://reader035.vdocuments.site/reader035/viewer/2022062222/56815029550346895dbe1853/html5/thumbnails/19.jpg)
TREC Corpus
![Page 20: Discovering Key Concepts in Verbose Queries](https://reader035.vdocuments.site/reader035/viewer/2022062222/56815029550346895dbe1853/html5/thumbnails/20.jpg)
Exp 1: Identifying Key Concept
• Cross-validation on corpus
• Each fold has 50 queries
• Check whether the top concept is a key concept
• Assume 1 key concept per query during annotation
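The evaluation protocol above can be sketched as two small helpers: one that splits queries into folds of 50, and one that checks whether the top-ranked concept matches the single annotated key concept. The names `make_folds` and `top1_accuracy` and the toy scores are hypothetical, and training the classifier on the remaining folds is not shown.

```python
from typing import Dict, List, Sequence, Tuple


def make_folds(query_ids: Sequence[int], fold_size: int = 50) -> List[List[int]]:
    """Split query ids into consecutive folds of `fold_size` queries."""
    return [
        list(query_ids[i:i + fold_size])
        for i in range(0, len(query_ids), fold_size)
    ]


def top1_accuracy(examples: List[Tuple[Dict[str, float], str]]) -> float:
    """Fraction of queries whose top-scored concept is the annotated key concept."""
    if not examples:
        raise ValueError("need at least one query")
    hits = 0
    for scores, key_concept in examples:
        predicted = max(scores, key=scores.get)
        hits += int(predicted == key_concept)
    return hits / len(examples)


folds = make_folds(range(150))

# Hypothetical held-out queries: (concept scores, annotated key concept).
held_out = [
    ({"spanish civil war": 0.7, "information": 0.3}, "spanish civil war"),
    ({"kinds": 0.6, "material international support": 0.4}, "material international support"),
    ({"either side": 0.2, "spanish civil war": 0.8}, "spanish civil war"),
]
accuracy = top1_accuracy(held_out)
print(len(folds), accuracy)
```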
![Page 22: Discovering Key Concepts in Verbose Queries](https://reader035.vdocuments.site/reader035/viewer/2022062222/56815029550346895dbe1853/html5/thumbnails/22.jpg)
Exp 1: Identifying Key Concept
• Better than idf ranking
![Page 23: Discovering Key Concepts in Verbose Queries](https://reader035.vdocuments.site/reader035/viewer/2022062222/56815029550346895dbe1853/html5/thumbnails/23.jpg)
Exp 2: Information Retrieval
$$\mathrm{score}(q,d) = \lambda\, p(q \mid d) + (1-\lambda) \sum_{c_i} p(c_i \mid q)\, p(c_i \mid d)$$
• Use only the top 2 concepts for each query
• q is the entire <desc> section
• $\lambda = 0.8$
![Page 24: Discovering Key Concepts in Verbose Queries](https://reader035.vdocuments.site/reader035/viewer/2022062222/56815029550346895dbe1853/html5/thumbnails/24.jpg)
Exp 2: Information Retrieval
• KeyConcept[2]<desc> : the authors' method
• SeqDep<desc> : include all bigrams in query
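To make "all bigrams in query" concrete, the sketch below enumerates adjacent word pairs of a query. It assumes that "bigrams" means adjacent pairs of whitespace-separated tokens; it does not show how the SeqDep baseline weights or matches them, and the function name `query_bigrams` is hypothetical.

```python
from typing import List, Tuple


def query_bigrams(query: str) -> List[Tuple[str, str]]:
    """Return every pair of adjacent tokens in the query, in order."""
    tokens = query.lower().split()
    return list(zip(tokens, tokens[1:]))


bigrams = query_bigrams("Spanish Civil War support")
print(bigrams)
```

For a verbose <desc> query this list grows with query length, whereas KeyConcept[2] keeps only two concepts per query.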
![Page 26: Discovering Key Concepts in Verbose Queries](https://reader035.vdocuments.site/reader035/viewer/2022062222/56815029550346895dbe1853/html5/thumbnails/26.jpg)
What to take home?
• Singling out key concepts improves retrieval