learning to query: focused web page harvesting for entity ... · learning to query: focused web...

1

Learning to Query: Focused Web Page Harvesting for Entity Aspects Yuan Fang 1 Vincent Zheng 2 Kevin Chang 23 Problem: Learning to Query (L2Q) 1 Institute for Infocomm Research, Singapore 2 Advanced Digital Sciences Center, Singapore 3 University of Illinois at Urbana-Champaign, USA Subproblem #1: Domain-aware L2Q Subproblem #2: Context-aware L2Q Result A: Effect of Domain+Context Entity type Common aspects celebrity spouse, age, net worth car safety, cost, interior business address, hours, phone abundant but scattered Business Analytics Vertical portal or search Overall Workflow: Iterative Querying Keywords (uniquely) identifying the entity Seed query: Target aspect: A pre-trained classifier for the target aspect Utility: (precision/recall) In each iteration, ݍ∗ ൌ arg max ሺ ݍሻ a) Domain Pages b) Template Abstraction <topic> <journal> Target Entity Marc Snir Peer Entities Philips Yu Andrew Ng ∗ ሺሻ ா + entity pages ா domain pages Domain Phase Entity Phase Utility regularization (i.e. supervision on target aspect) c) Bridging Domain & Entity Collective precision ஐ ⋃ஐ ⋂ஐ ஐ ⋃ஐሺሻ Collective recall ஐ ⋃ஐ ⋂ஐ ஐ pages for target aspect pages of candidate query RND: select query randomly P/R: optimize precision/recall without domain and context P/R+q: with domain pages but no templates, and without context P/R+t: with domain pages and templates, without context L2QP/L2QR: full approaches optimizing precision/recall Result B: Compare with Indep. Baselines L2QBAL: optimize for F-score, balancing L2QP & L2QR LM: language feedback model AQ: adaptive querying for text databases HR: harvest rate for hidden structured databases MQ: manually designed queries

Upload: others

Post on 23-Sep-2020

13 views

Category:

Documents

0 download

Report

Download

Embed Size (px):

TRANSCRIPT

Page 1: Learning to Query: Focused Web Page Harvesting for Entity ... · Learning to Query: Focused Web Page Harvesting for Entity Aspects Yuan Fang1 Vincent Zheng2 Kevin Chang23 Problem:

Learning to Query:Focused Web Page Harvesting for Entity Aspects

Yuan Fang1 Vincent Zheng2 Kevin Chang23

Problem: Learning to Query (L2Q)

1 Institute for Infocomm Research, Singapore2 Advanced Digital Sciences Center, Singapore3 University of Illinois at Urbana-Champaign, USA

Subproblem #1: Domain-aware L2Q

Subproblem #2: Context-aware L2Q

Result A: Effect of Domain+Context

Entity type Common aspects

celebrity spouse, age, net worth

car safety, cost, interior

business address, hours, phone

abundant but scattered

Business Analytics

Vertical portal or search

Overall Workflow: Iterative Querying

Keywords (uniquely) identifying the entitySeed query:

Target aspect:A pre-trained classifierfor the target aspect

Utility: (precision/recall)

In each iteration, ∗ argmaxa) Domain Pages b) Template Abstraction

<topic> <journal>

Target Entity

Marc Snir

Peer Entities

Philips YuAndrew Ng

∗ +entity pages domain pages

Domain Phase Entity Phase

Utility regularization (i.e. supervision on target aspect)

c) Bridging Domain & Entity

Collective precision⋃ ⋂⋃Collective recall⋃ ⋂

pages for target aspect

pages of candidate query

RND: select query randomly

P/R: optimize precision/recall without domain and context

P/R+q: with domain pages but no templates, and without context

P/R+t: with domain pages and templates, without context

L2QP/L2QR: full approaches optimizing precision/recall

Result B: Compare with Indep. Baselines

L2QBAL: optimize for F-score, balancing L2QP & L2QR

LM: language feedback model

AQ: adaptive querying for text databases

HR: harvest rate for hidden structured databases

MQ: manually designed queries

Named Entity Recognition in an Intranet Query Log Richard Sutcliffe 1, Kieran White 1, Udo Kruschwitz 2 1 - University of Limerick, Ireland 2 - University

Querying Knowledge Graphs by Example Entity Tuples · Our work is the ﬁrst to query knowledge graphs by example entity tuples. The paradigm of query-by-example(QBE) has a long history

GRAQULA: A graphical query language for entity …dblab.kaist.ac.kr/Prof/pdf/Sockut1993.pdfdatabase's organization into entity and relationship types. We do not define application-

QuERy: A Framework for Integrating Entity Resolution with Query …haltwaij/QuERy.pdf · QuERy: A Framework for Integrating Entity Resolution with Query Processing Hotham Altwaijry

Diversi ed and Verbalized Result Summarization for ...ws.nju.edu.cn/association/summ2018/wise18_extended.pdfSA pattern (SAP), which is obtained by substituting each non-query entity

Query to Knowledge: Unsupervised Entity Extraction from ...kzhai.github.io/paper/2016_sigir.pdf · be organized to knowledgebase [5] which can impact and im-prove the performance

Entity Query Feature Expansion using Knowledge Base Linksciir.cs.umass.edu/~dietz/entitylinking/eqfe-sigir14-cameraready.pdf · research area. Query expansion is a commonly used technique

XIMENIA HARVESTING AND POST-HARVESTING PRACTICES … harvesting and post... · XIMENIA HARVESTING AND POST-HARVESTING PRACTICES ... The specific objectives of this survey were:

Recovering Question Answering Errors via Query Revisionxyan/papers/emnlp17_revision.pdf · 2 Question Revisions We formalize three kinds of question revi-sions, namely entity-centric,

Fast Join Project Query Evaluation using Matrix Multiplicationxh102/sigmod2020-arxiv.pdf · query execution plan. Such queries have a large number of applications in entity matching,

Entity Extraction for Query Interpretation Patrick Pantel ǂ

National Guideline for Management, Prevention and Control ...searo.who.int/entity/bangladesh/nationalguideline... · outbreaks were reported during date palm sap harvesting season

Utilizing Online Social Media for Post-Disaster Relief · " Traditional NLP, e.g., Named Entity Recognition, WordNet " IR methods – query expansion, Indri, Lucene " Neural network

Entity-Based Query Recommendation for Long-Tail Queries · Entity-Based Query Recommendation for Long-Tail Queries 1:3 Challenges and solutions. Given the entities found in an input

Cross-Lingual Entity Query from Large-Scale Knowledge Graphs · Cross-Lingual Entity Query from Large-Scale Knowledge Graphs Yonghao Su, Chi Zhang, Jinyang Li, Chengyu Wang, Weining

Enriching Textual Search Results at Query Time …users.ics.forth.gr/~tzitzik/publications/Tzitzikas_2015...Enriching Textual Search Results using Entity Mining, Linked Data and Link

1 Copyright © 2011, Oracle and/or its affiliates. All ... · Oracle Support for LINQ and Entity Framework •Query language interfaces –LINQ to Entities –Entity SQL –DML capabilities

J2EEKurs - Enterprise JavaBeans---Entity Beans/2 · Lokale Interfaces Assoziationen EJB Query Language EJB-Methoden J2EEKurs Enterprise JavaBeans—Entity Beans/2 Peter Thiemann Universitat

Data Models Matter Less Than You Thinkboncz/keynote-graphta2016-boncz.pdf · Keynote Statements 1970- Relational Data Model, SQL Query Language, Entity Relationship Modeling 1980-

Entity Framework Query Generation – LINQ-syntax – Auto-compilation Database Access – Single context during page interaction – —————— Less code – ————————

Chapter 7 The Query Compiler Query Processor ： Query Parser Tree Logical Query Plan Physical Query Plan Query Structure Relational Algebraic Expression

Databases - Entity-Relationship Modelling › units › CITS2232 › ... · ER Modelling Relationship Sets The power of relational databases comes from the ability to model and query

Named Entity Recognition in an Intranet Query Log

Entity 78 11 108 Entity 75 Entity 77 Entity 76 Entity 74

UrbanGreen Rainwater Harvesting Brochure€¦ · UrbanGreen Rainwater Harvesting Brochure Author: CONTECH Subject: Rainwater Harvesting and Water Reuse Keywords: Rainwater Harvesting,

Entity Query API

Energy Harvesting, tekhnologi harvesting

Subquery. Subquery or Inner query or Nested query A subquery is a query within another query. The outer query is called as main query and The inner query

Query-time Entity ResolutionQuery-time Entity Resolution Indrajit Bhattacharya [email protected] IBM India Research Laboratory Vasant Kunj, New Delhi 110 070, India Lise Getoor [email protected]

Harvesting, Processing and Visualising “Jožef Stefan” Institute, … · 2017-07-28 · API examples Twitter API Search API - enables your program to query for specific keywords

Query processing and optimization - IDATDDD37/fo/fo-optimization2.pdf · • Entity integrity constraint = no primary ... management Processing of queries and updates ... in order

Word-Entity Duet Representations for Document Ranking · „is paper presents a word-entity duet framework for utilizing knowledge bases in ad-hoc retrieval. In this work, the query

Query-Specific Knowledge Summarization with Entity ... · Given a query, unlike traditional IR that •nds relevant documents or entities, in this work, we focus on retrieving both

Select, Link and Rank: Diversiﬁed Query Expansion and ...aditk2.web.engr.illinois.edu/reports/krishnan_wise16.pdfsimultaneously diversifying query expansions and entity recommenda-tions

Rainwater Harvesting and Utilisation Rainwater Harvesting and