• 検索結果がありません。

Schedule for 2017 Web Information Extraction and Retrieval Web IE & IR

N/A
N/A
Protected

Academic year: 2018

シェア "Schedule for 2017 Web Information Extraction and Retrieval Web IE & IR"

Copied!
13
0
0

読み込み中.... (全文を見る)

全文

(1)

Web IE & IR

Dr. Chia-Hui Chang

National Central University, Taiwan 2017/02/13

(2)

Google Search Engine

News Search

Image Search

Book Search

Product

Search on Map

Job Search

People Search

Event Search

Search on the Web

(3)
(4)
(5)
(6)

 Topic Search or Deep Search or your choi

ce

 Learn how to crawl, extract data, and in

dex data for better search experience on

the Web

 Major topics:

 Information Extraction

 Template Pages & Free Texts

 Information Retrieval

Course Goal

(7)

Constructing a POI database from Web

7

Query Log

POI Search System

POI DB POI

Name+Address +Description

POI DB POI

Name+Address +Description

Query

Associated Information

Extraction

Address Extraction & Normalizatio

n

Query-based Crawler

POI Extraction

Address-bearing pages

POI Name

POI Name & Address Pairing

Addres s Keyword

s

POI Pair

POI Name Recognitio

n Address Pattern

Associated Information Summarization

(8)

Key Components

Crawler:

 Yellow Page Web sites  Deep Web data extraction

 Query-based crawler for Address-Bearing Pages (A BP).

 POI Extraction from ABP

 Address extraction

 POI name extraction

 POI name and address pairing

 Description of a POI

 Associated information extraction

 Summarization of the associated information

8

(9)

• As a portal for mobile devices to replace APP s like Web search engines

• Categorization of Chatbots:

Template-Driven Chatbots

Retrieval-Based Chatbots

Generative-based Chatbots

 Schedules of Chinese Subtask

Task registration due: Apr 2017

STC run submission deadline: May 2017

Relevance assessments: Jun-Jul 2017

Results released to participants: Sep 1 2017

Participants draft papers due: Oct 1 2017

NTCIR – Short Text Conversa

tion

(10)

Retrieval-Based Chatbots (Closed)

(11)

Generative-based Methods (Op

en)

(12)

 5 Quiz: 25%

 3 Assignments: 45%

 Oral presentation: 10%

 Class involvement: 5%

 Ask at least 1 question every week

 Final project: 20%

 Paper submission: +3~5 bonus

 SIGIR, SAC WT, AI and the Web

Grading

(13)

 Stanford NLP-Dan Jurafsky & Chris

 18 - 1 - Introduction to Information Retrieval

 18 - 2 - Term-Document Incidence Matrices

 18 - 3 - The Inverted Index

 18 - 4 - Query Processing with the Inverted In dex

 18 - 5 - Phrase Queries and Positional Indexes

Information Retrieval Overvi

ew

参照

関連したドキュメント

In recent communications we have shown that the dynamics of economic systems can be derived from information asymmetry with respect to Fisher information and that this form

Some useful bounds, probability weighted moment inequalities and variability orderings for weighted and unweighted reliability measures and related functions are presented..

The calibration problem for the Black-Scholes model was solved based on the S&P500 data, and the S&P 500 call and put option price data were interpreted in the framework

for the observed functions, smooth.type a string with the name of smoothing method to be used (B-splines or Fourier), nbasis a numeric value defining the number of basis functions

東京都は他の道府県とは値が離れているように見える。相関係数はこう

These results are motivated by the bounds for real subspaces recently found by Bachoc, Bannai, Coulangeon and Nebe, and the bounds generalize those of Delsarte, Goethals and Seidel

The proof of Theorem 1.1 was the argument due to Bourgain [3] (see also [6]), where the global well-posedness was shown for the two dimensional nonlinear Schr¨ odinger equation

For the three dimensional incompressible Navier-Stokes equations in the L p setting, the classical theories give existence of weak solutions for data in L 2 and mild solutions for