Natural Language Question Answering over Large-scale Linked Data


Natural Language Question Answering over Large-scale Linked Data
Kang Liu
National Laboratory of Pattern Recognition (NLPR), Institute of Automation, Chinese Academy of Sciences
8/30/2014, Kun Ming

Knowledge Graph: Linked Data
    More than 570 million entities, more than 1.8 billion facts (relations)
    Baidu Zhixin (百度知心)
    2,653,873 concepts
    Sogou Zhilifang (搜狗知立方)

More Linked Data

How to access these Linked Data
Question: Which software has been developed by organizations founded in California, USA?
Experts or Developer
SPARQL:
    SELECT DISTINCT ?uri WHERE {
      ?uri rdf:type dbo:software .
      ?uri dbo:developer ?x1 .
      ?x1 rdf:type dbo:company .
      ?x1 dbo:foundationplace dbr:california .
    }
[Figure: Linked Data graph fragment with nodes Android ({Answer}), Google, Oracle, California, Java, Software, Apache_License and 2 (Integer), connected by the edges version, developer, foundationplace, license, type and programmedin]

How to access these Linked Data
Question: Which software has been developed by organizations founded in California, USA?
QA System
SPARQL:
    SELECT DISTINCT ?uri WHERE {
      ?uri rdf:type dbo:software .
      ?uri dbo:developer ?x1 .
      ?x1 rdf:type dbo:company .
      ?x1 dbo:foundationplace dbr:california .
    }
[Figure: Linked Data graph fragment with nodes Android ({Answer}), Google, Oracle, California, Java, Software, Apache_License and 2 (Integer), connected by the edges version, developer, foundationplace, license, type and programmedin]

Pipeline Process
Question:
    Which software has been developed by organizations founded in California, USA?
Phrase:
    software, developed by, organizations, founded in, California
Semantic Item:
    dbo:software, dbo:developer, dbo:company, dbo:foundationplace, dbr:california
Semantic Triple:
    <dbo:software, dbo:developer, dbo:company>
    <dbo:company, dbo:foundationplace, dbr:california>
SPARQL:
    SELECT DISTINCT ?uri WHERE {
      ?uri rdf:type dbo:software .
      ?uri dbo:developer ?x1 .
      ?x1 rdf:type dbo:company .
      ?x1 dbo:foundationplace dbr:california .
    }
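The last pipeline step, turning semantic triples into a SPARQL string, can be sketched as follows. This is a minimal illustration rather than the authors' implementation: the helper name `build_sparql` and the tuple representation of triple patterns are assumptions, and the patterns themselves are copied from the slide's example query.

```python
from typing import List, Tuple

# A triple pattern is (subject, predicate, object); variables start with "?".
Triple = Tuple[str, str, str]


def build_sparql(triples: List[Triple], answer_var: str = "?uri") -> str:
    """Render triple patterns as a single-line SELECT DISTINCT query.

    Hypothetical helper for illustration; not part of the presented system.
    """
    body = " ".join(f"{s} {p} {o} ." for s, p, o in triples)
    return f"SELECT DISTINCT {answer_var} WHERE {{ {body} }}"


# Triple patterns taken from the example query on the slide.
patterns = [
    ("?uri", "rdf:type", "dbo:software"),
    ("?uri", "dbo:developer", "?x1"),
    ("?x1", "rdf:type", "dbo:company"),
    ("?x1", "dbo:foundationplace", "dbr:california"),
]

query = build_sparql(patterns)
print(query)
```

Keeping the triple patterns as plain data until the final rendering step means the earlier stages (phrase detection, mapping, grouping) never have to manipulate query strings.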

Challenges: manually designed patterns
Phrase detection rules
    NN, NNP: Entity
    NN: Class, Property
    VB: Property
Semantic item grouping patterns (syntactic patterns)
    Verb and its arguments
    Adjective and its arguments
    Prepositionally modified tokens and their objects
    (?x, dbo:productor, dbo:film)

Challenges: manually designed patterns
Phrase detection rules
    NN, NNP: Entity
    NN: Class, Property
    VB: Property
Semantic item grouping patterns (syntactic patterns)
    Verb and its arguments
    Adjective and its arguments
    Prepositionally modified tokens and their objects
    (?x, dbo:productor, dbo:film)
Can we automatically learn rules or patterns?

Challenges: ambiguities
Question: Which software has been developed by organizations founded in California, USA?
Phrase Detection:
    { California }, { California, USA }
Phrase Mapping:
    California: {California_State}, {California_Film}
Semantic Item Grouping:
    {dbo:software, dbo:developer, dbo:company}
    {dbo:software, dbo:foundationplace, dbo:company}

Challenges: ambiguities
Question: Which software has been developed by organizations founded in California, USA?
Phrase Detection:
    { California }, { California, USA }
Phrase Mapping:
    California: {California_State}, {California_Film}
Semantic Item Grouping:
    {dbo:software, dbo:developer, dbo:company}
    {dbo:software, dbo:foundationplace, dbo:company}
Can we make joint inference?
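Why joint inference helps with these ambiguities can be illustrated with a toy comparison between deciding each mapping independently and scoring whole combinations. Everything numeric here is an assumption made up for illustration (the local scores and the compatibility table are not from the slides); only the candidate names California_State, California_Film and dbo:foundationplace come from the talk.

```python
from itertools import product

# Hypothetical local mapping scores for each phrase (illustrative only).
candidates = {
    "founded in": {"dbo:foundationplace": 0.9},
    "California": {"California_Film": 0.6, "California_State": 0.4},
}

# Hypothetical pairwise compatibility between the property and the entity:
# a foundation place is far more plausibly a state than a film.
compat = {
    ("dbo:foundationplace", "California_State"): 1.0,
    ("dbo:foundationplace", "California_Film"): 0.1,
}


def pipeline_choice():
    """Pick the locally best candidate for every phrase independently."""
    return {ph: max(opts, key=opts.get) for ph, opts in candidates.items()}


def joint_choice():
    """Pick the combination with the best product of local and pairwise scores."""
    phrases = list(candidates)
    best, best_score = None, float("-inf")
    for combo in product(*(candidates[p] for p in phrases)):
        score = compat.get(combo, 1.0)
        for phrase, item in zip(phrases, combo):
            score *= candidates[phrase][item]
        if score > best_score:
            best, best_score = dict(zip(phrases, combo)), score
    return best


print(pipeline_choice())
print(joint_choice())
```

With these made-up scores the independent decision keeps the film reading of "California", while the joint decision switches to the state because the combination with dbo:foundationplace scores higher overall.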

Our Solution
Pattern Learning
    Meta patterns
    Not only a verb and its arguments
    All syntactic paths may be possible
Joint Inference
    First-order logic formulas
    Markov Logic Network
    [Figure: ground Markov network over the atoms p1(a,b), p1(b,a), p2(a), p2(b), p3(a,a), p3(b,b), p4(a), p4(b)]
    $$p(y) = \frac{1}{Z} \exp\left( \sum_{(\phi_i, w_i) \in L} w_i \sum_{c \in C^{n_{\phi_i}}} f_c^{\phi_i}(y) \right)$$
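The Markov Logic Network distribution above can be made concrete with a tiny brute-force example: enumerate every possible world, sum the weights of the satisfied formula groundings, exponentiate, and normalise by Z. The two formulas and their weights below are assumptions chosen for illustration (each formula has a single grounding); they are not the formulas used in the talk.

```python
import math
from itertools import product

# Two ground atoms, named after the hidden predicates for readability.
atoms = ["hasphrase(1)", "hasresource(1,1)"]

# Hypothetical weighted formulas: (weight, feature function returning 0 or 1).
formulas = [
    # Prefer choosing phrase 1.
    (1.5, lambda world: int(world["hasphrase(1)"])),
    # hasresource(1,1) => hasphrase(1): a mapping needs its phrase to be chosen.
    (2.0, lambda world: int((not world["hasresource(1,1)"]) or world["hasphrase(1)"])),
]


def score(world):
    """Weighted sum of satisfied formula groundings (the exponent in p(y))."""
    return sum(weight * feature(world) for weight, feature in formulas)


# Enumerate all truth assignments; feasible only for toy-sized problems.
worlds = [dict(zip(atoms, values)) for values in product([False, True], repeat=len(atoms))]
Z = sum(math.exp(score(world)) for world in worlds)


def prob(world):
    """Probability of one world under the toy model."""
    return math.exp(score(world)) / Z


for world in worlds:
    print(world, round(prob(world), 4))
```

The world that maps a resource without choosing its phrase violates the implication and satisfies neither formula, so it receives the lowest probability. Real systems avoid this enumeration and use approximate or cutting-plane inference instead.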

Predicates (Hidden Predicates, Observed Predicates)
Hidden predicates:
    hasphrase(i): the i-th candidate phrase has been chosen
    hasresource(i, j): the i-th phrase is mapped to the j-th semantic item
    hasrelation(ri, rj, rr): the semantic items ri and rj can be grouped together with the relation type rr


Formulas: Hard Formulas
[Table of hard formulas not preserved in the transcription]

Formulas: Soft Formulas
[Table of soft formulas not preserved in the transcription]


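Framework
Input question: In which movies directed by Garry Marshall was Julia Roberts starring?
Resources: DBpedia, Wikipedia, Word2vec, Reverb & Patty, statistics
Stages:
    1. Question preprocessing: question type, focus, removal of useless words, etc.
    2. Phrase detection & resource mapping & feature extraction (output: resource mapping candidates, structure matching candidates)
    3. MLN joint disambiguation, driven by the MLN model predicates and formulas (output: resource mapping results, structure matching results)
    4. Query graph construction (output: query graph)
    5. Query generation (output: SPARQL statement)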

Framework: question preprocessing
Input question: In which movies directed by Garry Marshall was Julia Roberts starring?
After preprocessing (question type, focus, removal of useless words, etc.): movies directed by Garry Marshall was Julia Roberts starring in
Remaining stages: phrase detection & resource mapping & feature extraction; MLN joint disambiguation; query graph construction; query generation (SPARQL statement)

Framework: phrase detection & resource mapping & feature extraction
Preprocessed question: movies directed by Garry Marshall was Julia Roberts starring in
Resource mapping candidates and structure matching candidates, passed to the MLN as predicates:
>hascandidateresource
    "movies" "dbo:film"
    "directed by" "dbo:director"
    "by" "dbo:publisher"
    "Garry Marshall" "dbr:garry_marshall"
>hasheadpos
    "movies" "NNS"
    "directed" "VBN"
    "Garry Marshall" "NNP"
>hasdeppath
    "movies" "directed by" "nsubj-prep"
    "directed by" "Garry Marshall" "pobj-nn"
>hasresourcetype
    "dbo:film" "Concept"
    "dbo:director" "Property"
    "dbr:garry_marshall" "concept"
>istypecompatible
    "dbo:film" "dbo:director" "1_1"
    "dbo:director" "dbr:garry_marshall" "2_1"

Framework: MLN joint disambiguation
Preprocessed question: movies directed by Garry Marshall was Julia Roberts starring in
Resource mapping results and structure matching results, inferred from the candidate predicates (hascandidateresource, hasheadpos, hasdeppath, hasresourcetype, istypecompatible):
>hasresource
    "movies" "dbo:film"
    "directed by" "dbo:director"
    "Garry Marshall" "dbr:garry_marshall"
    "Julia Roberts" "dbr:julia_roberts"
    "starring in" "dbo:starring"
>hasrelation
    "dbo:film" "dbo:director" "1_1"
    "dbo:film" "dbo:starring" "1_1"
    "dbo:director" "dbr:garry_marshall" "2_1"
    "dbr:julia_roberts" "dbo:starring" "1_2"


Framework: query graph construction and query generation
Input question: In which movies directed by Garry Marshall was Julia Roberts starring?
The hasresource and hasrelation results are assembled into a query graph, from which the SPARQL statement is generated:
    SELECT DISTINCT ?uri WHERE {
      ?uri rdf:type dbo:film .
      ?uri dbo:starring res:julia_roberts .
      ?uri dbo:director res:garry_marshall .
    }

Experiments
Questions: three collections of questions from QALD
    QALD1, QALD3, QALD4
Linked Data: DBpedia, YAGO
MLN: thebeast toolkit
    Inference algorithm: cutting plane approach [3]
    Weight learning algorithm: MIRA
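For readers unfamiliar with MIRA, the following is a minimal sketch of a single-constraint, clipped MIRA-style update. It assumes dense feature vectors and a scalar loss; the slide only names the algorithm, so the exact variant used by thebeast may differ, and the function name `mira_update` and the parameter `C` are illustrative choices.

```python
def mira_update(weights, gold_feats, pred_feats, loss, C=1.0):
    """One clipped MIRA-style step (illustrative; not thebeast's own code).

    Moves the weights just far enough that the gold structure outscores the
    predicted one by at least `loss`, with the step size capped at C.
    """
    diff = [g - p for g, p in zip(gold_feats, pred_feats)]
    norm_sq = sum(d * d for d in diff)
    if norm_sq == 0.0:
        # Gold and prediction have identical features: nothing to learn.
        return list(weights)
    margin = sum(w * d for w, d in zip(weights, diff))
    tau = min(C, max(0.0, (loss - margin) / norm_sq))
    return [w + tau * d for w, d in zip(weights, diff)]


# Toy example: two features, prediction differs from gold in both.
w0 = [0.0, 0.0]
w1 = mira_update(w0, gold_feats=[1.0, 0.0], pred_feats=[0.0, 1.0], loss=1.0)
print(w1)
```

After the update the gold structure outscores the prediction by exactly the loss, so repeating the same update leaves the weights unchanged.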

Effect of Pattern Learning

Effect of Joint Inference


Ours vs. state-of-the-art

Conclusion and Future Work
Conclusion
    Pattern learning is needed for parsing a question over large-scale linked data
    Joint inference is effective for improving the performance of natural language question answering
Future work
    Scaling up to multiple interlinked knowledge bases
    Labeled data is insufficient to build up a robust model
    More robust solutions for finding the implicit properties in questions

Reference: Question Answering over Linked Data Using Markov First-order Logic. To appear in Proceedings of EMNLP 2014, Doha, Qatar, October 25-29.
Thanks
Email: kliu@nlpr.ia.ac.cn
刘康_自动化所
Homepage: http://www.nlpr.ia.ac.cn/cip/liukang.htm