Microsoft PowerPoint - ANM08 - fast indexing [Read-Only]

Fast Statistical Relationship Discovery in Massive Monitoring Data
Hui Zhang, Haifeng Chen, Guofei Jiang, Xiaoqiao Meng, Kenji Yoshihira
NEC Laboratories America, Princeton, NJ
NEC Laboratories America, Inc.

Technical challenges in service network management
Difficult to characterize/model systems
  Heterogeneity: mixture of various software, hardware, networking, etc.
  Dynamicity: user behavior, frequent system upgrades and changes, uncertainties such as caching, etc.
  Scale: large number of software and hardware components; widely distributed; segmented administrations.
Difficult to characterize/model faults
  Diversity: operator faults, software faults, hardware faults, network faults, etc.
  A fault often manifests itself differently.
Difficult to generalize common knowledge across different systems

Invariants of dynamic systems
[Figure: user traffic flows through the target system; internal measurements m_1, m_2, ..., m_n are collected at various points. Any constant relationship among them?]
Flow intensity: the intensity with which internal monitoring data reacts to the volume of user traffic.
User traffic flows through the system endlessly, and many internal monitoring data react to the volume of user traffic accordingly.
We search for relationships among these internal measurements collected at various points.
If the modeled relationships continue to hold all the time, they can be regarded as invariants of the system.

Modeling local properties
It is difficult to characterize a large, dynamic, and complex system in a holistic way.
Rather than modeling the whole system, we model many local relationships among monitoring data: divide and conquer.
1. Each invariant is able to capture some local properties of its related components.
2. Discovering a large number of invariants enables us to characterize the whole system from different perspectives.
3. Interpret the system's operational status by tracking any changes of invariants.

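One example in model library
We use an AutoRegressive model with eXogenous inputs (ARX) to learn the relationship between two flow intensity measurements:
$$y(t) + a_1 y(t-1) + \cdots + a_n y(t-n) = b_0 u(t-k) + \cdots + b_m u(t-k-m)$$
Define
$$\theta = [a_1, \ldots, a_n, b_0, \ldots, b_m]^T$$
$$\varphi(t) = [-y(t-1), \ldots, -y(t-n), u(t-k), \ldots, u(t-k-m)]^T$$
so that
$$y(t) = \varphi(t)^T \theta$$
Given a sequence of real observations
$$O_N = \{u(1), y(1), \ldots, u(N), y(N)\}$$
using LSM (the least squares method), we learn the model by minimizing
$$E_N(\theta, O_N) = \frac{1}{N} \sum_{t=1}^{N} \left( y(t) - \varphi(t)^T \theta \right)^2$$
which gives
$$\hat{\theta}_N = \left[ \sum_{t=1}^{N} \varphi(t) \varphi(t)^T \right]^{-1} \sum_{t=1}^{N} \varphi(t) y(t)$$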
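The least-squares fit of the ARX model can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' implementation: the function name `fit_arx`, its parameters, and the use of NumPy's `lstsq` solver are all choices made here for the example.

```python
import numpy as np

def fit_arx(u, y, n=1, m=1, k=0):
    """Least-squares estimate of theta = [a_1..a_n, b_0..b_m] for the model
    y(t) + a_1 y(t-1) + ... + a_n y(t-n) = b_0 u(t-k) + ... + b_m u(t-k-m).
    Hypothetical helper written for illustration only."""
    u = np.asarray(u, dtype=float)
    y = np.asarray(y, dtype=float)
    start = max(n, k + m)  # first index t with a complete regressor
    rows = []
    for t in range(start, len(y)):
        past_y = [-y[t - i] for i in range(1, n + 1)]     # -y(t-1) .. -y(t-n)
        past_u = [u[t - k - j] for j in range(0, m + 1)]  # u(t-k) .. u(t-k-m)
        rows.append(past_y + past_u)
    phi = np.array(rows)
    target = y[start:]
    # Solves min ||phi @ theta - target||^2, i.e. the normal equations
    theta, *_ = np.linalg.lstsq(phi, target, rcond=None)
    return theta, phi, target
```

The returned `phi @ theta` gives the model's one-step predictions for `target`, which is what a goodness-of-fit measure would be computed on.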

Fitness score of a model
A fitness function can be used to evaluate how well the learned model fits the real data:
$$F(\theta) = \left[ 1 - \sqrt{\frac{\sum_{t=1}^{N} |y(t) - \hat{y}(t|\theta)|^2}{\sum_{t=1}^{N} |y(t) - \bar{y}|^2}} \right] \times 100$$
where $\hat{y}(t|\theta)$ is the model output and $\bar{y}$ is the mean of the observations $y(t)$.
Many other relationships can be modeled in a similar way.
The ARX model is only one example of the models that can be used to describe the relationship between flow intensity measurements.
  Multiple-input, multiple-output relationships can be described by ARX.
  Other linear and nonlinear models can describe a dynamic relationship between inputs and outputs.
The goal is to find a model that captures the dynamic relationship well, i.e., the model that best fits the input-output observation data under all scenarios.
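As a small illustration, the fitness score can be computed directly from the observations and the model's predictions. The sketch assumes the normalized form F = [1 - sqrt(sum |y - y_hat|^2 / sum |y - mean(y)|^2)] x 100; the function name `fitness_score` is hypothetical.

```python
import math

def fitness_score(y, y_hat):
    """F = [1 - sqrt(sum|y - y_hat|^2 / sum|y - mean(y)|^2)] * 100.
    Illustrative helper; 100 means a perfect fit, 0 means no better
    than always predicting the mean of the observations."""
    mean_y = sum(y) / len(y)
    err = sum((a - b) ** 2 for a, b in zip(y, y_hat))
    spread = sum((a - mean_y) ** 2 for a in y)
    if spread == 0:
        raise ValueError("observations are constant; score is undefined")
    return (1.0 - math.sqrt(err / spread)) * 100.0
```

Note that the score can go negative when the model predicts worse than the mean, so a threshold on it should be read as a lower bound on acceptable fit.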

Confidence score
Compute the fitness score to evaluate how well the model fits the data of each time window.
Define a function and a threshold $\tilde{F}$ to determine whether the model fits the data or not:
$$f(F_i(\theta)) = \begin{cases} 1, & \text{if } F_i(\theta) > \tilde{F} \\ 0, & \text{if } F_i(\theta) \le \tilde{F} \end{cases}$$
Compute the confidence score
$$p_k(\theta) = \mathrm{prob}\left( F(\theta) > \tilde{F} \right) = \frac{1}{k} \sum_{i=1}^{k} f(F_i(\theta))$$
Stop testing those models with $p_k(\theta) < P$ after a period of time.
The score $p_k(\theta)$ represents how robust a relationship is. It can be integrated well into the detection and diagnosis process.
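The confidence score is simply the fraction of time windows in which the fitness score exceeds the threshold. A minimal sketch, with the hypothetical name `confidence_score` and the per-window fitness scores assumed to be already computed:

```python
def confidence_score(fitness_scores, threshold):
    """p_k = (1/k) * sum_i f(F_i), where f(F_i) = 1 if F_i > threshold else 0.
    fitness_scores holds one fitness score per time window (k windows)."""
    if not fitness_scores:
        raise ValueError("need at least one time window")
    hits = sum(1 for f in fitness_scores if f > threshold)
    return hits / len(fitness_scores)
```

For example, `confidence_score([90, 40, 80, 70], 50)` counts three of four windows above the threshold; a model whose score stays below the cut-off P would be dropped from further testing.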

Fault detection and localization
Approach
1. Collect the monitoring data from the target system.
2. Model the normal behavior of the underlying system.
3. Use the learned models to detect anomalies.
4. Correlate the anomalies to locate the faulty components.
[Figure: workflow from the target system through (1) collecting raw data, (2) building and updating models in the model library, (3) using the models in a detector that compares them against raw data to decide failure or not, and (4) correlating anomalies in a localizer to identify the faulty components.]

A graph model of the relationship discovery process
Discover the set E in a graph G(V, E).
When |V| increases, the computation overhead increases quickly: $|V|(|V|-1)/2$ tests.
Solution 1: providing enough computing resources.
Solution 2: applying domain knowledge to cluster the measurement data sets and discover relationships only within individual clusters.
Solution 3: guided discovery with fast indexing.

The Guided Relationship Discovery
1. Indexing phase: apply one fast indexing algorithm to generate an approximate vertex rank based on the degree.
   Decide the computation budget for the rank estimation process, which is either specified directly by the administrator or calculated based on the estimation accuracy requirements.
2. Discovery phase: repeat picking one vertex in the order of the rank, testing its relationship with the remaining vertices, and removing it from the graph, until either the overall computation budget runs out or the graph runs out of vertices.
   This is a greedy heuristic for the vertex cover problem, which is NP-complete.
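The discovery phase can be sketched as a loop over the ranked vertices. This is an illustrative reading of the slide, not the authors' code: `guided_discovery`, the `test(x, y)` callback standing in for a relationship test (e.g., an ARX fit plus fitness threshold), and counting the budget in number of tests are all assumptions made here.

```python
def guided_discovery(rank, test, budget):
    """rank: vertices ordered by estimated degree (highest first).
    test(x, y) -> bool: hypothetical pairwise relationship test.
    budget: maximum number of relationship tests to spend."""
    remaining = list(rank)
    edges = set()
    used = 0
    while remaining and used < budget:
        x = remaining.pop(0)      # next vertex in rank order, removed from the graph
        for y in remaining:       # test it against every vertex still in the graph
            if used >= budget:
                break
            used += 1
            if test(x, y):
                edges.add(frozenset((x, y)))
    return edges, used
```

On a star-shaped graph, a rank that puts the hub first finds every edge with the first vertex's tests, while a rank that puts it last spends the same budget on leaves; that gap is what the indexing phase is meant to close.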

Rank estimation algorithm 1: Uniform Sampling
1. Keep one counter for each vertex, all initially set to 0.
2. Randomly pick two different vertices x and y from the graph with uniform probability, and apply a relationship test. If the pair (x, y) has been picked before, the test is skipped and the cached result is used for the next step.
3. If the test result is positive, increase the counters of x and y by 1.
4. Go back to step 2 until it has been repeated k times (k: computation budget).
5. Output a rank on all vertices based on the counter values; a tie is broken with a random choice.
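The uniform sampling steps translate fairly directly into code. A minimal sketch, assuming a hypothetical `test(x, y)` callback for the relationship test; the function name and the shuffle-then-stable-sort trick for random tie-breaking are choices made for this example.

```python
import random

def uniform_sampling_rank(vertices, test, k, rng=None):
    """Estimate a degree-based vertex rank from k uniformly sampled pairs."""
    rng = rng or random.Random()
    vertices = list(vertices)
    counters = {v: 0 for v in vertices}      # step 1: counters start at 0
    cache = {}                               # results of pairs already tested
    for _ in range(k):                       # step 4: repeat k times
        x, y = rng.sample(vertices, 2)       # step 2: two distinct vertices, uniform
        pair = frozenset((x, y))
        if pair not in cache:
            cache[pair] = test(x, y)         # run the test only for a new pair
        if cache[pair]:                      # step 3: positive result (fresh or cached)
            counters[x] += 1
            counters[y] += 1
    order = vertices[:]
    rng.shuffle(order)                       # random tie-breaking: shuffle first,
    order.sort(key=lambda v: counters[v], reverse=True)  # then a stable sort
    return order, counters
```

Because cached results still increment the counters, a frequently re-drawn edge costs no extra tests but keeps contributing to the estimate.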

Correct Ranking Probability (CRP)
Measures the ranking accuracy between vertex x and vertex y in the estimation.
For the US algorithm, we have
[Equation not preserved in the transcript.]

Rank estimation algorithm 2: Adaptive Sampling
1. Keep one counter for each vertex, all initially set to 1.
2. Randomly pick two different vertices x and y from the graph with probability proportional to their counter values, and apply a relationship test. If the pair (x, y) has been picked before, the test is skipped and the cached result is used for the next step.
3. If the test result is positive, increase the counters of x and y by 1.
4. If the counter value of x is larger than a threshold (e.g., half of the vertex set size), remove x from the vertex set in the following sampling process.
5. Go back to step 2 until it has been repeated k times (k: computation budget).
6. Output a rank on all vertices based on the counter values; a tie is broken with a random choice.
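A sketch of the adaptive variant follows. Several details are assumptions made for illustration rather than taken from the slides: the pair is drawn by two sequential weighted draws, the removal rule in step 4 is applied to both x and y, and sampling stops early if fewer than two vertices remain. The `test(x, y)` callback and the function name are hypothetical.

```python
import random

def adaptive_sampling_rank(vertices, test, k, threshold=None, rng=None):
    """Estimate a vertex rank with counter-proportional pair sampling."""
    rng = rng or random.Random()
    vertices = list(vertices)
    if threshold is None:
        threshold = len(vertices) / 2        # the slide's example threshold
    counters = {v: 1 for v in vertices}      # step 1: counters start at 1
    active = vertices[:]                     # vertices still eligible for sampling
    cache = {}
    for _ in range(k):                       # step 5: repeat k times
        if len(active) < 2:                  # assumption: stop when no pair is left
            break
        # Step 2 (assumed form): draw x, then y != x, each proportional to counters
        x = rng.choices(active, weights=[counters[v] for v in active])[0]
        others = [v for v in active if v != x]
        y = rng.choices(others, weights=[counters[v] for v in others])[0]
        pair = frozenset((x, y))
        if pair not in cache:
            cache[pair] = test(x, y)
        if cache[pair]:                      # step 3
            counters[x] += 1
            counters[y] += 1
        # Step 4 (applied to both endpoints here): drop high-counter vertices
        active = [v for v in active if counters[v] <= threshold]
    order = vertices[:]
    rng.shuffle(order)                       # random tie-breaking via stable sort
    order.sort(key=lambda v: counters[v], reverse=True)
    return order, counters
```

Removing a vertex once its counter passes the threshold keeps an already-identified hub from absorbing the rest of the sampling budget.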

Evaluation
Data sets: a collection of monitoring data from an operational UTRAN (UMTS Terrestrial Radio Access Network) system.
129 Key Performance Indicator (KPI) data sets for each monitoring period.
The ARX linear regression model was used to test the correlation relationships between those KPI data sets.
By choosing different correlation significance thresholds, different relationship graphs were generated on the same data set collection.

Evaluation
The guided relationship discovery on the generated graphs was run as follows:
The Indexing Phase had a computation budget of kn tests (k = 4, 8 and n = 129).
Four rank estimation schemes were compared: uniform sampling, adaptive sampling, optimal ranking, and random ranking.
The Discovery Phase followed the output rank and was stopped when the graph ran out of vertices.
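As a rough worked check of what the indexing budget means at this scale, the budget can be compared with the number of unordered pairs among the n = 129 data sets. The comparison assumes exhaustive discovery would test every pair exactly once; it is an interpretation, not a figure reported on the slides.

```latex
\begin{align*}
\text{exhaustive pairwise tests} &= \frac{n(n-1)}{2} = \frac{129 \cdot 128}{2} = 8256 \\
\text{indexing budget, } k = 4 &: \quad kn = 516 \quad (6.25\% \text{ of } 8256) \\
\text{indexing budget, } k = 8 &: \quad kn = 1032 \quad (12.5\% \text{ of } 8256)
\end{align*}
```

These counts cover the Indexing Phase only; the tests spent in the Discovery Phase come on top.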

A dense relationship graph tested (avg. degree = 32.6)

Uniform Sampling algorithm, UTRAN dense graph.
With 8n estimation tests, the guided discovery with the US algorithm covered 80% (90%) of the edges within 64 (85) vertices;
the optimal discovery required 57 (81) vertices to cover 80% (90%) of the edges;
the random discovery required 110 (119) vertices to cover 80% (90%) of the edges.

A sparse relationship graph tested (avg. degree = 2.3)

Adaptive Sampling algorithm, UTRAN sparse graph.
With 8n estimation tests, the guided discovery with the AS algorithm covered 80% (89%) of the edges within 24 (30) vertices;
the optimal discovery required 22 (28) vertices to cover 80% (90%) of the edges;
the random discovery required 92 (106) vertices to cover 80% (90%) of the edges.

Conclusions & future work
We studied the problem of fast statistical relationship discovery in massive monitoring data.
A guided discovery scheme with two simple sampling-based rank estimation algorithms was proposed to enable fast and partial relationship discovery.
Future work
  The properties of the adaptive sampling algorithm in terms of the correct ranking probability.
  Analysis and experiments on more real-world data and synthetic topologies.
  The optimal allocation between estimation time and discovery time in the guided discovery process.
  Guided discovery with a hybrid indexing and discovery process.