884 42 nary pattern, LBP).,,.,,. Krizhevsky [1] ILSVRC-2012,, SIFT.,, Lopes [2],, Extended CohnKanade (CK+) [3].,,,, CK+ : 1), Wild,. 1.,. 1 Fig. 1 CK



Similar documents
2 3. 1,,,.,., CAD,,,. : 1) :, 1,,. ; 2) :,, ; 3) :,; 4) : Fig. 1 Flowchart of generation and application of 3D2digital2building 2 :.. 3 : 1) :,

报 告 1: 郑 斌 教 授, 美 国 俄 克 拉 荷 马 大 学 医 学 图 像 特 征 分 析 与 癌 症 风 险 评 估 方 法 摘 要 : 准 确 的 评 估 癌 症 近 期 发 病 风 险 和 预 后 或 者 治 疗 效 果 是 发 展 和 建 立 精 准 医 学 的 一 个 重 要 前

Sep (SCI) 10. Jiann-Ming Wu, Annealing by two sets of interactive dynamics, IEEE Trans. on Systems Man and Cybernetics Part B-Cybernetics 34 (3)

2015 年 第 24 卷 第 11 期 计 算 机 系 统 应 用 历 的 主 体 部 分 多 以 非 结 构 化 的 文 本 形 式 存 储, 很 多 研 究 只 能 基 于 有 限 的 结 构 化 数 据 进 行 [4,5], 无 法 满 足 临

untitled

,,.,, : 1),,,,, 2),,,,, 3),,,,,,,,,, [6].,,, ( ),, [9], : 1), 2),,,,, 3),,, 2.,, [10].,,,,,,,,, [11]. 2.1,, [12],, ;, ; Fig. 1 1 Granular hier

Microsoft Word - A doc

untitled

[1] [4] Chetverikov Lerch[8,12] LeaVis CAD Limas-Serafim[6,7] (multi-resolution pyramids) 2 n 2 n 2 2 (texture) (calf leather) (veins)

13-4-Cover-1

填 写 要 求 一 以 word 文 档 格 式 如 实 填 写 各 项 二 表 格 文 本 中 外 文 名 词 第 一 次 出 现 时, 要 写 清 全 称 和 缩 写, 再 次 出 现 时 可 以 使 用 缩 写 三 涉 密 内 容 不 填 写, 有 可 能 涉 密 和 不 宜 大 范 围 公

University of Science and Technology of China A dissertation for master s degree Research of e-learning style for public servants under the context of

Microsoft Word - A _ doc

2 ( 自 然 科 学 版 ) 第 20 卷 波 ). 这 种 压 缩 波 空 气 必 然 有 一 部 分 要 绕 流 到 车 身 两 端 的 环 状 空 间 中, 形 成 与 列 车 运 行 方 向 相 反 的 空 气 流 动. 在 列 车 尾 部, 会 产 生 低 于 大 气 压 的 空 气 流

一般社団法人電子情報通信学会 信学技報 THE INSTITUTE OF ELECTRONICS, IEICE Technical Report INFORMATION THE INSTITUTE OF AND ELECTRONICS, COMMUNICATION ENGINEERS IEICE L

标题

RPN 2 DeepParts 22 part pool 2 3 HOG 12 LBP 13 Harr - like 15 DPM 16 Deformable Parts Model VGG16 X. Wang 14 VGG Convolutiona

2 : 143, l 1,., Theis [12] Washizawa [14], ((a) ; (b) ) Fig. 1 Four speech signals mixed into three mixtures ((a) Scatter plot of three mixture

698 39,., [6].,,,, : 1) ; 2) ,, 14,, [7].,,,,, : 1) :,. 2) :,,, 3) :,,,., [8].,. 1.,,,, ,,,. : 1) :,, 2) :,, 200, s, ) :,.

1 引言

附3

2006產業管理創新研討會論文格式說明

Vol. 22 No. 4 JOURNAL OF HARBIN UNIVERSITY OF SCIENCE AND TECHNOLOGY Aug GPS,,, : km, 2. 51, , ; ; ; ; DOI: 10.

泰迪杯全国数据挖掘挑战赛 OCR (CNN) OCR() CNN % 92.1% 15% 90%. Viterbi. OCR..,,,,,

(Pattern Recognition) 1 1. CCD

<A448A4E5AAC0B77CBEC7B3F8B2C43132A8F7B2C434B4C15F E706466>

物理学报 Acta Phys. Sin. Vol. 62, No. 14 (2013) 叠 [4]. PET 设备最重要的部件就是探测器环, 探测 备重建图像具有减少数据插值的优势. 器环的性能直接影响 PET 的成像能力. 探头与探头 之间得到的符合直线叫做投影线. 所有的投影线在

g 100mv /g 0. 5 ~ 5kHz 1 YSV8116 DASP 1 N 2. 2 [ M] { x } + [ C] { x } + [ K]{ x } = { f t } 1 M C K 3 M C K f t x t 1 [ H( ω )] = - ω 2

标题

第 2 期 王 向 东 等 : 一 种 运 动 轨 迹 引 导 下 的 举 重 视 频 关 键 姿 态 提 取 方 法 257 竞 技 体 育 比 赛 越 来 越 激 烈, 为 了 提 高 体 育 训 练 的 效 率, 有 必 要 在 体 育 训 练 中 引 入 科 学 定 量 的 方 法 许 多

Microsoft Word tb 赵宏宇s-高校教改纵横.doc

Vol. 15 No. 1 JOURNAL OF HARBIN UNIVERSITY OF SCIENCE AND TECHNOLOGY Feb O21 A

Microsoft Word - 33-p skyd8.doc

SVM OA 1 SVM MLP Tab 1 1 Drug feature data quantization table

2013国际营销科学与信息技术大会(MSIT2013)

從實驗教材到官方課程──小學社會科板橋模式教材與改編本教科書的發展

~ ~

14-1-人文封面

(單位名稱)大事記---96學年度(96

,.,,.. :,, ,:, ( 1 ). Π,.,.,,,.,.,. 1 : Π Π,. 212,. : 1)..,. 2). :, ;,,,;,. 3

6 : 835, RGBD.,.,,,,., (Mean shift) OpenCV.,.,,.. 1. Fig. 1 1 The framework of video object tracking.,,,.,.,,,,. 1.1,.,, : 1) (Mean shift),. [1],,, Bh

* CUSUM EWMA PCA TS79 A DOI /j. issn X Incipient Fault Detection in Papermaking Wa

Microsoft Word - 专论综述1.doc

室内设计2015年第4期.indd

Outline Speech Signals Processing Dual-Tone Multifrequency Signal Detection 云南大学滇池学院课程 : 数字信号处理 Applications of Digital Signal Processing 2

Microsoft Word - chnInfoPaper6

IT 36% Computer Science Teachers Association, CSTA K K-12 CSTA K-12 K-12 K-6 K6-9 K STEM STEM STEM

江苏省高等学校

Chinese Journal of Applied Probability and Statistics Vol.25 No.4 Aug (,, ;,, ) (,, ) 应用概率统计 版权所有, Zhang (2002). λ q(t)

續論

JOURNAL OF DISTANCE EDUCATION International View [1] ( ) ; (NITRD) (Basic R&D) (Cross-cutting R&D Foundations) ( ) ( 1) (1) ;(2) ;(3) ;(4) ;(5)

穨423.PDF

荨荨 % [3] [4] 86%( [6] 27 ) Excel [7] 27 [8] 2 [9] K2 [2] ; Google group+ 5 Gmail [2] 2 fxljwcy 3E [22] 2 2 fxljzrh 2D [23] 3 2 fxzphjf 3D 35

Microsoft Word - 31空中大學校稿檔.doc

48 Computer Education 课 程 体 系 设 置 2.1 科 学 设 置 培 养 方 案 课 程 模 块, 确 定 培 养 方 向 首 先, 我 们 通 过 对 人 才 市 场 需 求 分 析, 确 定 了 专 业 培 养 目 标 然 后, 根 据 教 育 部 高 等

Dan Buettner / /

~ 10 2 P Y i t = my i t W Y i t 1000 PY i t Y t i W Y i t t i m Y i t t i 15 ~ 49 1 Y Y Y 15 ~ j j t j t = j P i t i = 15 P n i t n Y

Microsoft Word - 专论综述1.doc

[1] Liu Hongwei,2013, Study on Comprehensive Evaluation of Iron and Steel Enterprises Production System s Basic Capacities, International Asia Confere

2013 年 7 月 总 第 235 期 主 办 单 位 : 中 国 科 学 院 自 动 化 研 究 所 CONTENTS 中 国 科 学 院 自 动 化 研 究 所 所 刊 卷 首 语 节 令 是 一 种 命 令 毕 淑 敏 1 聚 焦 CASIA 自 动 化 所 召 开 庆 祝 建 党 92 周

1 SIGMA Lab Sydney IntelliGent MultimediA Laboratory Electrical and Information Engineering department EIE Ph.D. M.Phil. 1.QS (Associate Profess

[29] a N d N b 2 d sin θ N b ФФ a b Ф Ф θ θ a b Fig.1 Working principle demonstration of a phased-array antenna θ

MAXQ BA ( ) / 20

Your Paper's Title Starts Here: Please Center

與 所 長 有 約 座 談 會 之 師 生 合 影 與 所 長 有 約 座 談 會 之 生 分 組 討 論 五 103 年 6 月 9 日 ( 一 ) 舉 辦 新 生 座 談 會, 並 將 新 編 印 完 成 之 新 生 手 冊 發 予 新 生 與 本 所 教 師 新 生 座 談 會 之 師 生 合

2/80 2

环境指标

j.si

本研究主要关注的是中国内地东北地区一个省会城市中的中低收入社区内居民使用手机的情况,廓清手机这种信息沟通技术对于社区居民的影响

理 成 可 做 關 聯 分 析 的 格 式, 再 應 用 統 計 統 計 計 算 軟 體 R (R Core Team, 2013) 中 的 延 伸 套 件 arules (Hahsler, Gruen, and Hornik, 2005; Hahsler, Buchta, Gruen, and H

,, [1 ], [223 ] :, 1) :, 2) :,,, 3) :,, ( ),, [ 6 ],,, [ 3,728 ], ; [9222 ], ;,,() ;, : (1) ; (2),,,,, [23224 ] ; 2,, x y,,, x y R, ( ),,, :

word2vec 8-10 GloVe 11 Word2vec X king - X man X queen - X woman Recurrent Neural Network X shirt - X clothing X chair - X furniture 2 n-gra

Microsoft Word doc

4 115,,. : p { ( x ( t), y ( t) ) x R m, y R n, t = 1,2,, p} (1),, x ( t), y ( t),,: F : R m R n.,m, n, u.,, Sigmoid. :,f Sigmoid,f ( x) = ^y k ( t) =

a b

untitled

M M. 20

第壹章

关于2007年硕士研究生培养方案修订几点要求的说明

MIT 4 MIT 2014 / MIT 30% MIT 2 MIT 20% Investor's Business Daily MIT MIT 3 CNKI

附件4

44(1) (1) (4) (4) 63-88TSSCI Liu, W. Y., & Teele S. (2009). A study on the intelligence profile

Improving the Effectiveness of the Training of Civil Service by Applying Learning Science and Technology: The Case Study of the National Academy of Ci

08_729.dvi

2 207 Manuel Castells

CH01.indd

強化信用卡電子交易安全之研究

United Nations ~ ~ % 2010

Microsoft Word 記錄附件

标题

34 22 f t = f 0 w t + f r t f w θ t = F cos p - ω 0 t - φ 1 2 f r θ t = F cos p - ω 0 t - φ 2 3 p ω 0 F F φ 1 φ 2 t A B s Fig. 1

1 科 学 谋 划, 有 序 促 进 扶 贫 工 作 的 持 续 发 展 1.1 科 学 定 位, 精 准 发 现 地 方 的 需 求 按 照 国 家 生 态 功 能 区 的 划 分, 库 伦 旗 属 重 点 生 态 保 护 开 发 区 这 里 生 态 环 境 优 良 特 色 作 物 资 源 优 势

交流活动

一 课 程 负 责 人 情 况 姓 名 吴 翊 性 别 男 出 生 年 月 基 本 信 息 学 位 硕 士 职 称 教 授 职 务 所 在 院 系 理 学 院 数 学 与 系 统 科 学 系 电 话 研 究 方 向 数 据 处 理 近 三 年 来

小论文草稿2_邓瀚

<4D F736F F D20C9CFBAA3BFC6BCBCB4F3D1A7D0C5CFA2D1A7D4BA C4EAC7EFBCBEC8EBD1A7B2A9CABFD7CAB8F1BFBCCAD4CAB5CAA9CFB8D4F22D C8B7B6A8B8E5>

<4D F736F F D D DBACEC0F25FD0A3B6D4B8E55F2DB6FED0A32D2D2DC8A5B5F4CDBCD6D0B5C4BBD8B3B5B7FBBAC52E646F63>

增 刊 谢 小 林, 等. 上 海 中 心 裙 房 深 大 基 坑 逆 作 开 挖 设 计 及 实 践 745 类 型, 水 位 埋 深 一 般 为 地 表 下.0~.7 m 场 地 地 表 以 下 27 m 处 分 布 7 层 砂 性 土, 为 第 一 承 压 含 水 层 ; 9 层 砂 性 土

PCA+LDA 14 1 PEN mL mL mL 16 DJX-AB DJ X AB DJ2 -YS % PEN

[9] R Ã : (1) x 0 R A(x 0 ) = 1; (2) α [0 1] Ã α = {x A(x) α} = [A α A α ]. A(x) Ã. R R. Ã 1 m x m α x m α > 0; α A(x) = 1 x m m x m +

Transcription:

42 6 Vol. 42, No. 6 2016 6 ACTA AUTOMATICA SINICA June, 2016 ROI-KNN 1 1 1, 2.,.,, (Region of interest, ROI) K (K-nearest neighbors, KNN), ROI-KNN,,,,. DOI,,,,,. ROI-KNN., 2016, 42(6): 883 891 10.16383/j.aas.2016.c150638 Facial Expression Recognition Using ROI-KNN Deep Convolutional Neural Networks SUN Xiao 1 PAN Ting 1 REN Fu-Ji 1, 2 Abstract Deep neural networks have been proved to be able to mine distributed representation of data including image, speech and text. By building two models of deep convolutional neural networks and deep sparse rectifier neural networks on facial expression dataset, we make contrastive evaluations in facial expression recognition system with deep neural networks. Additionally, combining region of interest (ROI) and K-nearest neighbors (KNN), we propose a fast and simple improved method called ROI-KNN for facial expression classification, which relieves the poor generalization of deep neural networks due to lacking of data and decreases the testing error rate apparently and generally. The proposed method also improves the robustness of deep learning in facial expression classification. Key words Convolution neural networks, facial expression recognition, model generalization, prior knowledge Citation Sun Xiao, Pan Ting, Ren Fu-Ji. Facial expression recognition using ROI-KNN deep convolutional neural networks. Acta Automatica Sinica, 2016, 42(6): 883 891.,, 2015-10-12 2016-04-01 Manuscript received October 12, 2015; accepted April 1, 2016 (61432004), (1508085QF119), (NLPR201407345), (2015M580532), 2015 (2015cxcys109) Supported by Key Program of National Natural Foundation Science of China (61432004), the Natural Science Foundation of Anhui Province (1508085QF119), Open Project Program of the National Laboratory of Pattern Recognition (NLPR201407345), China Postdoctoral Science Foundation (2015M580532), and National Training Program of Innovation and Entrepreneurship for HFUT Undergraduates (2015cxcys109) Recommended by Associate Editor KE Deng-Feng 1. 230009 2. 7708500 1. School of Computer and Information, Hefei University of Technology, Hefei 230009, China 2. Department of Information Science and Intelligent Systems, Faculty of Engineering, Tokushima University, Tokushima 7708500, Japan,.,., ( )., ( ),,.,,,. : ( )..,. (Principal component analysis, PCA),, (Scale-invariant feature transform, SIFT) Haar (Local bi-

884 42 nary pattern, LBP).,,.,,. Krizhevsky [1] ILSVRC-2012,, SIFT.,, Lopes [2],, Extended CohnKanade (CK+) [3].,,,, CK+ : 1), Wild,. 1.,. 1 Fig. 1 CK+ Wild Samples from CK+ and Wild 2) CK+ 593,, 100., Lopes [2] 30, ( ),. 60 k ( ), MNIST Cifar10., CK+., CK+ (95 %),. 1. 2. 3 CK+ 1 https://github.com/neopenx/ Wild,. 4. Theano Github 1. 1 1.1, Bishop [4] : p(t x, t, α, β) = N(t m T NΦ(x), σn(x)) 2 (1) N m T NΦ(x) = y(x, m N ) = kernel(x, x n )t n (2) n=1 (1) t, t x, β α. (2) ( Smooth ). x x n.,, t t n,. Bengio [5], (Support vector machine, SVM), K (K-nearest neighbors, KNN),, (Smoothness-prior)., (Local representation),,.,,., SIFT Haar LBP PCA, 2,,,. 1.2 LeCun [6] 1990, 3. Fukushima [7], Rumelhart [8], [9]., Smooth,.

6 : ROI-KNN 885 Fig. 3 3 Fig. 2 1.2.1 2 Manifold side of input space Local connection and structure of convolutional neural network (CNN), (Locally-connection). (Fully-connection) (Dense-connection).,, [5],,. Szegedy [10] 22 GoogLeNet, ILSVRC-2014. 1.2.2 / ( ),,,., Fukushima [7],. 1.2.3 Pooling,,.,, (Max pooling) (Avg pooling).,,,. 1.3 Glorot [11] (Deep sparse rectifier neural networks), Sigmoid (logistic/tanh) ReLU. 1.3.1 Barron [12] N 1/N.,,,. Bengio [5],,,. Hubel [13], V1, V2,.,. 1.3.2 ReLU Dayan [14],, 4, 0, Sigmoid, ReLU. Attwell [15],, 1 % 4 %, 0,,,. ReLU : ReLU(x) = max(0, x) Softplus : Softplus(x) = log(1 + e x ) Softplus ReLU, [ 1, 1], (Gradient vanish),

886 42 4 ( Glorot [11] ) Fig. 4 Graphs for different activation functions from Glorot [11],, [1]., 0, L1 Regularization.,, [11]. 1.4 Dropout Hinton [16] Dropout. Dropout : 1) : x, p 0,. : DropoutT rain(x) = RandomZero(p) x,,. 2) :,., x,. : DropoutT est(x) = (1 p) x Dropout., Dropout,, Attwell [15].,,,,. Darwin [17],. 1.5 1.5.1 : ( W = Uniform 1 ) 1, N N Xavier [18] Sigmoid : ( W = Uniform 1, F in + F out ) 1 F in + F out, F in, F out. Bishop [4], N,,,. W P (W )., W. Krizhevsky [1] Hinton [16] ILSVRC- 2012, W,. 1.5.2 Krizhevsky [1] Hinton [16] ( ) 1 0,.,. 2 2.1 5, 32 32 ( 1), 3 Max pooling 1 1 Softmax., CNN-64 CNN-96 CNN-128. CNN-64: [32, 32, 64, 64] CNN-96: [48, 48, 96, 200] CNN-128: [64, 64, 128, 300]

6 : ROI-KNN 887 5 (,.) Fig. 5 Structure of DNN (? represents uncertain parameters with many candidate solutions.), p = 0.5 Dropout, L2 Regularization. Softmax, ReLU,, Max pooling. W Krizhevsky [1] (Standard deviation, STD). STD : [0.0001, 0.001, 0.001, 0.01, 0.1] Krizhevsky [1]. 2.2 6, 32 32 ( 1), 3 1 Softmax. 6 Fig. 6 Structure of deep sparse rectifier net, DNN- 1000, DNN-2000. DNN-1000: [1 000, 1 000, 1 000] DNN-2000: [2 000, 2 000, 2 000], p = 0.2 Dropout. Softmax, ReLU. W STD : [0.1, 0.1, 0.1, 0.1]., 1, 0. 2.3, 32 32 1 024.,., DNN, 128.0. (Early stopping). lr 0.01, momentum 0.9., lr,,, 0.0001, 3. 2.4 ROI-KNN Xavier [18 19],,.,,, 9 (Region of interest, ROI), 7,. 7 Fig. 7 9 ROI ( ) Nine ROI regions (cut, flip, cover, center focus) ROI,. ROI,,,.,,... ( ). ROI 9,, ROI,. ROI ROI ( ), ROI ( ). Bengio [5] : (Distributed representation),. Smooth-prior (Local representation),

888 42. ROI. ROI,. ROI.,., KNN,,,,., ROI-KNN,, 9 ROI,,. ROI-KNN Distributed representation, ROI,,. ROI ROI,, Local representation. ROI,,.. 2.5 Lopes [2],.,,. :.,,, ROI,.,,. Lopes [2],,., Wild. 3 2.1 2.2, : ROI ROI-KNN.. 3.1 CK+. 4, 500 Wild,., CK+, 1 200,, CK+,, Wild. CK+ 700 200 900. 5, 900. 300 300. 5, 300. 3.2 ROI ROI, Distributed representation. 3.1 5 4 500 5 1 500. 4 500 ROI, 4 500 9 = 40 500,. 1, ROI, ROI., ROI 4 % 5 %,.,, 25.8 %.,.,,. 3.1 :, Wild,,, Lopes [2] CK+.,, Lopes [2],,. 1 ROI (%) Table 1 Test set error rate of ROI auxiliary (%) CNN-64 4.7 32.7 54.3 33 40.3 33.3 CNN-64* 5.6 36.3 59.3 20.0 31.7 30.6 CNN-96* 5.0 36.7 53.3 20.7 24.7 28.6 CNN-128 3.3 32.0 51.0 27.0 37.7 30.2 CNN-128* 3.0 31.0 55.7 18.7 24.3 26.6 DNN-1000 3.0 37.7 65.3 38.3 36.7 36.2 DNN-1000* 2.3 39.0 52.0 30.0 31.7 31.0 DNN-2000* 2.0 43.3 55.0 24.7 32.7 31.5

6 : ROI-KNN 889 3.3 2.5,, : 1) I. CK+,,. [2], α : α N(0, 3 o ) 5, 700 11, 3.1 4 500, 5 700 11 + 4 500 = 43 000,,. 2) II. I 43 000, 3.2 40 500, 83 500,,. 3.2 ROI, 2, * I, + II+ROI, II ROI-KNN. 2 (%) Table 2 Test set error rate of rotating generated sample (%) CNN-128 3.3 32.0 51.0 27.0 37.7 30.2 CNN-128* 4.7 41.3 52.7 32.7 35.0 33.2 CNN-128+ 3.0 37.0 51.7 15.7 24.0 26.3 CNN-128ˆ 0.0 30.0 54.0 13.0 26.7 24.7 DNN-1000 3.0 37.7 65.3 38.3 36.7 36.2 DNN-1000* 1.3 39.7 62.0 37.3 42.0 36.5 DNN-1000+ 2.3 41.3 57.0 30.0 35.7 33.3 DNN-1000ˆ 1.3 43.0 67.7 31.0 33.7 35.3,., I, CNN- 128 DNN-1000 43 000, 4 500, 38 500, Wild, Lopes [2], CK+., II, ROI, ROI-KNN, DNN-1000. 3.4, ROI-KNN Distributed representation, ROI- KNN, Distributed representation.,,.,,,,,,.,,,. 3.4 ROI-KNN ROI-KNN KNN, 2.4, Distributed representation. 3, ROI, * ROI-KNN. 3 ROI-KNN (%) Table 3 Test set error rate with ROI-KNN (%) CNN-64 5.6 36.3 59.3 20.0 31.7 30.6 CNN-64* 1.0 29.7 56.0 17.0 30.0 26.7 CNN-96 5.0 36.7 53.3 20.7 24.7 28.6 CNN-96* 0.3 26.0 56.3 16.0 26.7 25.8 CNN-128 3.0 31.0 55.7 18.7 24.3 26.6 CNN-128* 0.6 22.7 57.0 12.0 26.3 23.7 DNN-1000 2.3 39.0 52.0 30.0 31.7 31.0 DNN-1000* 0.3 37.3 61.0 31.7 31.0 32.2 DNN-2000 2.0 43.3 55.0 24.7 32.7 31.5 DNN-2000* 0.3 40.0 68.0 26.3 33.3 33.6, KNN 4 % 5 %,,,.,,,.,,,. KNN ( Distributed representation),,.,,. 3.5 ROI-KNN SVM,, JAFFE, SVM PCA, CNN-128 ROI-KNN. 4, SVM,.

890 42 4 Table 4 JAFFE Comparisons on JAFFE (%) Kumbhar [20] Image feature 30 40 Lekshmi [21] SVM 13.1 Zhao [22] PCA and NMF 6.28 Zhi [23] 2D-DLPP 4.09 Lee [24] RDA 3.3 ROI-KNN+CNN 2.81 4.,,. Wild, ROI ;,, :,.,,. Regularization, L2 Regularization Dropout.,,, (Recurrent neural network, RNN). Distributed representation, SIFT,., SIFT Haar LBP,, Distributed representation,. ROI-KNN,, Distributed representation,,.,,,. Theano [25],. References 1 Krizhevsky A, Sutskever I, Hinton G E. ImageNet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems 25. Lake Tahoe, Nevada, USA: Curran Associates, Inc., 2012. 1097 1105 2 Lopes A T, de Aguiar E, Oliveira-Santos T. A facial expression recognition system using convolutional networks. In: Proceedings of the 28th SIBGRAPI Conference on Graphics, Patterns and Images. Salvador: IEEE, 2015. 273 280 3 Lucey P, Cohn J F, Kanade T, Saragih J, Ambadar Z, Matthews I. The extended Cohn-Kanade dataset (CK+): a complete dataset for action unit and emotion-specified expression. In: Proceedings of the 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). San Francisco, CA: IEEE, 2010. 94 101 4 Bishop C M. Pattern Recognition and Machine Learning. New York: Springer, 2007. 5 Bengio Y. Learning deep architectures for AI. Foundations and Trends in Machine Learning. Hanover, MA, USA: Now Publishers Inc., 2009. 1 127 6 LeCun Y, Boser B, Denker J S, Howard R E, Hubbard W, Jackel L D, Henderson D. Handwritten digit recognition with a back-propagation network. In: Proceedings of Advances in Neural Information Processing Systems 2. San Francisco, CA, USA: Morgan Kaufmann Publishers Inc., 1990. 396 404 7 Fukushima K. Neocognitron: a self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position. Biological Cybernetics, 1980, 36(4): 193 202 8 Rumelhart D E, Hinton G E, Williams R J. Learning representations by back-propagating errors. Nature, 1986, 323(6088): 533 536 9 LeCun Y, Bottou L, Bengio Y, Haffner P. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 1998, 86(11): 2278 2324 10 Szegedy C, Liu W, Jia Y Q, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A. Going deeper with convolutions. In: Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition. Boston, MA: IEEE, 2015. 1 9 11 Glorot X, Bordes A, Bengio Y. Deep sparse rectifier neural networks. In: Proceedings of the 14th International Conference on Artificial Intelligence and Statistics (AISTATS). Fort Lauderdale, FL, USA, 2011, 15: 315 323 12 Barron A R. Universal approximation bounds for superpositions of a sigmoidal function. IEEE Transactions on Information Theory, 1993, 39(3): 930 945 13 Hubel D H, Wiesel T N, LeVay S. Visual-field representation in layer IV C of monkey striate cortex. In: Proceedings of the 4th Annual Meeting, Society for Neuroscience. St. Louis, US, 1974. 264

6 : ROI-KNN 891 14 Dayan P, Abott L F. Theoretical Neuroscience. Cambridge: MIT Press, 2001. 15 Attwell D, Laughlin S B. An energy budget for signaling in the grey matter of the brain. Journal of Cerebral Blood Flow and Metabolism, 2001, 21(10): 1133 1145 25 Bastien F, Lamblin P, Pascanu R, Bergstra J, Goodfellow I J, Bergeron A, Bouchard N, Warde-Farley D, Bengio Y. Theano: new features and speed improvements. In: Conference on Neural Information Processing Systems (NIPS) Workshop on Deep Learning and Unsuper Vised Feature Learning. Lake Tahoe, US, 2012. 16 Hinton G E, Srivastava N, Krizhevsky A, Sutskever I, Salakhutdinov R R. Improving neural networks by preventing co-adaptation of feature detectors. arxiv: 1207.0580, 2012. 17 Darwin C. On the Origin of Species. London: John Murray, Albemarle Street, 1859. 18 Xavier G, Yoshua B. Understanding the difficulty of training deep feedforward neural networks. In: Proceedings of the 13th International Conference on Artificial Intelligence and Statistics (AISTATS 2010). Chia Laguna Resort, Sardinia, Italy, 2010, 9: 249 256 19 Sun Y, Wang X, Tang X. Deep learning face representation from predicting 10 000 classes. In: Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Columbus, OH: IEEE, 2014. 1891 1898 20 Kumbhar M, Jadhav A, Patil M. Facial expression recognition based on image feature. International Journal of Computer and Communication Engineering, 2012, 1(2): 117 119 21 Lekshmi V P, Sasikumar M. Analysis of facial expression using Gabor and SVM. International Journal of Recent Trends in Engineering, 2009, 1(2): 47 50 22 Zhao L H, Zhuang G B, Xu X H. Facial expression recognition based on PCA and NMF. In: Proceedings of the 7th World Congress on Intelligent Control and Automation. Chongqing, China: IEEE, 2008. 6826 6829 23 Zhi R C, Ruan Q Q. Facial expression recognition based on two-dimensional discriminant locality preserving projections. Neurocomputing, 2008, 71(7 9): 1730 1734 24 Lee C C, Huang S S, Shih C Y. Facial affect recognition using regularized discriminant analysis-based algorithms. EURASIP Journal on Advances in Signal Processing, 2010, article ID 596842(doi:10.1155/2010/596842).,.. E-mail: sunx@hfut.edu.cn (SUN Xiao Associate professor at the Institute of Affective Computing, Hefei University of Technology. His research interest covers natural language processing, affective computing, machine learning and human-machine interaction. Corresponding author of this paper.).,. E-mail: neopenx@mail.hfut.edu.cn (PAN Ting Bachelor student at the School of Computer Science and Information, Hefei University of Technology. His research interest covers the theory of deep learning and Bayesian learning, and corresponding applications in computer vision and natural language processing.),.,,,. E-mail: ren@is.tokushima-u.ac.jp (REN Fu-Ji Professor at the Institute of Affective Computing, Hefei University of Technology and Tokushima University. His research interest coves artificial intelligent, affective computing, natural language processing, machine learning, and human-machine interaction.)