Predicting Salient Face in Multiple-face Videos
2017
Proceedings title: 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017)
ISSN: 1063-6919
Volume: 2017-January
Pages: 3224-3232
Publication status: Published
DOI: 10.1109/CVPR.2017.343
Abstract: Although the recent success of convolutional neural networks (CNNs) has advanced state-of-the-art saliency prediction in static images, few works have addressed the problem of predicting attention in videos. On the other hand, we find that the attention of different subjects consistently focuses on a single face in each frame of videos involving multiple faces. Therefore, we propose in this paper a novel deep learning (DL) based method to predict the salient face in multiple-face videos, which is capable of learning the features and transitions of salient faces across video frames. In particular, we first learn a CNN for each frame to locate the salient face. Taking CNN features as input, we develop a multiple-stream long short-term memory (M-LSTM) network to predict the temporal transition of salient faces in video sequences. To evaluate our DL-based method, we build a new eye-tracking database of multiple-face videos. The experimental results show that our method outperforms the prior state-of-the-art methods in predicting visual attention on faces in multiple-face videos.
Place of publication: 345 E 47TH ST, NEW YORK, NY 10017 USA
Conference location: Honolulu, HI, United States
Conference dates: 21-26 July 2017
Indexed in: CPCI ; EI
Language: English
Funding: Fok Ying-Tong Education Foundation [151061]
WOS research areas: Computer Science ; Engineering
WOS categories: Computer Science, Artificial Intelligence ; Computer Science, Theory & Methods ; Engineering, Electrical & Electronic
WOS accession number: WOS:000418371403032
Publisher: IEEE
EI accession number: 20181304947391
EI subject terms: Behavioral research ; Computer vision ; Deep learning ; Eye tracking ; Forecasting ; Neural networks ; Pattern recognition
EI classification codes: Computer Applications:723.5 ; Social Sciences:971
WOS keywords: MODEL
Original document type: Proceedings Paper
Source database: IEEE
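Document type: Conference paper
Item identifier: https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/16321
Collections: School of Information Science and Technology
School of Information Science and Technology_PI Research Groups_He Xuming Group
School of Information Science and Technology_Master's Students
School of Information Science and Technology_PhD Students
Corresponding author: Xu, Mai
Author affiliations:
1. Beihang Univ, Beijing, Peoples R China
2. ShanghaiTech Univ, Shanghai, Peoples R China
Recommended citation:
GB/T 7714
Liu, Yufan, Zhang, Songyang, Xu, Mai, et al. Predicting Salient Face in Multiple-face Videos[C]. 345 E 47TH ST, NEW YORK, NY 10017 USA: IEEE, 2017: 3224-3232.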
File name: salientface_cvpr17.pdf
Format: Adobe PDF