点击上方“专知”关注获取更多AI知识!
【导读】第25届ACM国际多媒体会议(ACM International Conference on Multimedia, 简称ACMMM)于2017年10月23日至27日在美国硅谷Mountain View隆重举行。自1993年首次召开以来,ACMMM每年召开一次,已经成为多媒体领域顶级会议,也是中国计算机学会推荐的A类国际学术会议热门方向有大规模图像视频分析、社会媒体研究、多模态人机交互、计算视觉、计算图像等等。
昨天我们分享了由ACM SIGMM China Chapter准备在10月17号在北京举行ACM MM 2017 Pre-Conference,欢迎查看参加!
【学术盛宴 】多媒体顶级会议ACM Multimedia 2017 China Pre-conference论文宣讲研讨会
今天我们分享ACM Multimedia 2017 文章,欢迎查看!
大会网址:http://www.acmmm.org/2017/
| Exploring Outliers in Crowdsourced Ranking for QoE Qianqian Xu , Institute of Information Engineering of Chinese Academy of Sciences); Ming Yan ; Chendi Huang ; Jiechao Xiong ; Qingming Huang ; Yuan Yao  |  
                
| Towards forward-looking online bitrate adaptation for DASH Bo Wang ; Fengyuan Ren  |  
                
| Weighted Sparse Representation Regularized Graph Learning for RGB-T Object Tracking Chenglong Li ; Nan Zhao ; Yijuan Lu ; Chengli Zhu ; Jin Tang  |  
                
| Multi-Scale Cascade Network for Salient Object Detection Xin Li ; Fan Yang ; Hong Cheng ; Junyu Chen ; Yuxiao Guo ; Leiting Chen  |  
                
| 3D CNNs on Distance Matrices for Human Action Recognition Alejandro José Hernández Ruiz ; Lorenzo Porzi ; Samuel Rota Bulò ; Francesc Moreno Noguer  |  
                
| 16K Cinematic VR Streaming Patrice Rondao Alface ; Maarten Aerts ; Donny Tytgat ; Sammy Lievens ; Christoph Stevens ; Nico Verzijp ; Jean-François Macq  |  
                
| Sync-DRAW: Automatic Video Generation using Deep Recurrent Attentive Architectures Gaurav Mittal ; Tanya Marwah ; Vineeth N Balasubramanian  |  
                
| On Server Provisioning for Cloud Gaming Yusen Li ; Yunhua Deng ; Xueyan Tang ; Wentong Cai ; Xiaoguang Liu ; Gang Wang  |  
                
| Region-based Image Retrieval Revisited by Semantic Region Specification and Spatial Relationship Recommendation Ryota Hinami ; Yusuke Matsui ; Shin'Ichi Satoh  |  
                
| Enhancing Micro-video Understanding by Harnessing External Sounds Liqiang Nie ; Xiang Wang ; Jianglong Zhang ; Xiangnan He ; Hanwang Zhang ; Richang Hong ; Qi Tian  |  
                
| Semi-Relaxtion Supervised Hashing for Cross-Modal Retrieval Peng-Fei Zhang ; Chuan-Xiang Li ; Meng-Yuan Liu ; Liqiang Nie ; Xin-Shun Xu  |  
                
| Sketch Recognition with Deep Visual-Sequential Fusion Model Jun-Yan He ; Xiao Wu ; Yu-Gang Jiang ; Bo Zhao ; Qiang Peng  |  
                
| From Part to Whole: Who is Behind the Painting? Daiqian Ma ; Feng Gao ; Yan Bai ; Yihang Lou ; Shiqi Wang ; Tiejun Huang ; Ling-Yu Duan  |  
                
| Adversarial Cross-Modal Retrieval Bokun Wang ; Yang Yang ; Xing Xu ; Alan Hanjalic ; Heng Tao Shen  |  
                
| Catching the Temporal Regions-of-Interest for Video Captioning Ziwei Yang ; Yahong Han ; Zheng Wang  |  
                
| Image quality assessment for DIBR synthesized views using elastic metric Suiyi Ling ; Patrick Le Callet  |  
                
| What your Facebook Profile Picture Reveals about your Personality Cristina Segalin ; Fabio Celli ; Luca Polonio ; David Stillwell ; Michal Kosinski ; Nicu Sebe ; Marco Cristani ; Bruno Lepri  |  
                
| Deep Asymmetric Pairwise Hashing Xin Gao ; Fumin Shen ; Li Liu ; Yang Yang ; Heng Tao Shen  |  
                
| Real-time Monocular Dense Mapping for Augmented Reality Tangli Xue ; Hongcheng Luo ; Zikang Yuan ; Xin Yang  |  
                
| Learning Object-Centric Transformation for Video Prediction Xiongtao Chen ; Wenmin Wang ; Jinzhuo Wang ; Weimian Li  |  
                
| Capturing spatial and temporal patterns for distinguishing between posed and spontaneous expressions Jiajia Yang ; Shangfei Wang  |  
                
| Deep Low-rank Sparse Collective Factorization for Cross-Domain Recommendation Shuhui Jiang ; Zhengming Ding ; Yun Fu  |  
                
| Detecting Temporal Proposal for Action Localization with Tree-structured Search Policy Xinyang Jiang ; Siliang Tang ; Yang Yang ; Zhou Zhao ; Fei Wu ; Yueting Zhuang  |  
                
| Fluency-Guided Cross-Lingual Image Captioning Weiyu Lan ; Xirong Li ; Jianfeng Dong  |  
                
| Learning Non-local Image Diffusion for Image Denoising Peng Qiao ; Yong Dou ; Wensen Feng ; Yunjin Chen  |  
                
| An Image-based Deep Spectrum Feature Representation for the Recognition of Emotional Speech Nicholas Cummins ; Shahin Amiriparian ; Gerhard Hagerer ; Anton Batliner ; Stefan Steidl ; Björn Schuller  |  
                
| FastShrinkage: Perceptually-aware Retargeting Toward Mobile Platforms Zhenguang Liu ; Luming Zhang ; Rajiv Ratn ; Yi Yang ; Xuelong Li  |  
                
| QUETRA: A Queuing Theory Approach to DASH Rate Adaptation Praveen Kumar Yadav ; Arash Shafiei ; Wei Tsang Ooi  |  
                
| ElasticPlay: Responsive Video Summarization with Dynamic Time Budget Haojian Jin ; Yale Song ; Koji Yatani  |  
                
| Learning Fashion Compatibility with Bidirectional LSTMs Xintong Han ; Zuxuan Wu ; Yu-Gang Jiang ; Larry Davis  |  
                
| H-TIME: Haptic-enabled Tele-Immersive Musculoskeletal Examination Yuan Tian ; Suraj Raghuraman ; Thiru Annaswamy ; Aleksander Borresen ; Klara Nahrstedt ; Balakrishnan Prabhakaran  |  
                
| Two-Stream Attentive CNNs for Image Retrieval Fei Yang ; Jia Li ; Shikui Wei ; Qinjie Zheng ; Ting Liu ; Yao Zhao  |  
                
| Magic-wall: Visualizing Room Decoration Ting Liu ; Yunchao Wei ; Yao Zhao ; Si Liu ; Shikui Wei  |  
                
| Automatic Music Video Generation Based on Simultaneous Soundtrack Recommendation and Video Editing Jen-Chun Lin ; Wen-Li Wei ; James Yang ; Hsin-Min Wang ; Hong-Yuan Mark Liao  |  
                
| DeepArt: Learning Joint Representations of Visual Arts Hui Mao ; Ming Cheung ; James She  |  
                
| Automatic Generation of Lyrics Parodies Lorenzo Gatti ; Gözde Özbal ; Oliviero Stock ; Carlo Strapparava  |  
                
| Anti-camera LED Lighting Xiao Shu ; Xiaolin Wu ; Qifan Gao  |  
                
| Mr.MAPP: Mixed Reality for MAnaging Phantom Pain Kanchan Bahirat ; Thiru Annaswamy ; Balakrishnan Prabhakaran  |  
                
| Video Captioning with Guidance of Multimodal Latent Topics Shizhe Chen ; Jia Chen ; Qin Jin ; Alexander Hauptmann  |  
                
| Where are the sweet spots? A systematic approach to reproducible DASH Player comparisons Denny Stohr ; Alexander Frömmgen ; Amr Rizk ; Michael Zink ; Ralf Steinmetz ; Wolfgang Effelsberg  |  
                
| Cross-modal Recipe Retrieval with Rich Food Attributes Jingjing Chen ; Chong-Wah Ngo ; Tat-Seng Chua  |  
                
| Integrated Face Analytics Networks through Cross-Dataset Hybrid Training Jianshu Li ; Shengtao Xiao ; Fang Zhao ; Jian Zhao ; Jianan Li ; Jiashi Feng ; Shuicheng Yan ; Terence Sim  |  
                
| Vocktail: A Virtual Cocktail for Pairing Digital Taste, Smell, and Color Sensations Nimesha Ranasinghe ; Thi Ngoc Tram Nguyen ; Yan Liangkun ; Lien-Ya Lin ; David Tolley ; Ellen Yi-Luen Do  |  
                
| Hashtag-centric Immersive Search on Social Media Yuqi Gao ; Jitao Sang ; Tongwei Ren ; Changsheng Xu  |  
                
| Affect Recognition in Ads with Application to Computational Advertising Abhinav Shukla ; Shruti Gullapuram ; Harish Katti ; Narasimha Karthik Yadati ; Mohan Kankanhalli ; Ramanathan Subramanian , University of Illinois at Urbana-Champaign)  |  
                
| Exploring the use of Time-Dependent Cross-Network Information for Personalized Recommendations Dilruk Perera ; Roger Zimmermann  |  
                
| Learning Multimodal Attention LSTM Networks for Video Captioning} Jun Xu ; Ting Yao ; Yongdong Zhang ; Tao Mei  |  
                
| Spatio-Temporal AutoEncoder for Video Anomaly Detection Yiru Zhao ; Bing Deng ; Chen Shen ; Yao Liu ; Hongtao Lu ; Xian-Sheng Hua  |  
                
| Deep Siamese Network with Multi-level Similarity Perception for Person Re-identification Chen Shen ; Zhongming Jin ; Yiru Zhao ; Zhihang Fu ; Rongxin Jiang ; Yaowu Chen ; Xian-Sheng Hua  |  
                
| Fashion World Map: Understanding Cities Through Streetwear Fashion Yu-Ting Chang ; Wen-Huang Cheng ; Bo Wu ; Kai-Lung Hua  |  
                
| Automatic Adjustment of Stereoscopic Content for Long-Range Projections in Outdoor Areas Behnam Maneshgar ; Leila Sujir ; Sudhir Mudur ; Charalambos Poullis  |  
                
| SketchParse: Towards Rich Descriptions For Poorly Drawn Sketches Using Multi-Task Deep Networks Ravi Kiran Sarvadevabhatla ; Isht Dwivedi ; Abhijat Biswas ; Sahil Manocha ; Venkatesh Babu R.  |  
                
| Place-centric Visual Urban Perception with Hierarchical Deep Multi-instance Regression Xiaobai Liu ; Qi Chen ; Yuanlu Xu ; Lei Zhu ; Xuming He  |  
                
| A Delicious Recipe Analysis Framework for Exploring Multi-Modal Recipes with Various Attributes Weiqing Min ; Shuqiang Jiang ; Shuhui Wang ; Jitao Sang ; Shuhuan Mei  |  
                
| Temporal Binary Coding for Large-Scale Video Search Ke Xia ; Yuqing Ma ; Xianglong Liu ; Yadong Mu ; Li Liu  |  
                
| Learning to Compose with Professional Photographs on the Web Yi-Ling Chen ; Jan Klopp ; Min Sun ; Shao-Yi Chien ; Kwan-Liu Ma  |  
                
| StructCap: Structured Semantic Embedding for Image Captioning Fuhai Chen ; Rongrong Ji ; Jinsong Su ; Yongjian Wu ; Yunsheng Wu  |  
                
| Unconstrained Fashion Landmark Detection Sijie Yan ; Ziwei Liu ; Ping Luo ; Xiaogang Wang ; Xiaoou Tang  |  
                
| Skeleton-aided Articulated Motion Generation Yichao Yan ; Jingwei Xu ; Bingbing Ni ; Wendong Zhang ; Xiaokang Yang  |  
                
| One-Shot Fine-Grained Instance Retrieval Hantao Yao , Chinese Academy of Sciences; University of Chinese Academy of Sciences); Shiliang Zhang ; Yongdong Zhang , Chinese Academy of Sciences); Jintao Li , Chinese Academy of Sciences); Qi Tian  |  
                
| GLAD: Global-Local-Alignment Descriptor for Pedestrian Retrieval Longhui Wei ; Shiliang Zhang ; Hantao Yao ; Wen Gao ; Qi Tian  |  
                
| Deep progressive hashing for image retrieval Jiale Bai ; Bingbing Ni ; Minsi Wang ; Hanjiang Lai ; Yang Shen ; Lin Mei ; Chongyang Zhang ; Chuanping Hu  |  
                
| FaceCollage: A Rapidly Deployable System for Real-time Head Reconstruction for On-The-Go 3D Telepresence Fuwen Tan ; Chi-Wing Fu ; Jianfei Cai ; Teng Deng ; Tat-Jen Cham  |  
                
| Protest Activity Detection and Perceived Violence Estimation from Social Media Images Donghyeon Won ; Zachary Steinert-Threlkeld ; Jungseock Joo  |  
                
| LiveJack: Integrating CDNs and Edge Clouds for Live Content Broadcasting Bo Yan ; Shu Shi ; Yong Liu ; Weizhe Yuan ; Haoqin He ; Rittwik Jana ; Yang Xu ; H. Jonathan Chao  |  
                
| Modeling the Intransitive Pairwise Image Preference from Multiple Angles Jun Chen ; Chaokun Wang ; Jianmin Wang  |  
                
| Fast Deep Matting for Portrait Animation on Mobile Phone Bingke Zhu ; Yingying Chen ; Si Liu ; Bo Zhang ; Jinqiao Wang ; Ming Tang  |  
                
| Pedestrian Path Forecasting in Crowd: A Deep Spatio-Temporal Perspective Yuke Li  |  
                
| ReGLe: Spatially Regularized Graph Learning for Visual Tracking Chenglong Li ; Xiaohao Wu ; Zhimin Bao ; Jin Tang  |  
                
| 360ProbDASH: Improving QoE of 360 Video Streaming Using Tile-based HTTP Adaptive Streaming Lan Xie ; Zhimin Xu ; Yixuan Ban ; Xinggong Zhang ; Zongming Guo  |  
                
| Deep Unsupervised Convolutional Domain Adaptation Junbao Zhuo , Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190, China); Shuhui Wang , Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190, China); Weigang Zhang ); Qingming Huang  |  
                
| PD-Survey - Supporting Audience-Centric Research through Surveys on Public Display Networks Florian Alt  |  
                
| Improving Event Extraction via Cross-Modal Integration Tongtao Zhang ; Spencer Whitehead ; Hanwang Zhang ; Hongzhi Li ; Joseph Ellis ; Lifu Huang ; Wei Liu ; Heng Ji ; Shih-Fu Chang  |  
                
| Indefinite Kernel Logistic Regression Fanghui Liu ; Xiaolin Huang ; Jie Yang  |  
                
| Multimodal Learning for Web Information Extraction Dihong Gong ; Daisy Wang ; Yang Peng  |  
                
| Query-adaptive Video Summarization via Quality-aware Relevance Estimation Arun Balajee Vasudevan ; Michael Gygli ; Anna Volokitin ; Luc Van Gool  |  
                
| Predicting Human Intentions from Motion Cues Only: A 2D+3D Fusion Approach Andrea Zunino ; Jacopo Cavazza ; Atesh Koul ; Andrea Cavallo ; Cristina Becchio ; Vittorio Murino  |  
                
| RGB-D Scene Recognition with Object-to-Object Relation Xinhang Song ); Chengpeng Chen ); Shuqiang Jiang )  |  
                
| Exploring Consistent Preferences: Discrete Hashing with Pair-Exemplar for Scalable Landmark Search Lei Zhu ; Zi Huang ; Xiaojun Chang ; Jingkuan Song ; Heng Tao Shen  |  
                
| Data Generation for Improving Person Re-identification Lin Chen ; Hua Yang ; Shuang Wu ; Zhiyong Gao  |  
                
| Fast and Accurate Pedestrian Detection using Dual-Stage Group Cost-Sensitive RealBoost with Vector Form Filters Chengju Zhou ; Meiqing Wu ; Siew-Kei Lam  |  
                
| Positive and Unlabeled Learning for Anomaly Detection with Multi-features Jiaqi Zhang ; Zhenzhen Wang ; Junsong Yuan ; Yap Peng Tan  |  
                
| Learning Visual Emotion Distributions via Multi-Modal Features Fusion Sicheng Zhao ; Guiguang Ding ; Yue Gao ; Jungong Han  |  
                
| ShareRender: Bypass GPU Virtualization to Enable Fine-grained Resource Sharing for Cloud Gaming Wei Zhang ; Xiaofei Liao ; Hai Jin ; Peng Li ; Li Lin  |  
                
| Vivepaper: Augmented Reality Virtual Book for Immersive Reading Experience Zhongyang Zheng ; Bo Wang ; Yakun Wang ; Catherine Yang ; Zhongqian Dong ; Tianyang Yi ; Cyrus Choi ; Edward Chang  |  
                
| Online Cross-Modal Scene Retrieval by Binary Representation and Semantic Graph Mengshi Qi ; Yunhong Wang ; Annan Li  |  
                
| NeuroStylist: Neural Compatibility Modeling for Clothing Matching Xuemeng Song ; Fuli Feng ; Jinhuan Liu ; Zekun Li ; Liqiang Nie ; Jun Ma  |  
                
| Sports VR Content Generation from Regular Camera Feeds Kiana Calagari ; Mohamed Elgharib ; Mohamed Hefeeda ; Shervin Shirmohammadi  |  
                
| Two Birds One Stone: On both Cold-Start and Long-Tail Recommendation Jingjing Li ; Ke Lu ; Zi Huang ; Heng Tao Shen  |  
                
| Multi-Networks Joint Learning for Large-Scale Cross-Modal Retrieval Liang Zhang ; Bingpeng Ma ; Guorong Li ; Qingming Huang ; Qi Tian  |  
                
| Salient Object Detection with Chained Multi-Scale Fully Convolutional Network Youbao Tang ; Xiangqian Wu  |  
                
| Fine-grained Discriminative Localization via Saliency-guided Faster R-CNN Xiangteng He ; Yuxin Peng ; Junjie Zhao  |  
                
| Exploiting High-Level Semantics for No-Reference Image Quality Assessment of Realistic Blur Images Dingquan Li ; Tingting Jiang ; Ming Jiang  |  
                
| Learning to Recognise Unseen Classes by A Few Similes Yang Long ; Ling Shao  |  
                
| Deep Cross-Modality Alignment for Multi-Shot Person Re-IDentification Zhichao Song ; Bingbing Ni ; Yichao Yan ; Zhe Ren ; Yi Xu ; Xiaokang Yang  |  
                
| Hierarchical Recurrent Neural Network for Video Summarization Bin Zhao ; Xuelong Li ; Xiaoqiang Lu  |  
                
| A simplified topological representation of text for local and global context Ishrat Rahman Sami ; Katayoun Farrahi  |  
                
| Improved Multimodal Representation Learning with Skip Connections Ning Zhang ; Yu Cao ; Yan Luo ; Benyuan Liu  |  
                
| Modeling Image Virality with Pairwise Spatial Transformer Nets Abhimanyu Dubey ; Sumeet Agarwal  |  
                
| Metric-based Generative Adversarial Network Guoxian Dai ; Jin Xie ; Yi Fang  |  
                
| More Than An Answer: Neural Pivot Network for Visual Question Answering Yiyi Zhou ; Rongrong Ji ; Jinsong Su ; Yongjian Wu ; Yunsheng Wu  |  
                
| Photo2Trip: Exploiting Visual Contents in Geo-tagged Photos for Personalized Tour Recommendation Pengpeng Zhao ; Xiefeng Xu ; Yanchi Liu ; Victor S. Sheng ; Kai Zheng ; Hui Xiong  |  
                
| Deep Active Learning Through Cognitive Information Parcels Wencang Zhao ; Yu Kong ; Zhengming Ding ; Shangqian Gao ; Yun Fu  |  
                
| A Paralinguistic Approach To Holistic Speaker Diarisation -- Using Age, Gender, Voice Likability and Personality Traits Yue Zhang ; William McGehee ; Maximilian Schmitt ; Florian Eyben ; Björn Schuller  |  
                
| OpTile: Toward Optimal Tiling in 360-degree Video Streaming Mengbai Xiao ; Chao Zhou ; Yao Liu ; Songqing Chen  |  
                
| 3DensiNet: A Robust Neural Network Architecture Towards 3D Volumetric Object Prediction From 2D Image Meng Wang ; Lingjing Wang ; Yi Fang  |  
                
| Towards Micro-video Understanding by Joint Sequential-Sparse Modeling Meng Liu ; Liqiang Nie ; Meng Wang ; Baoquan Chen  |  
                
| LEAF: Latent Extended Attribute Features Discovery for Visual Classification Hua Zhang ; Rui Wang ; Changqing Zhang ; Xiaochun Cao  |  
                
| Single Shot Temporal Action Detection Tianwei Lin ; Xu Zhao ; Zheng Shou  |  
                
| Too Many Pixels to Perceive: Subpixel Shutoff for Display Energy Reduction on OLED Smartphones Zhisheng Yan ; Chang Wen Chen  |  
                
| Finding the Secret of CNN Parameter Layout under Strict Size Constraint Liao Lixin ; Yao Zhao ; Shikui Wei ; Wang Jingdong ; Liu Ruoyu  |  
                
| It’s All Around You: Exploring 360° Video Viewing Experiences on Mobile Devices Marc van den Broeck ; Fahim Kawsar ; Johannes Schöning  |  
                
| Visualization of Stone Trajectories in Live Curling Broadcasts using Online Machine Learning Masaki Takahashi ); Shinsuke Yokozawa ); Hideki Mitsumine ); Tomoyuki Mishina ); Yasuyuki Matsuhisa ); Sawako Muramatsu )  |  
                
| Exploring Domain Knowledge for Affective Video Content Analyses Tanfang Chen ; Yaxin Wang ; Shangfei Wang ; Shiyu Chen  |  
                
| Deep Temporal Models using Identity Skip-Connections for Speech Emotion Recognition Jaebok Kim ; Gwenn Englebienne ; Khiet Truong ; Vanessa Evers  |  
                
| Video Description with Spatial-Temporal Attention Yunbin Tu ; Xishan Zhang ; Bingtao Liu ; Chenggang Yan  |  
                
| Deep Binary Reconstruction for Cross-modal Hashing Xuelong Li ; Di Hu ; Feiping Nie  |  
                
| Pedestrian Detection via Bi-directional Multi-scale Analysis Zhenyu Duan ; Jinpeng Lan ; Yi Xu ; Bingbing Ni ; Lixue Zhuang ; Xiaokang Yang  |  
                
| Rethinking HTTP Adaptive Streaming with the Mobile User Perception Chao Wu ; Wenwu Zhu ; Qiushi Li ; Yaoxue Zhang  |  
                
| Fine-grained Recognition via Attribute-guided Attentive Feature Aggregation Yichao Yan ; Bingbing Ni ; Xiaokang Yang  |  
                
| NormFace: $L_2$ HyperSphere Embedding for Face Verification Feng Wang ; Xiang Xiang ; Jian Cheng ; Alan Yuille  |  
                
| Semi-Dense Depth Interpolation using Deep Convolutional Neural Networks Ilya Makarov ; Vladimir Aliev ; Olga Gerasimova  |  
                
| Occlusion-aware Video Temporal Consistency Chun-Han Yao ; Chia-Yang Chang ; Shao-Yi Chien  |  
                
| Video Question Answering via Hierarchical Dual-Level Attention Network Learning Zhou Zhao ; Jinghao Lin ; Xinghua Jiang ; Deng Cai ; Xiaofei He ; Yueting Zhuang  |  
                
| Region-based Activity Recognition Using Conditional GAN Xinyu Li ; Yanyi Zhang ; Jianyu Zhang ; Yueyang Chen ; Huangcan Li ; Ivan Marsic ; Randall Burd  |  
                
| Learning a Target Sample Re-Generator for Cross-Database Micro-Expression Recognition Yuan Zong ; Xiaohua Huang ; Wenming Zheng ; Zhen Cui ; Guoying Zhao  |  
                
| REQUEST: Seamless Dynamic Adaptive Streaming over HTTP for Multi-Homed Smartphone under Resource Constraints Jonghoe Koo ; Juheon Yi ; Joongheon Kim ; Mohammad A. Hoque ; Sunghyun Choi  |  
                
| Cross-media retrieval by learning rich semantic embeddings of multimedia Mengdi Fan ; Wenmin Wang ; Peilei Dong ; Liang Han ; Ronggang Wang ; Ge Li  |  
                
| Optimal Set of 360-Degree Videos for Viewport-Adaptive Streaming Xavier Corbillon ; Gwendal Simon ; Alisa Devlic ; Jacob Chakareski  |  
                
| WebRTC Congestion Control using Forward-Error Correction Balázs Kreith ; Varun Singh ; Jörg Ott  |  
                
| Visual Sentiment Analysis for Review Images with Item-Oriented and User-Oriented CNN Quoc-Tuan Truong ; Hady Lauw  |  
                
| From Multimedia Logs to Personal Chronicles Hyungik Oh ; Ramesh Jain  |  
                
| Experimental Analysis of Bandwidth Allocation in Automated Video Surveillance Systems Sina Gholamnejad Davani ; Nabil Sarhan  |  
                
| Mutually Guided Image Filtering Xiaojie Guo ; Yu Li ; Jiayi Ma  |  
                
| Learning Semantic Feature Map for Visual Content Recognition Rui-Wei Zhao ; Zuxuan Wu ; Jianguo Li ; Yu-Gang Jiang  |  
                
| Video Visual Relation Detection Xindi Shang ; Tongwei Ren ; Jingfan Guo ; Hanwang Zhang ; Tat-Seng Chua  |  
                
| Deep Location-Specific Tracking Lingxiao Yang ; Risheng Liu ; David Zhang ; Lei Zhang  |  
                
| A Multi-Task Framework for Weather Recognition Zhigang Wang ; Xuelong Li ; Xiaoqiang Lu  |  
                
| From Hard to Soft: Towards more Human-like Emotion Recognition by Modelling the Perception Uncertainty Jing Han ; Zixing Zhang ; Maximilian Schmitt ; Maja Pantic ; Björn Schuller  |  
                
| When Cloud Meets Uncertain Crowd: An Auction Approach for Crowdsourced Livecast Transcoding Yifei Zhu ; Jiangchuan Liu ; Zhi Wang ; Cong Zhang  |  
                
| Multimedia Semantic Integrity Assessment Using Joint Embedding Of Images and Text Ayush Jaiswal ; Ekraam Sabir ; Wael Abd-Almageed ; Prem Natarajan  |  
                
| Discriminative Training of Complex-valued Deep Recurrent Neural Network for Singing Voice Separation Yuan-Shan Lee ); Kuo Yu ); Sih-Huei Chen ); Jia-Ching Wang  |  
                
| Multicamera Summarization of Rehabilitation Sessions in Home Environment Tarek Elgamal ; Klara Nahrstedt  |  
                
| Adaptive Low-Rank Multi-Label Active Learning for Image Classification Jian Wu ; Anqian Guo ; Victor S. Sheng ; Pengpeng Zhao ; Zhiming Cui  |  
                
| Modeling the Resource Requirements of Convolutional Neural Networks on Mobile Devices Zongqing Lu ; Swati Rallapalli ; Kevin Chan ; Thomas La Porta  |  
                
| Adaptively Attending to Visual Attributes and Linguistic Knowledge for Captioning Yi Bin ; Yang Yang ; Jie Zhou ; Zi Huang ; Heng Tao Shen  |  
                
| Efficient Binary Coding for Subspace-based Query-by-Image Video Retrieval Ruicong Xu ; Yang Yang ; Fumin Shen ; Ning Xie ; Heng Tao Shen  |  
                
| Adaptive Audio Classification for Smartphone in Noisy Car Environment Myounggyu Won ; Haitham Alsaadan ; Yongsoon Eun  |  
                
| Real-Time False-Contours Removal for Inverse Tone Mapped High Dynamic Range Content using Projection Onto Convex Sets theory Gonzalo Luzardo ; Jan Aelterman ; Hiep Luong ; Wilfried Philips ; Daniel Ochoa  |  
                
| Incremental accelerated kernel discriminant analysis Nikolaos Gkalelis ; Vasileios Mezaris  |  
                
| Venues in Social Media: Examining Ambiance Perception Through Scene Semantics Yassir Benkhedda ; Darshan Santani ; Daniel Gatica-Perez  |  
                
| Pseudo label based Unsupervised Deep discriminative Hashing for image retrieval Qinghao Hu ; Jiaxiang Wu ; Jian Cheng ; Hanqing Lu  |  
                
| Moving as a Leader: Detecting Emergent Leadership in Small Groups using Body Pose Cigdem Beyan ; Vasiliki-Maria Katsageorgiou ; Vittorio Murino  |  
                
| A Novel System for Visual Navigation of Educational Videos Using Multimodal Cues Baoquan Zhao ; Xiaonan Luo ; Shujin Lin ; Songhua Xu ; Ruomei Wang  |  
                
| #VisualHashtags: Visual Summarization of Social Media Events Using Mid-Level Visual Elements Sonal Goel ; Sarthak Ahuja ; A V Subramanyam ; Ponnurangam Kumaraguru  |  
                
| Mulit-scale Context based Attention for Dynamic Music Emotion Prediction Ye Ma ; Xinxing Li ; Mingxing Xu ; Lianhong Cai  |  
                
| Outlining objects for interactive segmentation on touch devices Matthieu Pizenberg ; Axel Carlier ; Emmanuel Faure ; Vincent Charvillat  |  
                
| Deep Matching and Validation Network: An End-to-End Solution to Constrained Image Splicing Localization and Detection Yue Wu ; Wael Abdalmageed ; Prem Natarajan  |  
                
| Multi-modal localization and enhancement of multiple sound sources from a micro aerial vehicle Ricardo Sanchez-Matilla ; Lin Wang ; Andrea Cavallaro  |  
                
| Temporally Selective Attention Model for Social and Affective State Recognition in Multimedia Content Hongliang Yu ; Liangke Gui ; Michael Madaio ; Amy Ogan ; Justine Cassell ; Louis-Philippe Morency  |  
                
| Adaptive 360-Degree Video Streaming using Scalable Video Coding Afshin Taghavi Nasrabadi ; Anahita Mahzari ; Joseph D. Beshay ; Ravi Prakash  |  
                
| Deep Supervised Quantization by Self-Organized Map Min Wang ; Wengang Zhou ; Qi Tian ; Junfu Pu ; Houqiang Li  |  
                
| Selective Deep Convolutional Features for Image Retrieval Tuan Hoang Nguyen Anh ; Thanh-Toan Do ; Dang-Khoa Le Tan ; Ngai-Man Cheung  |  
                
| Quality-of-Experience of Adaptive Video Streaming: Exploring the Space of Adaptations Zhengfang Duanmu ; Kede Ma ; Zhou Wang  |  
                
| Statistical Inference of Gaussian-Laplace Distribution for Person Verification Zheng Wang ; Ruimin Hu ; Yi Yu ; Junjun Jiang ; Jiayi Ma ; Shin'Ichi Satoh  |  
                
| Beyond Human-level License Plate Super-resolution with Progressive Vehicle Search and Domain Priori GAN Wu Liu ; Xinchen Liu ; Huadong Ma ; Peng Cheng  |  
                
| Learning to Generate and Edit Hairstyles Weidong Yin ; Yanwei Fu ; Yiqing Ma ; Yugang Jiang ; Tao Xiang ; Xiangyang Xue  |  
                
| Adaptively Weighted Multi-task Deep Network for Person Attribute Classification Keke He ; Zhanxiong Wang ; Yanwei Fu ; Yu-Gang Jiang ; Rui Feng ; Xiangyang Xue  |  
                
| Laplacian-Steered Neural Style Transfer Shaohua Li ; Xinxing Xu ; Liqiang Nie ; Tat-Seng Chua  |  
                
| Video Question Answering via Gradually Refined Attention over Appearance and Motion Dejing Xu ; Zhou Zhao ; Jun Xiao ; Fei Wu ; Hanwang Zhang ; Xiangnan He ; Yueting Zhuang  |  
                
| Cross-Domain Image Retrieval with Attention Modeling Xin Ji ; Wei Wang ; Meihui Zhang ; Yang Yang  |  
                
| PQk-means: Billion-scale Clustering for Product-quantized Codes Yusuke Matsui ; Keisuke Ogaki ; Toshihiko Yamasaki ; Kiyoharu Aizawa  |  
                
| Face Aging with Contextural Generative Adversarial Nets Si Liu ; Yao Sun ; Wei Wang ; Renda Bao ; Defa Zhu ; Shuicheng Yan  |  
                
| Attention Transfer from Web Images for Video Recognition Junnan Li ; Yongkang Wong ; Qi Zhao ; Mohan Kankanhalli  |  
                
| A Unified Personalized Video Recommendation via Dynamic Recurrent Neural Networks Junyu Gao ; Tianzhu Zhang ; Changsheng Xu  |  
                
| Is Foveated Rendering Perceivable to VR Users? A Study on the Efficiency and Consistency of Subjective Assessment Methods Chih-Fan Hsu ; Anthony Chen ; Cheng-Hsin Hsu ; Chun-Ying Huang ; Chin-Laung Lei ; Kuan-Ta Chen  |  
                
| Wheel: Accelerating CNNs with Distributed GPUs via Hybrid Parallelism and Alternate Strategy Xiaoyu Du ; Jinhui Tang ; Zechao Li ; Zhiguang Qin  |  
                
| Multiview and Multimodal Pervasive Indoor Localization Zhenguang Liu ; Li Cheng ; Anan Liu ; Luming Zhang ; Xiangnan He ; Roger Zimmermann  |  
                
| Future-Supervised Retrieval of Unseen Queries for Live Video Spencer Cappallo ; Cees Snoek  |  
                
| Deep Attribute-preserving Metric Learning for Natural Language Object Retrieval Jianan Li ; Yunchao Wei ; Xiaodan Liang ; Fang Zhao ; Jianshu Li ; Tingfa Xu ; Jiashi Feng  |  
                
| Understanding Fashion Trends from Street Photos via Neighbor-Constrained Embedding Learning Xiaoling Gu ; Yongkang Wong ; Pai Peng ; Lidan Shou ; Gang Chen ; Mohan S. Kankanhalli  |  
                
| Multi-Modal Knowledge Representation Learning via Webly-Supervised Relationships Mining Fudong Nian ; Bingkun Bao ; Teng Li ; Changsheng Xu  |  
                
| The Role of Visual Attention in Sentiment Prediction Shaojing Fan ; Ming Jiang ; Zhiqi Shen ; Bryan Koenig ; Mohan Kankanhalli ; Qi Zhao  |  
                
| Searching Personal Photos on the Phone with Instant Visual Query Suggestion and Joint Text-Image Hashing Zhaoyang Zeng ; Jianlong Fu ; Hongyang Chao ; Tao Mei  |  
                
| Robust Visual Object Tracking with Top-down Reasoning Mengdan Zhang ; Jiashi Feng ; Weiming Hu  |  
                
| Stylized Adversarial Autoencoder for Image Generation Yiru Zhao ; Bing Deng ; Jianqiang Huang ; Hongtao Lu ; Xian-Sheng Hua  |  
                
| An HTTP/2-Based Adaptive Streaming Framework for 360 Virtual Reality Videos Stefano Petrangeli ; Viswanathan Swaminathan ; Mohammad Hosseini ; Filip De Turck  |  
                
| Multimodal Fusion with Recurrent Neural Networks for Rumor Detection on Microblogs Zhiwei Jin ; Han Guo ; Juan Cao ; Yongdong Zhang ; Jiebo Luo  |  
                
| A Dual-Network Progressive Approach to Weakly Supervised Object Detection Xuanyi Dong ; Deyu Meng ; Fan Ma ; Yi Yang  |  
                
获取更多关于ACM Multimedia 2017的资料,请登录www.zhuanzhi.ai, 搜索 ACM Multimedia 查看!
欢迎转发分享到微信群和朋友圈!
请扫描小助手,加入专知人工智能群,交流分享~
获取更多关于ACM Multimedia 2017的资料,以及机器学习以及人工智能知识资料,请访问www.zhuanzhi.ai, 或者点击阅读原文,即可得到!
-END-
欢迎使用专知
专知,一个新的认知方式!目前聚焦在人工智能领域为AI从业者提供专业可信的知识分发服务, 包括主题定制、主题链路、搜索发现等服务,帮你又好又快找到所需知识。
使用方法>>访问www.zhuanzhi.ai, 或点击文章下方“阅读原文”即可访问专知
中国科学院自动化研究所专知团队
@2017 专知
专 · 知
关注我们的公众号,获取最新关于专知以及人工智能的资讯、技术、算法、深度干货等内容。扫一扫下方关注我们的微信公众号。
点击“阅读原文”,使用专知!