ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 13 Issue 1, January 2017

Table of Contents
Zheng Yan
Article No.: 1e
DOI: 10.1145/3026796

Learning from Collective Intelligence: Feature Learning Using Social Images and Tags
Hanwang Zhang, Xindi Shang, Huanbo Luan, Meng Wang, Tat-Seng Chua
Article No.: 1
DOI: 10.1145/2978656

Feature representation for visual content is the key to the progress of many fundamental applications such as annotation and cross-modal retrieval. Although recent advances in deep feature learning offer a promising route towards these tasks, they...

Prediction of Virality Timing Using Cascades in Social Media
Ming Cheung, James She, Alvin Junus, Lei Cao
Article No.: 2
DOI: 10.1145/2978771

Predicting content going viral in social networks is attractive for viral marketing, advertisement, entertainment, and other applications, but it remains a challenge in the big data era today. Previous works mainly focus on predicting the possible...

Approximate Asymmetric Search for Binary Embedding Codes
Chih-Yi Chiu, Yu-Cyuan Liou, Amorntip Prayoonwong
Article No.: 3
DOI: 10.1145/2990504

In this article, we propose a method of approximate asymmetric nearest-neighbor search for binary embedding codes. The asymmetric distance takes advantage of less information loss at the query side. However, calculating asymmetric distances...

QoE-Based Low-Delay Live Streaming Using Throughput Predictions
Konstantin Miller, Abdel-Karim Al-Tamimi, Adam Wolisz
Article No.: 4
DOI: 10.1145/2990505

Recently, Hypertext Transfer Protocol (HTTP)-based adaptive streaming has become the de facto standard for video streaming over the Internet. It allows clients to dynamically adapt media characteristics to the varying network conditions to ensure...

Digital Lollipop: Studying Electrical Stimulation on the Human Tongue to Simulate Taste Sensations
Nimesha Ranasinghe, Ellen Yi-Luen Do
Article No.: 5
DOI: 10.1145/2996462

Among the five primary senses, the sense of taste is the least explored as a form of digital media applied in Human-Computer Interface. This article presents an experimental instrument, the Digital Lollipop, for digitally simulating the sensation...

Fixation Prediction through Multimodal Analysis
Xiongkuo Min, Guangtao Zhai, Ke Gu, Xiaokang Yang
Article No.: 6
DOI: 10.1145/2996463

In this article, we propose to predict human eye fixation through incorporating both audio and visual cues. Traditional visual attention models generally make the utmost of stimuli’s visual features, yet they bypass all audio information. In...

Predicting Occupation from Images by Combining Face and Body Context Information
Wei-Ta Chu, Chih-Hao Chiu
Article No.: 7
DOI: 10.1145/3009911

Facial images embed age, gender, and other rich information that is implicitly related to occupation. In this work, we advocate that occupation prediction from a single facial image is a doable computer vision problem. We extract multilevel...

Consistent Synchronization of Action Order with Least Noticeable Delays in Fast-Paced Multiplayer Online Games
Jingxi Xu, Benjamin W. Wah
Article No.: 8
DOI: 10.1145/3003727

When running multiplayer online games on IP networks with losses and delays, the order of actions may be changed when compared to the order run on an ideal network with no delays and losses. To maintain a proper ordering of events, traditional...

Audiovisual Tool for Solfège Assessment
Rodrigo Schramm, Helena De Souza Nunes, Cláudio Rosito Jung
Article No.: 9
DOI: 10.1145/3007194

Solfège is a general technique used in the music learning process that involves the vocal performance of melodies, regarding the time and duration of musical sounds as specified in the music score, properly associated with the...

Identification of Reconstructed Speech
Haojun Wu, Yong Wang, Jiwu Huang
Article No.: 10
DOI: 10.1145/3004055

Both voice conversion and hidden Markov model (HMM)-based speech synthesis can be used to produce artificial voices of a target speaker. They have shown great negative impacts on speaker verification (SV) systems. In order to enhance the...

Drift-Compensated Robust Watermarking Algorithm for H.265/HEVC Video Stream
Sibaji Gaj, Aditya Kanetkar, Arijit Sur, Prabin Kumar Bora
Article No.: 11
DOI: 10.1145/3009910

It has been observed in the recent literature that the drift error due to watermarking degrades the visual quality of the embedded video. The existing drift error handling strategies for recent video standards such as H.264 may not be directly...

An Efficient Framework for Compressed Domain Watermarking in P Frames of High-Efficiency Video Coding (HEVC)-Encoded Video
Tanima Dutta, Hari Prabhat Gupta
Article No.: 12
DOI: 10.1145/3002178

Digital watermarking has received much attention in recent years as a promising solution to copyright protection. Video watermarking in compressed domain has gained importance since videos are stored and transmitted in a compressed format. This...