ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 14 Issue 1, January 2018

Emotion Recognition Using Multiple Kernel Learning toward E-learning Applications
Oryina Kingsley Akputu, Kah Phooi Seng, Yunli Lee, Li-Minn Ang
Article No.: 1
DOI: 10.1145/3131287

Adaptive Educational Hypermedia (AEH) e-learning models aim to personalize educational content and learning resources based on the needs of an individual learner. The Adaptive Hypermedia Architecture (AHA) is a specific implementation of the AEH...

Learning Label Preserving Binary Codes for Multimedia Retrieval: A General Approach
Kai Li, Guo-Jun Qi, Kien A. Hua
Article No.: 2
DOI: 10.1145/3152126

Learning-based hashing has been researched extensively in the past few years due to its great potential in fast and accurate similarity search among huge volumes of multimedia data. In this article, we present a novel multimedia hashing framework,...

Implicit Emotion Communication: EEG Classification and Haptic Feedback
Rodrigo Ceballos, Beatrice Ionascu, Wanjoo Park, Mohamad Eid
Article No.: 3
DOI: 10.1145/3152128

Today, ubiquitous digital communication systems do not have an intuitive, natural way of communicating emotion, which, in turn, affects the degree to which humans can emotionally connect and interact with one another. To address this problem, a...

Delay-Aware Quality Optimization in Cloud-Assisted Video Streaming System
Jiyan Wu, Bo Cheng, Yuan Yang, Ming Wang, Junliang Chen
Article No.: 4
DOI: 10.1145/3152116

Cloud-assisted video streaming has emerged as a new paradigm to optimize multimedia content distribution over the Internet. This article investigates the problem of streaming cloud-assisted real-time video to multiple destinations (e.g., cloud...

Deep Bidirectional Cross-Triplet Embedding for Online Clothing Shopping
Shuhui Jiang, Yue Wu, Yun Fu
Article No.: 5
DOI: 10.1145/3152114

In this article, we address the cross-domain (i.e., street and shop) clothing retrieval problem and investigate its real-world applications for online clothing shopping. It is a challenging problem due to the large discrepancy between street and...

DeepSearch: A Fast Image Search Framework for Mobile Devices
Peisong Wang, Qinghao Hu, Zhiwei Fang, Chaoyang Zhao, Jian Cheng
Article No.: 6
DOI: 10.1145/3152127

Content-based image retrieval (CBIR) is one of the most important applications of computer vision. In recent years, there have been many important advances in the development of CBIR systems, especially Convolutional Neural Networks (CNNs) and...

Robust Multi-Variate Temporal Features of Multi-Variate Time Series
Sicong Liu, Silvestro Roberto Poccia, K. Selçuk Candan, Maria Luisa Sapino, Xiaolan Wang
Article No.: 7
DOI: 10.1145/3152123

Many applications generate and/or consume multi-variate temporal data, and experts often lack the means to adequately and systematically search for and interpret multi-variate observations. In this article, we first observe that multi-variate time...

Online Early-Late Fusion Based on Adaptive HMM for Sign Language Recognition
Dan Guo, Wengang Zhou, Houqiang Li, Meng Wang
Article No.: 8
DOI: 10.1145/3152121

In sign language recognition (SLR) with multimodal data, a sign word can be represented by multiple features, which share an intrinsic property and a mutually complementary relationship. To fully explore those relationships,...

Joint Estimation of Age and Expression by Combining Scattering and Convolutional Networks
Huei-Fang Yang, Bo-Yao Lin, Kuang-Yu Chang, Chu-Song Chen
Article No.: 9
DOI: 10.1145/3152118

This article tackles the problem of joint estimation of human age and facial expression. This is an important yet challenging problem because expressions can alter face appearances in a similar manner to human aging. Different from previous...

Egocentric Hand Detection Via Dynamic Region Growing
Shao Huang, Weiqiang Wang, Shengfeng He, Rynson W. H. Lau
Article No.: 10
DOI: 10.1145/3152129

Egocentric videos, which mainly record the activities carried out by the users of wearable cameras, have drawn much research attention in recent years. Due to their lengthy content, a large number of ego-related applications have been developed to...

Visual Background Recommendation for Dance Performances Using Deep Matrix Factorization
Jiqing Wen, James She, Xiaopeng Li, Hui Mao
Article No.: 11
DOI: 10.1145/3152463

The stage background is one of the most important features for a dance performance, as it helps to create the scene and atmosphere. In conventional dance performances, the background images are usually selected or designed by professional stage...

Adaptive Fractional-Pixel Motion Estimation Skipped Algorithm for Efficient HEVC Motion Estimation
Zhaoqing Pan, Jianjun Lei, Yajuan Zhang, Fu Lee Wang
Article No.: 12
DOI: 10.1145/3159170

High-Efficiency Video Coding (HEVC) efficiently addresses the storage and transmission problems of high-definition videos, especially 4K videos. Motion Estimation (ME) based on variable-size Prediction Units (PUs) contributes a significant...

A Discriminatively Learned CNN Embedding for Person Reidentification
Zhedong Zheng, Liang Zheng, Yi Yang
Article No.: 13
DOI: 10.1145/3159171

In this article, we revisit two popular convolutional neural networks in person re-identification (re-ID): verification and identification models. The two models have their respective advantages and limitations due to different loss functions....

Robust Privacy-Preserving Image Sharing over Online Social Networks (OSNs)
Weiwei Sun, Jiantao Zhou, Shuyuan Zhu, Yuan Yan Tang
Article No.: 14
DOI: 10.1145/3165265

Sharing images online has become extremely easy and popular due to the ever-increasing adoption of mobile devices and online social networks (OSNs). The privacy issues arising from image sharing over OSNs have received significant attention in...