ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 13 Issue 4, October 2017

Minh Son Dao
Article No.: 47e
DOI: 10.1145/3143786

Sparse Representation-Based Semi-Supervised Regression for People Counting
Hong-Bo Zhang, Bineng Zhong, Qing Lei, Ji-Xiang Du, Jialin Peng, Duansheng Chen, Xiao Ke
Article No.: 47
DOI: 10.1145/3106156

Label imbalance and the insufficiency of labeled training samples are major obstacles in most methods for counting people in images or videos. In this work, a sparse representation-based semi-supervised regression method is proposed to count...

Caching Online Video: Analysis and Proposed Algorithm
Shahid Akhtar, Andre Beck, Ivica Rimac
Article No.: 48
DOI: 10.1145/3106157

Online video presents new challenges to traditional caching, with over a thousand-fold increase in the number of assets, rapidly changing asset popularity, and much higher throughput requirements. We propose a new hierarchical filtering...

Multimodal Retrieval with Diversification and Relevance Feedback for Tourist Attraction Images
Duc-Tien Dang-Nguyen, Luca Piras, Giorgio Giacinto, Giulia Boato, Francesco G. B. De Natale
Article No.: 49
DOI: 10.1145/3103613

In this article, we present a novel framework that can produce a visual description of a tourist attraction by choosing the most diverse pictures from community-contributed datasets, which describe different details of the queried location. The...

Mixtape: Using Real-Time User Feedback to Navigate Large Media Collections
Luciana Fujii Pontello, Pedro H. F. Holanda, Bruno Guilherme, João Paulo V. Cardoso, Olga Goussevskaia, Ana Paula Couto Da Silva
Article No.: 50
DOI: 10.1145/3105969

In this work, we explore the increasing demand for novel user interfaces to navigate large media collections. We implement a geometric data structure to store and retrieve item-to-item similarity information and propose a novel navigation...

Securing Speech Noise Reduction in Outsourced Environment
Abukari M. Yakubu, Namunu C. Maddage, Pradeep K. Atrey
Article No.: 51
DOI: 10.1145/3105970

Cloud data centers (CDCs) are becoming a cost-effective method for processing and storage of multimedia data including images, video, and audio. Since CDCs are physically located in different jurisdictions, and are managed by external parties,...

Interactive Film Recombination
Fabrizio Guerrini, Nicola Adami, Sergio Benini, Alberto Piacenza, Julie Porteous, Marc Cavazza, Riccardo Leonardi
Article No.: 52
DOI: 10.1145/3103241

In this article, we discuss an innovative media entertainment application called Interactive Movietelling. As an offspring of Interactive Storytelling applied to movies, we propose to integrate narrative generation through artificial intelligence...

Complexity Correlation-Based CTU-Level Rate Control with Direction Selection for HEVC
Mingliang Zhou, Yongfei Zhang, Bo Li, Xupeng Lin
Article No.: 53
DOI: 10.1145/3107616

Rate control is a crucial consideration in high-efficiency video coding (HEVC). The estimation of model parameters is very important for coding tree unit (CTU)-level rate control, as it will significantly affect bit allocation and thus coding...

Modeling and Analysis of Power Consumption in Live Video Streaming Systems
Yousef O. Sharrab, Nabil J. Sarhan
Article No.: 54
DOI: 10.1145/3115505

This article develops an aggregate power consumption model for live video streaming systems, including many-to-many systems. In many-to-one streaming systems, multiple video sources (i.e., cameras and/or sensors) stream videos to a monitoring...

When Smart Devices Interact With Pervasive Screens: A Survey
Pai Chet Ng, James She, Kang Eun Jeon, Matthias Baldauf
Article No.: 55
DOI: 10.1145/3115933

The meeting of pervasive screens and smart devices has witnessed the birth of screen-smart device interaction (SSI), a key enabler to many novel interactive use cases. Most current surveys focus on direct human-screen interaction, and to the best...

O-Mopsi: Mobile Orienteering Game for Sightseeing, Exercising, and Education
Pasi Fränti, Radu Mariescu-Istodor, Lahari Sengupta
Article No.: 56
DOI: 10.1145/3115935

Location-based games have been around since 2000, but only recently, when PokemonGo came to market, did it become clear that they can reach wide popularity. In this article, we perform a literature-based analytical study of what kind of issues...

Performance Analysis of Game Engines on Mobile and Fixed Devices
Farouk Messaoudi, Adlen Ksentini, Gwendal Simon, Philippe Bertin
Article No.: 57
DOI: 10.1145/3115934

Mobile gaming is an emerging concept wherein gamers use mobile devices, such as smartphones and tablets, to play best-selling games. Compared to dedicated gaming boxes or PCs, these devices still fall short of executing newly complex 3D video...

An Efficient Computation Framework for Connection Discovery using Shared Images
Ming Cheung, Xiaopeng Li, James She
Article No.: 58
DOI: 10.1145/3115951

With the advent and popularity of social networks, social graphs have become essential for improving services and information relevance to users in many social media applications, which use them to predict follower/followee relationships, community membership, and so...

A Distributed Streaming Framework for Connection Discovery Using Shared Videos
Xiaopeng Li, Ming Cheung, James She
Article No.: 59
DOI: 10.1145/3120996

With the advances in mobile devices and the popularity of social networks, users can share multimedia content anytime, anywhere. One of the most important types of emerging content is video, which is commonly shared on platforms such as Instagram...

Semantic Reasoning in Zero Example Video Event Retrieval
Maaike H. T. De Boer, Yi-Jie Lu, Hao Zhang, Klamer Schutte, Chong-Wah Ngo, Wessel Kraaij
Article No.: 60
DOI: 10.1145/3131288

Searching in digital video data for high-level events, such as a parade or a car accident, is challenging when the query is textual and lacks visual example images or videos. Current research in deep neural networks is highly beneficial for the...

An Efficient Motion Detection and Tracking Scheme for Encrypted Surveillance Videos
Jianting Guo, Peijia Zheng, Jiwu Huang
Article No.: 61
DOI: 10.1145/3131342

Performing detection on surveillance videos contributes significantly to the goals of safety and security. However, performing detection on unprotected surveillance video may compromise the privacy of innocent people in the video. Therefore, striking...

PLACID: A Platform for FPGA-Based Accelerator Creation for DCNNs
Mohammad Motamedi, Philipp Gysel, Soheil Ghiasi
Article No.: 62
DOI: 10.1145/3131289

Deep Convolutional Neural Networks (DCNNs) exhibit remarkable performance in a number of pattern recognition and classification tasks. Modern DCNNs involve many millions of parameters and billions of operations. Inference using such DCNNs, if...