ACM MM 2011

ACM MM 2011 | Nov 28 - Dec 1 | Scottsdale, Arizona, USA

Program on Thursday (Dec.1, 2011)

Keynote 3: Digital Media Distribution – The Future

Speaker: Arnaud Robert, Disney

Abstract: Consumer devices and technologies are evolving faster than ever, allowing for rich, interactive user experiences. But more importantly, media consumption behavior and expectations are changing and the promise of digital media remains to be fulfilled. In this presentation, we will explore major technology trends, consumer trends, and how they intersect with the future digital media experiences and business models.

Bio: Arnaud Robert is a renowned leader in digital media strategies, and in delivering innovative, cross-platform consumer products for Disney, Technicolor, Microsoft, Nagravision-Kudelski and various start-ups.
As Senior Vice President of Technology at The Walt Disney Studios, Arnaud guides the Studio through the rapidly changing digital media ecosystem and manages the engineering, product development and user experience teams, as well as the piracy management efforts.
Arnaud received his PhD from the Swiss Institute of Technology and now also serves on the Media Council of the World Economic Forum and is a member of the National Academy of Engineering. Arnaud has 35 issued or pending patents, has published over 30 broadly distributed scientific papers, and is a sought-after speaker at corporate retreats and public events alike on a wide range of topics, from the future of digital entertainment to advances in signal processing.

Oral Session Day 3

Research 9: Media analysis and search Session Chair: Lexing Xie, ANU
Bilinear Deep Learning for Image Classification	Shenghua ZHONG*, HK Polytechnic University; Yan LIU, HK Polytechnic University; Yang Liu, The HK Polytechnic University
Retrieval Based Facial Image Annotation with Weak Label Regularized Local Coordinate Coding	Dayong Wang*, NTU, Singapore; Steven C.H. Hoi, NTU, Singapore; Ying He, Nanyang Technological Universi
Learning to Judge Image Search Results	Xinmei Tian*, Texas State University; Yijuan Lu, Texas State University; Linjun Yang, Microsoft Research Asia; Qi Tian, UTSA
Prediction of the Inter-Observer Visual Congruency (IOVC) and application to image ranking	Olivier Le Meur*, University of Rennes 1; Thierry Baccino, University of Paris VIII; Aline Roumy, INRIA

Research 10: Multimedia systems and middleware 1 Session Chair: Pal Halvorsen, Simula Research Laboratory
Modeling 3D Articulated Motions with Conformal Geometry Videos (CGVs)	Dao Thi Phuong Quynh, Nanyang Technological University; Ying He*, Nanyang Technological Universi; Jiazhi Xia, Nanyang Technological University; Xiaoming Chen, Nanyang Technological University; Qian Sun, Nanyang Technological University; Steven C.H. Hoi, NTU, Singapore
Random Partial Paired Comparison for Subjective Video Quality Assessment via HodgeRank	Qianqian Xu*, Graduate University of Chinese; Yuan Yao, ; Tingting Jiang, ; Qingming Huang, ; Bowei Yan, ; Weisi Lin,
Saving Bitrate vs. Pleasing Users: Where's the Break-Even Point of Mobile Video Quality?	Wei Song*, Queensland University of Techn
Implementation of the Discrete Wavelet Transform and Multiresolution Analysis in the Encrypted Domain	Peijia Zheng, ; Jiwu Huang*, Sun Yat-sen University

Research 11: Media analysis and search Session Chair: Marcel Worring, University of Amsterdam
Multiple Feature Hashing for Real-time Large Scale Near-duplicate Video Retrieval	Jingkuan Song*, University of Queensland; Yi Yang, The University of Queensland; Helen Huang, ; Heng Tao Shen, The University of Queensland, Australia; Richang Hong, Hefei University of Technology
Learning Heterogeneous Data for Hierarchical Web Video Classification	Xianming Liu*, Harbin Institute of Technology; Hongxun Yao, Harbin Institute of Technology; Rongrong Ji, Harbin Institute of Technology; Pengfei Xu, Harbin Institute of Technology; Sun Xiaoshuai, Harbin Institute of Technology; Qi Tian, UTSA
Coached Active Learning for Interactive Video Search	Xiao-Yong Wei*, Sichuan University of China; Zhen-Qun Yang, Sichuan University of China
Learning Concept Bundles for Video Search with Complex Queries	Jin Yuan*, National University of Singapo; Zheng-jun Zha, ; Yan-tao Zheng, ; Meng Wang, ; Xiangdong Zhou, Fudan University; Tat-Seng Chua, National Univ of Singapore

Research 12: Multimedia systems and middleware 2 Session Chair: Wei-Tsang Ooi, NUS
Flicker Effect in Scalable Lossless Video Streaming to Handheld Devices	Pengpeng Ni*, Simula Research Laboratory; Ragnhild Eg, Simula Research Laboratory; Carsten Griwodz, Simula Research Laboratory; Pal Halvorsen, ; Alexander Eichhorn, Simula Research Laboratory
An Empirical Evaluation of Battery Power Consumption for Streaming Data Transmission to Mobile Devices	Yao Liu*, George Mason University; Lei Guo, ; Fei Li, ; Songqing Chen, George Mason University
Tennis Real Play: an Interactive Tennis Game with Models from Real Videos	Jui-Hsin Lai*, National Taiwan University; Chieh-Li Chen, ; Po-Chen Wu, ; Chieh-Chi Kao, National Taiwan University; Shao-Yi Chien, National Taiwan University
Celerity: A Low-Delay Multi-Party Conferencing Solution	Chen Xiangwen*, CUHK; Minghua Chen, The Chinese University of Hong Kong; Baochun Li, University of Toronto; Zhao Yao, ; Wu Yunnan, ; Jin Li, Microsoft Research

Research 13: Media analysis and search Session Chair: Shinichi Satoh
Contextual Synonym Dictionary for Visual Object Retrieval	Wenbin Tang*, Tsinghua University; Rui Cai, ; Zhiwei Li, ; Lei Zhang, Microsoft Research Asia
Contextual image search	Wenhao Lu, Tsinghua University; Jingdong Wang*, Microsoft Research Asia; Xian-Sheng Hua, Microsoft; Shengjin Wang, Tsinghua University; Shipeng Li, Microsoft Research Asia
MUSIZ: A Generic Framework for Music Resizing with Stretching and Cropping	Zhang Liu, Tsinghua University; Chaokun Wang*, Tsinghua University
Robust Visual Reranking via Sparsity and Ranking Constraints	Nobuyuki Morioka*, University of New South Wales; Jingdong Wang, Microsoft Research Asia

Research 14: Applications Session Chair: Ichiro Ide
Motor Learning using a Kinematic-Vibrotactile Mapping targeting Fundamental Movements	Troy McDaniel*, Arizona State University; Morris Goldberg, ; Daniel Villanueva, ; Lakshmie Narayan Viswanathan, ; Sethuraman Panchanathan, Arizona State University
Affect-based Adaptive Presentation of Home Videos	Xiaohong Xiang*, NUS; Mohan Kankanhalli, NUS
Example-based Home Video Remixing Support System	Naoko Nitta*, Osaka University; Noboru Babaguchi, Osaka University
Towards Low Bit Rate Mobile Visual Search with Multiple-Channel Coding	Rongrong Ji, Harbin Institute of Technology; Ling-Yu Duan*, Peking University; Jie Chen, Peking University; Hongxun Yao, Harbin Institute of Technology; Yong Rui, Microsoft Asia; Shih-Fu Chang, Columbia University, USA; Wen Gao, Peking University

Poster Day 3

Session Chair: Bernard Merialdo

All papers, including short papers, long papers, workshop papers, and others, will also be presented in the poster plenary sessions on the same day as their regular presentations.

Short Papers
Explicit Instance Ranking Based Multi-Instance Learning for Cost Effective Video Classification	Hua Wang*, Univ. of Texas at Arlington; Feiping Nie, ; Heng Huang, University of Texas at Arlington, Department of Computer Science and Engineering
SIFT Match Verification by Geometric Coding for Large Scale Image Search	Wengang Zhou*, USTC; Houqiang Li, USTC; Yijuan Lu, USTC; Qi Tian, UTSA
Clothes Search in Consumer Photos via Color Matching and Attribute Learning	Xianwang Wang*, Hewlett-Packard Labs; Tong Zhang, Hewlett-Packard Labs
A Fast MAP Adaptation Technique for GMM-supervector-based Video Semantic Indexing Systems	Nakamasa Inoue*, Tokyo Institute of Technology; Koichi Shinoda, Tokyo Institute of Technology
Compositional object pattern: a new model for album event recognition	Shen-Fu Tsai*, UIUC; Liangliang Cao, ; Feng Tang, HP Labs.; Thomas Huang,
Prototype-based Image Search Reranking	Linjun Yang*, Microsoft Research Asia; Alan Hanjalic, TU Delft
Semi-Supervised Face Image Retrieval Using Sparse Coding with Identity Constraint	Bor-Chun Chen*, National Taiwan University; Yin-Hsi Kuo, ; Yan-Ying Chen, ; Kuan-Yu Chu, ; Winston Hsu, National Taiwan U
Level Influence of Spatial Pyramid Matching in Object Classification	Renzhong Wei, Fudan University; Hong Lu*, Fudan University; Yanran Shen, Fudan University; Xiangyang Xue, Fudan University
Ensemble Approach based on Conditional Random Field for Multi-Label Image and video Annotation	Xin-Shun Xu*, Nanjing University
Refining Local Descriptors by Embedding Semantic Information for Visual Categorization	Yingbin Zheng, Fudan University; Renzhong Wei, Fudan University; Hong Lu*, Fudan University
Common Visual Pattern Discovery via Graph Matching	Hongtao Xie*, ICT; Ke Gao, ; Yongdong Zhang, ; Jintao Li, Institute of Computing Technology, Chinese Academy of Sciences
Environmental Sound Classification for Scene Recognition Using Local Discriminant Bases and HMM	Feng Su*, Nanjing University; Li Yang, Nanjing University; Tong Lu, Nanjing University; Gongyou Wang, Nanjing University
Manifold Ordinal Regression for Image Ranking	Yang Liu*, The HK Polytechnic University; Yan LIU, HK Polytechnic University; Shenghua ZHONG, HK Polytechnic University; Keith C.C. Chan, The HK Polytechnic University
Blurred Image Region Detection and Classification	Bolan Su*, National University of Singapo; Shijian Lu, Institute for Infocomm Research, Singapore; Chew Lim Tan, School of Computing, National University of Singapore
Human group activity analysis with fusion of motion and appearance context information	Zhongwei Cheng*, GUCAS; Lei Qin, ICT; Qingming Huang,
Exploring Latent Class Information for Image Retrieval using the Bag-of-Feature Model	Lingqiao Liu*, Australian National University; Lei Wang, University of Wollongong
Combining Latent Semantic Learning and Reduced Hypergraph Learning for Semi-Supervised Image Categorization	Zhiwu Lu, Peking University; Yuxin Peng*, Peking University
Optimal Batch Selection for Active Learning in Multi-label Classification	Shayok Chakraborty*, Arizona State University; Vineeth Balasubramanian, ; Sethuraman Panchanathan, Arizona State University
Extracting Intentionally Captured Regions Using Point Trajectories	Yuta Nakashima*, Osaka University; Noboru Babaguchi, Osaka University
Exploring Self-Similarity of Bag-of-Features for Image Classification	Chih-Fan Chen, Academia Sinica; Yu-Chiang Frank Wang*, Academia Sinica
Non-rigid 3D Model Retrieval Using Multi-Scale Local Features	Pengjie Li*, Beijing University of Posts an; Ma Huadong, ; Anlong Ming,
Marginals-based Visual Alphabets for Local Image Descriptors Aggregation	Miriam Redi*, Eurecom; Bernard Merialdo,
Modeling Multimedia Contents through Probabilistic Feature Signatures	Christian Beecks*, RWTH Aachen University; Anca Ivanescu, RWTH Aachen University; Steffen Kirchhoff, RWTH Aachen University; Thomas Seidl, RWTH Aachen University
Bag of colors for improved image search	Christian Wengert, Kooaba; Matthijs Douze, INRIA; Herve Jegou*, INRIA
Asymmetric Hamming Embedding	Mihir Jain, INRIA; Hervé Jégou*, INRIA; Patrick Gros, INRIA
Spatially-coherent Pyramid Matching Based on Max-pooling	Xiangang Cheng*, Nanyang Technological Univ.; Liang-Tien Chia,
Text and Image Subject Classifiers: Dense Works Better	Daan Vreeswijk*, Universiteit van Amsterdam; Bouke Huurnink, University of Amsterdam; Arnold Smeulders, University of Amsterdam
Automatic Concept-to-Query Mapping for Web-based Concept Detector Training	Damian Borth*, University of Kaiserslautern; Adrian Ulges, DFKI (Germany); Thomas Breuel,
Hypergraph Spectral Hashing for Similarity Search of Social Image	Yueting Zhuang, Zhejiang University; Yang Liu*, Zhejiang University; Fei Wu, Zhejiang University; Jian Shao, Zhejiang University; Yin Zhang, Zhejiang University
Selecting the Best Faces to Index Presentation Videos	Michele Merler*, Columbia University; John Kender,
A Biologically Inspired Computational Model for Image Saliency Detection	Sheng He*, Northwest Polytechnical Univer; Junwei Han, NWPU; Xintao Hu, ; Ming Xu, NWPU; Lei Guo, ; Tianming Liu,
Unsupervised Fast Anomaly Detection in Crowds	Sun Xiaoshuai*, Harbin Institute of Technology; Hongxun Yao, Harbin Institute of Technology; Rongrong Ji, Harbin Institute of Technology; Xianming Liu, Harbin Institute of Technology; Pengfei Xu, Harbin Institute of Technology
Video Indexing and Recommendation Based on Affective Analysis of Viewers	Zhao Sicheng*, Harbin Institute of Technology; Hongxun Yao, Harbin Institute of Technology; Sun Xiaoshuai, Harbin Institute of Technology; Pengfei Xu, Harbin Institute of Technology; Rongrong Ji, Harbin Institute of Technology; Xianming Liu, Harbin Institute of Technology
Eventscapes: Visualizing events over time with emotive facets	Brett Adams*, Curtin University; Dinh Phung, Curtin University; Svetha Venkatesh, Curtin University
Signature-Image-Based Event Analysis for Personal Photo Albums	Minh-Son DAO*, University of Trento, Italy; Duc-Tien Dang-Nguyen, DISI - University of Trento, Italy; Francesco De Natale,
Leveraging Collective Wisdom for Web Video Retrieval through Heterogeneous Community Discovery	Lin Pang*, Chinese Academy of Sciences; Juan Cao, Chinese Academy of Sciences; Yongdong Zhang, ; Shouxun Lin,
Digitized Comic Search System with Automatic Preview Generation of Comic Episodes	Keiichiro Hoashi*, KDDI R&D Laboratories; Chihiro Ono, KDDI R&D Laboratories; Daisuke Ishii, Waseda University; Hiroshi Watanabe, Waseda University
2D Geon Based Generic Object Recognition	Xiangqian Yu*, New Jersey Institute of Tech; Vincent Oria, New Jersey Institute of Tech; Pierre Gouton, Universite de Bourgogne; Genevieve Jomier, Universite de Paris-Dauphine
Image Annotation by Composite Kernel Learning with Group Structure	Yueting Zhuang, ; Ying Yuan*, ZheJiang University; Fei Wu,
Video-to-Shot Tag Allocation by Weighted Sparse Group Lasso	Xiaofeng ZHU, University of Queensland; Helen Huang, ; Heng Tao Shen*, The University of Queensland, Australia
Extracting Key Frames from Consumer Videos Using Bi-layer Group Sparsity	Zheshen Wang*, Arizona State University; Mrityunjay Kumar, Kodak Research Laboratories; Jiebo Luo, Kodak; Baoxin Li, Arizona State University
Spatial Pooling For Transformation Invariant Image Representation	Xia Li*, UTSA; Yan Song, Univ of Sci. and Tech of China; Yijuan Lu, Texas State University; Qi Tian, UTSA
Multi-Feature pLSA for Combining Visual Features in Image Annotation	Rui Zhang*, Ryerson Multimedia Laboratory; Lei Zhang, Microsoft Research Asia; Xin-Jing Wang, Microsoft Research Asia; Ling Guan, Ryerson University
Tag-Based Social Image Search with Visual-Text Joint Hypergraph Learning	Yue Gao*, Tsinghua University; Meng Wang, ; Jialie Shen, Singapore Management University, Singapore; Dacheng Tao, ; Qionghai Dai,
Integrating Rich Information for Video Recommendation with Multi-Task Rank Aggregation	Xiaojian ZHAO*, NUS; Guangda LI, ; Meng Wang, ; Zheng-Jun Zha, ; Jin Yuan, National University of Singapo; Zhoujun LI, ; Tat-Seng Chua, National Univ of Singapore
Quantifying Human Reconstruction Accuracy for Voxel Carving in a Sporting Environment	David Monaghan*, Dublin City University; Philip Kelly, Dublin City University; Noel O'Connor, CLARITY: Centre for Sensor Web Technologies
Time-aware Co-Training for Indoors Localization in Visual Lifelogs	Vladislavs Dovgalecs*, University of Bordeaux, France; Remi Megret, IMS Lab, University of Bordeau; Yannick Berthoumieu, University of Bordeaux, France
Automatic Sentence Generation from Images	Yoshitaka Ushiku*, University of Tokyo; Tatsuya Harada, The Univ. of Tokyo; Yasuo Kuniyoshi, University of Tokyo
Audio-Visual Fusion Using Bayesian Model Combination for Web Video Retrieval	Vasant Manohar*, Raytheon BBN Technologies; Stavros Tsakalidis, Raytheon BBN Technologies; Pradeep Natarajan, Raytheon BBN Technologies; Rohit Prasad, Raytheon BBN Technologies; Prem Natarajan, Raytheon BBN Technologies
Enriching and Localizing Semantic Tags in Internet Videos	Lamberto Ballan, Università di Firenze - MICC; Marco Bertini*, ; Alberto Del Bimbo, Università di Firenze - MICC; Giuseppe Serra, Università di Firenze - MICC
Scene Segmentation of Wedding Party Videos by Scenario-based Matching with Example Videos	Kazuki Sawai, Nagoya University; Tomokazu Takahashi*, Gifu Shotoku Gakuen University; Daisuke Deguchi, Nagoya University; Ichiro IDE, ; Hiroshi Murase, Nagoya University
Content-aware Photo-on-photo Composition For Consumer Photos	Aibo Tian*, University of Texas at Austin; Xuemei Zhang, ; Daniel Tretter, HP Labs
PhotoFeel: Feeling Your Photo Collection by Graph-based Audiovisual Flocking Simulation	Cheng-Te Li*, National Taiwan University; Hsieh Hsun-Ping,
Learning to Produce 3D Media from a Captured 2D Video	Minwoo Park*, Kodak.com; Jiebo Luo, Kodak; Andrew Gallagher, Eastman KODAK; Majid Rabbani, Eastman KODAK
A Tool for Authoring Unambiguous Links from Printed Content to Digital Media	Andreas Girgensohn*, FX Palo Alto Laboratory; Lynn Wilcox, FX PAL; Frank Shipman, Texas A&M University; Qiong Liu, FX Palo Alto Laboratory; Chunyuan Liao, FXPAL; Yuichi Oneda, Fuji Xerox
Interactive Digital Scrapbook Generation for Travel Photos Based on Design Principles of Typography	Jung-Yu Yeh, National Taiwan University; Min-Chun Hu*, National Taiwan University; Wen-Huang Cheng, Research Center for Information Technology Innovation (CITI), Academia Sinica; Ja-Ling Wu, National Taiwan University
Avatar Personalization and Augmentation: Prominent Structure Analysis of 3D Morphable Models	Peng Wang*, Intel Labs China
Static and Dynamic Video Summaries	Yingbo Li*, EURECOM; Bernard Merialdo, EURECOM; Mickael Rouvier, University of Avignon; Georges Linares, University of Avignon

Industrial Exhibit / Technical Demos 2

Session chair: Roger Zimmermann (National University of Singapore)

Industrial Exhibits 2
Towards A Real Time Public Transport Awareness System: Case Study In Dublin	Eric Bouillet*, IBM Research; Luca Gasparini, IBM Research; Olivier Verscheure, IBM Research
ARA: The Active Reading Application	Gene Golovchinsky, FX Palo Alto Laboratory; Scott Carter*, FX Palo Alto Laboratory; Tony Dunnigan, FX Palo Alto Laboratory
myUnity: A new platform to support communication in the modern workplace	Jacob Biehl*, FX Palo Alto Laboratory, Inc.; Thea Turner, FX Palo Alto Laboratory, Inc.; William van Melle, FX Palo Alto Laboratory, Inc.; Andreas Girgensohn, FX Palo Alto Laboratory, Inc.

Technical Demos 2
Galaxy Browser: Exploratory Search of Web Videos	Lei Pang, ; Song Tan*, City University of Hong Kong; Hung-Khoon Tan, ; Chong-Wah Ngo,
A Mobile Location Search System with Active Query Sensing	Felix X. Yu, Columbia University; Rongrong Ji*, Columbia University; Tongtao Zhang, Columbia University; Shih-Fu Chang, Columbia University
Collaborative Authoring of Serious Games for Health	Florian Mehm*, TU Darmstadt; Stefan Göbel, TU Darmstadt, KOM; Sandro Hardy, TU Darmstadt, KOM
PhotoReference: Towards Capturing a Great Photo via Learning from Community-Contributed Photo Collections	Haojie Li*, Dalian University of Technolog; Lei Yi, ; Jinhui Tang, ; Xiaohui Wang,
Changing Video Arrangement for Constructing Alternative Stories	Alberto Piacenza, University of Brescia; Fabrizio Guerrini*, University of Brescia; Nicola Adami, University of Brescia; Riccardo Leonardi, University of Brescia; Jonathan Teutenberg, Teesside University; Julie Porteous, Teesside University; Marc Cavazza, Teesside University
Visual-based plant species identification from crowdsourced data	Hervé Goëau*, Inria; Alexis Joly, Inria, Imedia Team; Souheil Selmi, Inria, Imedia Team; Pierre Bonnet, CIRAD, UMR AMAP; Elise Mouysset, Tela Botanica Association; Laurent Joyeux, Inria, Imedia Team; Jean-François Molino, IRD, UMR AMAP; Philippe Birnbaum, CIRAD, UMR AMAP; Daniel Barthélémy, CIRAD; Nozha Boujemaa, Inria
Dynamic Media Show Drivable by Semantics	Vivek Singh*, Univ. of California, Irvine; Jiebo Luo, Kodak Research Labs; Dhiraj Joshi, Kodak Research Labs; Madirakshi Das, Kodak Research Labs; Phoury Lei, Kodak Research Labs; Peter Stubler, Kodak Research Labs
SIRE: A Social Image Retrieval Engine	Steven C.H. Hoi*, NTU, Singapore; Pengcheng Wu, NTU, Singapore
Processing of Multimedia Data using the P2G Framework	Håkon Stensland*, Simula Research Laboratory; Paul Beskow, University of Oslo; Håvard Espeland, University of Oslo; Espen Kristiansen, University of Oslo; Preben Olsen, University of Oslo; Ståle Kristoffersen, ; Carsten Griwodz, University of Oslo; Pål Halvorsen, University of Oslo
PositionIt – An Image-based Remote Target Localization System on Smartphones	Qia Wang*, University of Missouri; Alex Lobzhanidze, University of Missouri; Suman Roy, University of Missouri; Wenjun Zeng, University of Missouri; Yi Shang, University of Missouri
Low-cost Creation of a 3D Interactive Museum Exhibition	David Monaghan*, Dublin City University; Noel O'Connor,
Interactive Rich Reading: Enhanced Book Reading Experience with a Conversational Agent	Koichi Mori*, Nokia Research Center; Rafael Ballagas, Nokia Research Center; Glenda Revelle, University of Arkansas; Hayes Raffle, Nokia Research Center; Hiroshi Horii, Nokia Research Center; Mirjana Spasojevic, Nokia Research Center
Hierarchical Video Browsing with a 3D Carousel	Klaus Schoeffmann*, Klagenfurt University; Manfred del Fabro, Klagenfurt University
SACOZI: Crowdsourced and Content-based Zoomable Video Player	Axel Carlier*, IRIT - University of Toulouse; Arash Shafiei, IRIT - University of Toulouse; Julien Badie, IRIT - University of Toulouse; Salim Bensiali, IRIT - University of Toulouse; Vincent Charvillat, ENSEEIHT/IRIT; Wei Tsang Ooi, National University of Singapore
The Director's Lens: An Intelligent Interface for Composing Virtual 3D Cinema	William Bares, Millsaps College; Christophe Lino, INRIA Rennes; Roberto Ranon, University of Udine; Marc Christie*, IRISA / INRIA Rennes
Video2GPS: A Demo of Multimodal Location Estimation on Flickr Videos	Gerald Friedland*, ICSI; Jaeyoung Choi, ICSI; Adam Janin, ICSI
Towards a Privacy Preserving Personal Photo Album Manager with Semantic classification, Indexing and Querying Capabilities	Jichao Sun*, NJIT; Alexis Fesnin, CNAM/CEDRIC; Vallerie Gouet-Brunet, CNAM/CEDRIC; Scott Kominen, New Jersey Institute of Technology; Vincent Oria, NJIT
Million-scale Near-duplicate Video Retrieval System	Yang Cai*, Zhejiang University; Linjun Yang, Microsoft Research Asia; Wei Ping, ; Fei Wang, ; Tao Mei, Microsoft Research Asia; Xian-Sheng Hua, Microsoft Research Asia; Shipeng Li, Microsoft Research Asia
Mobile Product Search with Bag of Hash Bits	Junfeng He*, Columbia University; Tai-Hsu Lin, Columbia University; Shih-Fu Chang, Columbia University

Ubi-MUI

International ACM Workshop On Ubiquitous Meta User Interfaces (Ubi-MUI'11)
Thursday morning Dec. 1, 2011
Ali Asghar Nazari Shirehjini (DISCOVERLab, University of Ottawa)
Sahin Albayrak (DAI-Labor, Technical University Berlin)
Abdulsalam Yassin (Alcatel Lucent, Ottawa)

Session Schedule

10:30-10:35	Opening Remarks
10:35-11:00	Control smart homes easily with simple touch	Sokratis Kartakis; Margherita Antona; Constantine Stephanidis
11:00-11:25	Gamifying Intelligent Environments	Yefeng Liu; Todorka Alexandrova; Tatsuo Nakajima
11:25-11:50	Color-based object detection using homogeneous probability redistribution of histograms	Mehdi Madani; Mahdi Bagheri
11:50-12:15	An Intelligent Multimedia system for Nutrient Intake Monitoring	Gregorio Villalobos; Rana Almaghrabi; Behnoosh Hariri; Shervin Shirmohammadi

MTDL

International ACM Workshop on Multimedia Technologies for Distance Learning (MTDL'11)
Thursday Dec. 1, 2011
Rynson Lau (CUHK)
Timothy Shih (NCU, Taiwan)
Frederick Li (Durham University)
Neil Yen (Waseda University)

1. Regular Paper Session

Session Chair: Neil Yen, Waseda University, Japan
Time: 2:00PM - 4:05PM (25 minutes for each presentation)

2:00-2:25	Interactive Mixed Reality for Collaborative Remote Laboratories	Benjamin Jailly, Christophe Gravier, Marius Preda, and Jacques Fayolle
2:25-2:50	A Novel Framework for Distance Education using Asynchronous Interaction	Hiranmay Ghosh, Sujal Wattamwar, Tavleen Oberoi, Gautam Shroff, Arpan Pal, Ranjan Dasgupta, and Kingshuk Chakravarty
2:50-3:15	A-VIEW: Real Time Collaborative Multimedia e-Learning	K.R. Jayahari, Kamal Bijlani, and Ancy Mathew
3:15-3:40	Discovering Small-World in Association Link Networks For Web-based Learning	Shunxiang Zhang and Xiangfeng Luo
3:40-4:05	A User Evaluation Framework for Web-Based Learning Systems	Ke Niu, Wei Chen, Zhendong Niu, Qing Li, Peipei Gu, Yi Li, and Zhilei Huang

2. Invited Paper Session

Session Chair: Wei Chen, Beijing Institute of Technology, China
Time: 4:30PM – 5:45PM (25 minutes for each presentation)

4:30-4:55	Interactive Tutoring of Cooking Activities with Personalized Multimedia Recipe Search	Qing Li, Xudong Mao, and Haoran Xie
4:55-5:20	Game-Based Concept Visualization for Learning Programming	Frederick W. B. Li and Chris Watson
5:20-5:45	Discovery of Implicit Correlation between Shared Information in an Open Environment	Neil Y. Yen and Qun Jin

SBNMA

ACM Workshop on Social and Behavioural Networked Media Access (SBNMA'11)
Thursday Dec. 1, 2011
Naeem Ramzan (Queen Mary University of London, UK)
Fei Wang (IBM T.J. Watson Research Center)
Charalampos Z. Patrikakis (Technological Educational Institute of Piraeus, Greece)
Peng Cui (Tsinghua University)
Nikos Doulamis (National Technical University of Athens, Greece)
Shiqiang Yang (Tsinghua University)
Gordon Sun (Tencent Technologies)

10:30-10:40	Welcome
(10:40-12:30)	SBNMA paper session 1 Session Chair: Peng Cui (Tsinghua University, China)
10:40-11:00	Graph-based recognition in Photo Collections using Social Semantics, Markus Brenner, Ebroul Izquierdo
11:00-11:20	Event Analytics via Social Media, Yuheng Hu, Ajita John, Doree Seligmann
11:20-11:40	Pursuing the Holy Grail by interrelating user intentions and Bag of Visual Words to perform retrieval adaptation, Marian Kogler, Mathias Lux
11:40-12:00	Using media related user profiles to personalize multimedia access over social networks, Lemonia Argyriou, Charalampos Z Patrikakis, Stuart CM Porter, Nikolaos Papaoulakis, Christina Androulaki
12:00-12:20	Measuring Media-based Social Interactions Provided by Smartphones Applications in Social Networks, Alan Gomes, Maria da Graca Pimentel
12:20-12:30	Discussion
12:30-14:00	Lunch break
(14:00-15:20)	SBNMA paper session 2 Session Chair: Charalampos Z. Patrikakis (Technological Educational Institute of Piraeus, Greece)
14:00-14:20	A Hierarchical, Multi-modal Approach for Placing Videos on the Map using Millions of Flickr Photographs, Pascal Kelm, Sebastian Schmiedeke, Thomas Sikora
14:20-14:40	Improving Video Classification via YouTube Video Co-Watch Data, John Zhang, Yang Song, Thomas Leung
14:40-15:00	Gaze Movement Inference for User Adapted Image Annotation and Retrieval, S. Navid Haji Mirza, Ebroul Izquierdo
15:00-15:20	Employing clustering algorithms to create user groups for personalized context aware services provision, Athanasios S. Voulodimos, Anastasios D. Doulamis, Charalampos Z. Patrikakis, Emmanuel S. Sardis, Pantelis N. Karamolegkos
(15:20-16:00)	Keynote Address: Prof Tat-Seng Chua (National University of Singapore), "Learning the Social Pulses of a City from User-Generated Information Sources" Session Chair: Fei Wang (IBM T.J. Watson Research Center, USA)
16:00-16:30	Coffee break
(16:30-17:10)	SBNMA paper session 3 Session Chair: Shiqiang Yang (Tsinghua University, China)
16:30-16:50	Adaptive Multi-view Video Streaming over P2P Networks Considering Quality of Experience, Saadet Sedef Savas, Cihat Goktug Gurler, A. Murat Tekalp
16:50-17:10	Social and Behavioural Media Access: A Survey, Naeem Ramzan, Peng Cui, Fei Wang, Shiqiang Yang
(17:10-17:50)	Keynote Address: Prof Ebroul Izquierdo (Queen Mary University of London, UK), "Social Networked Media: Advances and Trends" Session Chair: Naeem Ramzan (Queen Mary University of London, UK)
17:50-18:00	Closing remarks

J-HGBU

Joint ACM Workshop on Human Gesture and Behavior Understanding (J-HGBU'11)
Thursday Dec. 1, 2011
Part 1: International ACM Workshop on Social Signal Processing (SSPW’11)
Maja Pantic (Imperial College London)
Alex Pentland (MIT, Media Lab, Cambridge, MA)
Alessandro Vinciarelli (University of Glasgow)
Part 2: International ACM Workshop on Multimedia access to 3D Human Objects (MA3HO'11)
Rita Cucchiara (Università di Modena e Reggio Emilia, Italy)
Mohamed Daoudi (TELECOM Lille, France)
Alberto del Bimbo (Università di Firenze, Italy)

Papers
Regular	Person authentication using 3D human motion	Felipe Gomez-Caballero; Takahiro Shinozaki; Sadaoki Furui; Koichi Shinoda
Regular	Estimation and utilization of articulations in recovering non-rigid structure from motion using motion subspaces	Rohith MV; Chandra Kambhamettu
Regular	Human Activity Recognition using Multiple Views: A Comparative Perspective on Recent Developments	Michael Holte; Cuong Tran; Mohan Trivedi; Thomas Moeslund
Invited	Fully Automatic 3D Facial Expression Recognition using a Region-Based Approach	Pierre Lemaire, Liris; Mohsen Ardabilian; Mohamed Daoudi; Boulbaba Ben Amor; Liming Chen
Invited	3DPeS: 3D People Dataset for Surveillance and Forensics	Davide Baltieri; Rita Cucchiara; Roberto Vezzani
Invited	3D Partial Face Matching using Local Shape Descriptors	Stefano Berretti; Alberto Del Bimbo; Pietro Pala
Poster	Multi-stage Feature Point Detection for 3D Human Data	Xiang Pan, Zhejiang; Alexander Agathos
Poster	Human Motion Classification and Management Based on Mocap Data Analysis	Harshad Kadu; C.-C. Jay Kuo; Maychen Kuo
Poster	3D Perceptual Shape Feature-based Body Parts Classification and Pose Estimation	Gang Hu; Qigang Gao
Poster	Landmark Recognition and Retrieval: From 2D to 3D	Xian Xiao; Changsheng Xu; Jinqiao Wang; Min Xu
Invited Poster	The Florence 2D/3D Hybrid Face Dataset	Andrew Bagdanov; Alberto Del Bimbo; Iacopo Masi
Regular	The Sounds of Social Life: Naturalistic (Acoustic) Observation Sampling	Matthias Mehl; Fenne Deters
Regular	Computational Study of Human Communication Dynamics	Louis-Philippe Morency
Regular	A Survey of Perception and Computation of Human Beauty	Hatice Gunes
Regular	Behavioral signal processing for understanding (distressed) dyadic interactions: Some recent developments	Panayiotis Georgiou; Matthew Black; Shrikanth Narayanan
Poster	Incorporating Uncertainty in a Layered HMM Architecture for Human Activity Recognition	Michael Glodek; Lutz Bigalke; Martin Schels; Friedhelm Schwenker
Poster	Multimodal Recognition of Personality during Short Self-Presentations	Ligia Batrinca; Bruno Lepri; Fabio Pianesi
Poster	Automatic recognition of coordination level in an imitation task	Emilie Delaherche; Mohamed Chetouani

AIEMPro

International ACM Workshop on Automated Media Analysis and Production for Novel TV Services (AIEMPro 2011)
Thursday Dec. 1, 2011
Sid-Ahmed Berrani (Orange Labs - France Telecom, France)
Alberto Messina (RAI - Centre for Research and Technological Innovation, Italy)
Robbie De Sutter (VRT-medialab, Belgium)
Jean-Pierre Evain (European Broadcasting Union, Switzerland)
Gerald Friedland (International Computer Science Institute, USA)
Masanori Sano (Japan Broadcasting Corporation, Japan)
Patrick Gros (INRIA, France)
Shin'ichi Satoh (National Institute of Informatics, Japan)

Papers
Regular	Speech recognition tools in a media retrieval system	Mike Matton; Karel Braeckman
Regular	Sequence-based Kernels for Online Concept Detection in Video	Werner Bailer
Regular	Picture-in-Picture copy detection using spatial coding techniques	Sanjay Purushotham; Qi Tian; C.-C. Jay Kuo
Regular	News Story Clustering from Both What and How Aspects: Using Bag of Word Model and Affinity Propagation	Wei-Ta Chu; Chao-Chin Huang; Wen-Fang Cheng
Invited	From audio recurrences to TV program structuring	Alina Abduraman; Sid-Ahmed Berrani; Jean-Bernard Rault; Olivier Le Blouch
Regular	Produce. Annotate. Archive. Repurpose – Accelerating the Composition and Metadata Accumulation of TV Content	Robert Knauf; Jens Kürsten; Albrecht Kurze; Marc Ritter; Arne Berger; Stephan Heinich; Maximilian Eibl


Industry Panel

Innovating the Multimedia Experience
Thursday December 1st, 2:00PM - 4:00PM, 2011
Organizers & Moderators:
Khaled El-Maleh, Qualcomm Inc., kelmaleh@qualcomm.com
Haohong Wang, TCL Research America, haohongwang@gmail.com

Panelists:
1. Susie Wee, Vice President and Chief Technology and Experience Officer of Collaboration and Communication at Cisco Systems.
2. Heather Yu, Director, Media Networking Lab, Huawei Tech USA.
3. James D. Johnston, IEEE Distinguished Lecturer, Audio Technology Pioneer.
4. Zhengyou Zhang, Principal Researcher at Microsoft, Affiliate Professor at University of Washington.

Abstract
In this panel, each panelist will present their view of the current state of the art in research and product innovation across the three major areas of multimedia experience: visual, auditory and gaming. We will discuss examples of innovation that enhance the consumption and sharing of multimedia (video, audio, graphics, etc.) and thus improve the quality of the user experience. Another major focus of this panel is to open the discussion on how to create new multimedia user experiences.

Categories and Subject Descriptors
H.1.2 [User/Machine Systems]: Human factors; H.5.1 [Information Interfaces and Presentation]: Multimedia Information Systems

General Terms
Human Factors

Keywords
Multimedia experience, Visual media, Auditory media, Gaming, Sharing, User experience

Khaled El-Maleh (Qualcomm Inc.) received his M.Eng. and Ph.D. from McGill University, Canada, and B.Sc. in EE and B.Sc. in Applied Math from King Fahd University (KFUPM), Saudi Arabia. He has been with Qualcomm, San Diego since 2000. Dr. El-Maleh’s areas of expertise and interests include design, implementation and quality evaluation of mobile multimedia systems, with a focus on speech and video compression/processing algorithms, innovation and technology transfer. He was a key member of the team that designed the first generation of mobile camcorder and video telephony solutions for both UMTS and CDMA networks. He was the video quality lead for the Qualcomm chip unit and contributed to the design of media quality assessment tools, with a focus on automated on-chip quality testing. He is currently leading the multimedia engineering team in the Strategic IP Department of Qualcomm, with a focus on developing innovative technologies to enhance the mobile multimedia user experience. In addition to his technical publications, he has filed more than 200 US and international patents covering video/image/speech processing and compression technologies, mobile learning and wireless power charging. He is a founding member of the International Workshop on Quality of Multimedia Experience (QoMEX) and served as the General Co-Chair for QoMEX 2009. He has served as TPC Vice Chair (Tutorial Co-Chair) for Globecom 2010, TPC member for ACM Multimedia 2010, Steering Committee member for QoMEX 2010 and QoMEX 2011, Industry Co-Chair for ICME 2011, Industry Chair for SiPS 2011, and Advisory Committee member of IEEE DSP/SPE 2011. He is currently serving as a member of the steering committee of ICME 2012-2014, Industry Panel Co-Chair of ACM Multimedia 2011, Industry Co-Chair for ICIP 2012, Special Sessions Co-Chair for MMSP 2012, and Tutorial Co-Chair for ICASSP 2013.
He is a guest editor of two upcoming special journal issues on Quality of Multimedia User Experience (IEEE Signal Processing Magazine and European Journal of Signal and Image Processing), to be published in 2011. Khaled is the recipient of Qualcomm's first Career Thought Leader Award.

Haohong Wang (TCL Research America) received the B.S. degree in computer science and the M.Eng. degree in computer applications, both from Nanjing University, China, the M.S. degree in computer science from the University of New Mexico, and the Ph.D. degree in electrical and computer engineering from Northwestern University, Evanston, USA. He is currently the General Manager of TCL Research America, TCL Corporation, in Santa Clara, California, in charge of the overall corporate research activities in North America, including 5 research labs at multiple locations. Prior to that, he held various technical and management positions at AT&T, Catapult Communications, Qualcomm, Marvell, TTE and Cisco. Dr. Wang's research involves the areas of multimedia communications, 3D graphics, video analysis and processing, and 3DTV systems. He has published more than 50 articles in peer-reviewed journals and international conferences. He is the inventor of more than 40 U.S. patents and pending applications. He is the co-author of 4G Wireless Video Communications (John Wiley & Sons, 2009) and Computer Graphics (1997).

Dr. Wang is the Editor-in-Chief of the Journal of Communications (2008-), the Technical Program Chair of IEEE GLOBECOM 2010 (Miami) and the General Chair of IEEE ICME 2011 (Barcelona) and IEEE ICCCN 2011 (Maui). He chairs the IEEE Multimedia Communications Technical Committee (2010-2012), and is an elected member of the IEEE Visual Signal Processing and Communications Technical Committee (2005-), and IEEE Multimedia and Systems Applications Technical Committee (2006-). He is a member of the Steering Committee of IEEE Transactions on Multimedia. He has served as the Editor-in-Chief of the IEEE MMTC E-Letter (2009) and as an Editor (or Guest Editor) of IEEE Transactions on Multimedia, IEEE Communications Surveys & Tutorials, IEEE Communications Magazine, ACM Multimedia Systems Journal and many others. He served as the General Chair of the 17th IEEE International Conference on Computer Communications and Networks (ICCCN 2008) (US Virgin Islands).

Susie Wee (Vice President and Chief Technology and Experience Officer of Collaboration and Communication at Cisco Systems) is responsible for driving innovation and experience design in Cisco's collaboration products and software services. Prior to this, she was at Hewlett Packard in the roles of founding Vice President of the Experience Software Business and Chief Technology Officer of Client Cloud Services in HP’s Personal Systems Group and Lab Director of the HP Labs Mobile and Media Systems Lab. Susie was the co-editor of the JPSEC standard for the security of JPEG-2000 images and the editor of the JPSEC amendment on File Format Security. She was formerly an associate editor for the IEEE Transactions on Circuits and Systems for Video Technology and for the IEEE Transactions on Image Processing. While at HP Labs, Susie was a consulting assistant professor at Stanford University where she co-taught a graduate-level course on digital video processing. Susie received Technology Review’s Top 100 Young Innovators award, ComputerWorld's Top 40 Innovators under 40, the INCITS Technical Excellence award, and the Women In Technology International Hall of Fame award. She is an IEEE Fellow for her contributions in multimedia technology and has over 50 international publications and over 40 granted patents. Susie received her B.S., M.S., and Ph.D. degrees from the Massachusetts Institute of Technology.

Heather Yu (Director, Media Networking Lab, Huawei Tech USA) received her Ph.D. from Princeton University in 1998. Currently, she is the Director of the Huawei Media Networking Lab located in Bridgewater, NJ. With the mission of establishing a world-class R&D team and leading key multimedia technology innovations, she led the NJ team in developing several new media technology research areas and a series of technology innovations that offer competitive capabilities and support various functionalities for Huawei’s products. Before joining Huawei, she was with Panasonic Princeton Lab, working on media communication, media processing, media security, and P2P technology research. Since graduating from Princeton, Heather has served in numerous positions in related associations, such as Chair of the IEEE Multimedia Communications Tech Committee, IEEE Communications Society Strategic Planning Committee member, IEEE Human Centric Communications emerging technology committee chair, Associate Editor in Chief for the PPNA journal, Associate Editor of several IEEE journals/magazines, and Conference Chair and TPC Chair for many conferences in the field. She holds 23 granted US patents and has many pending. She has published 70+ works, including 4 books: P2P Networking and Applications, Semantic Computing, P2P Handbooks, and Multimedia Security Technologies for Digital Rights Management.

James D. Johnston (IEEE Distinguished Lecturer, Audio Technology Pioneer) received his BSEE and MSEE from Carnegie Mellon University. Mr. Johnston has been called the father of perceptual audio coding for his pioneering contributions that revolutionized digital audio. His accomplishments during a 26-year career at AT&T Bell Labs have, among other achievements, allowed for the distribution of digital music and digital radio over the Internet. He is presently an independent consultant, having retired from the position of Chief Scientist at DTS, Inc., where he worked on a variety of acoustical modeling, preprocessing and postprocessing algorithms for audio capture, analysis, control, and presentation.

Dr. Johnston joined DTS Inc. from his position at Neural Audio. Prior to that, he worked for 5 years at Microsoft Corporation in the "Codecs", "Core Media Processing" and finally the video services groups as Audio Architect. Dr. Johnston retired from AT&T Labs - Research, quartered at Florham Park, NJ, Speech Processing Software and Technology Research Department. Before that, he was employed by AT&T Bell Laboratories, in the Acoustics Research Department under Dr. J. L. Flanagan, and in the Signal Processing Research Department.

Dr. Johnston was the primary researcher and inventor of the MPEG-2 AAC audio coding algorithm, and a principal contributor to the "MP3" algorithm. He also represented AT&T in the ANSI-accredited group X3L3.1, and X3L3.1 in the ISO-MPEG-AUDIO (MP3, AAC) arena.

Dr. Johnston was awarded the IEEE James L. Flanagan Signal Processing Field Award (2006), was elected Fellow of the Audio Engineering Society (1997), received the AT&T Technology Medal and AT&T Standards Award (1998), received a New Jersey Inventor of the Year Award (2001), and was elected IEEE Fellow (2002).

Dr. Johnston’s current research interests include acoustic scene modelling, loudspeaker design, loudspeaker pattern control, cochlear modelling, masking threshold models, stereo imaging models and stereo imaging sensitivity models, methods of reproducing soundfields either literally or perceptually, microphone and soundfield capture techniques, both actively steered and time-invariant, and speech and audio coding methods in general.

Zhengyou Zhang (Principal Researcher at Microsoft, Affiliate Professor at University of Washington) is a Fellow of the Institute of Electrical and Electronics Engineers (IEEE). He is the Founding Editor-in-Chief of the newly established IEEE Transactions on Autonomous Mental Development (IEEE T-AMD), and is on the Editorial Board of the International Journal of Computer Vision (IJCV), Machine Vision and Applications, and the Journal of Computer Science and Technology (JCST). He was on the Editorial Board of the IEEE Transactions on Pattern Analysis and Machine Intelligence (IEEE T-PAMI) from 1999 to 2005, the IEEE Transactions on Multimedia (IEEE T-MM) from 2004 to 2009, and the International Journal of Pattern Recognition and Artificial Intelligence (IJPRAI) from 1997 to 2008, among others. He is listed in Who's Who in the World, Who's Who in America and Who's Who in Science and Engineering.

Before joining Microsoft, Zhengyou worked at INRIA (French National Institute for Research in Computer Science and Control) for 11 years, where he worked in the Computer Vision and Robotics group and had been a Senior Research Scientist since 1991. In 1996-1997, he spent a one-year sabbatical as an Invited Researcher at the Advanced Telecommunications Research Institute International (ATR), Kyoto, Japan.

He holds more than 80 US patents and has about 20 patents pending. He has published over 200 papers in refereed international journals and conferences, and is the author of several books.


ACM Multimedia 2011

Nov 28th - Dec 1st, 2011 Scottsdale, Arizona, USA

Twitter: @acmmm11
Linked In: ACM SIGMM
Flickr: flickr.com/acmmm11
