ACM MM 2009 Header Image

Yahoo! Challenge:

Radvision Challenge:

CeWe Challenge:

Google Challenge:

HP Challenge:

Nokia Challenge:

Radvision Challenge:

Yahoo! Challenge:

Accenture Challenge:

CurrentTV Challenge:

Winners of the Multimedia Grand Challenge 2009

We are very happy to announce the winners of the Multimedia Grand Challenge 2009.
The challenge presentation took place last week in Beijing, during ACM Multimedia 2009. The session was a plenary session (read: no other presentations were going on at the same time), which meant a large crowd that, as far as I can tell, enjoyed the show. All the participants did a fantastic job, delivering engaging and interesting 3-minute presentations about their Grand Challenge responses, and Ayman Shamma masterfully hosted the event, entertaining and informing as needed.
But a number of presentations rose above the others. Here are the prizes awarded by our judges to the different presentations:

  • Grand Challenge Winner: Gerald Friedland, Luke Gottlieb, Adam Janin. Joke-o-mat: Browsing Sitcoms Punchline by Punchline (response to the Yahoo! video challenge — see video below).
  • Most Practical: Julien Law-To, Gregory Grefenstette, Jean-Luc Gauvain. VoxaleadNews: Robust Automatic Segmentation of Video into Browsable Content (response to the Yahoo! video challenge).
  • Best Presentation: Christoph Kofler, Mathias Lux. Dynamic presentation adaptation based on user intent classification (response to the Yahoo! image challenge).
  • Most Entertaining: Jongeun Cha, Mohamad Eid, Ahmad Barghout, Abu Saleh Md. Mahfujur Tahman, Abdulmotaleb Ei Saddik. HugMe: Synchronous Haptic Teleconferencing (response to the Radvision Video Conferencing Experience challenge).
  • HP Honorable Mention: Tewson Seeoun, Choochart Haruechaiyasek, Toshiaki Kondo. Identifying Auxiliary Web Images Using Combination of Analyses (response to the HP challenge).

Congratulations to all the winners! For your pleasure, a video version of the winning presentation is embedded below. Soon on these pages: pictures and more videos from the event, and details of the MMGC 2010 - see you in Florence!

[Updated: spelling correction.]

See you in Beijing: MMGC Session Details

We’re getting close! Just over a month from today, the Multimedia Grand Challenge will be presented in a dynamic, American-Idol style event at ACM Multimedia 2009. The MMGC session is scheduled for Wednesday, Oct 21, 2009, 16:30-18:00; at the Jewel Room of the conference venue, the Beijing Hotel.

David Ayman Shamma of Yahoo! Research will be the host / master of ceremony (read: session chair) of the MMGC. There will be a panel of judges that will react and respond to the presented challenge responses. The winners of the Grand Challenge will be chosen by the judges and the crowd (details to be posted), and will be announced at the conference banquet. There will be at least one Grand Challenge winner for a prize of $1500. Other stand-out presentations may be awarded additional prizes (e.g., “most entertaining presentation” or “best response to the challenge from corporate X”).

The goal of the session is to both engage and entertain the conference attendees. The session will begin with a quick overview of the challenges, most of which will continue into the ACM MM 2010 MMGC. Then, each response will be presented in exactly three minutes. There will be a timer, and when the timer goes off, the presenter will be cut off. When we say “cut off“, we mean it: the presenter will be shut off, the screen will be turned off, the mic will be muted, and if necessary, the host will spray water like you would a cat clawing the sofa. After this spectacle, or the allocated three minutes, the panel and crowd will have exactly 2 minutes to ask additional questions. We hope that these dynamics will result in a more direct, fun, informative and interesting presentations.

See you in Beijing!

Accepted Challenge Responses

We are all set for the Multimedia Grand Challenge at ACM MM 2009 in Beijing! The details of the (hopefully novel and entertaining) MMGC session at the conference will be posted soon. In the meantime, here is the list of accepted challenge responses, according to the challenge they address.

Responses to Google Challenge: Video Genre Classification

  • Xiao Wu, Chong-Wah Ngo. Towards Google Challenge: Combining Contextual and Social Information for Web Video Categorization.
  • Damian Borth, Jorn Hees, Markus Koch, Adrian Ulges, Christian Schulze, Thomas Breuel, Roberto Paredes.  TubeFiler: an Automatic Web Video Categorizer.
  • Yicheng Song, Xu Zhang, Juan Cao, Yong-Dong Zhang, Jintao Li. Google Challenge: an Incremental-Learning Web Video Categorization Algorithm on Robust Semantic Feature Space.

Responses to Yahoo! Challenge: Image Search Clustering

  • Christoph Kofler, Mathias Lux. Dynamic presentation adaptation based on user intent classification.
  • Winston Hsu, Liang-Chi Hsieh. Canonical Image Selection and Efficient Image Graph Construction for Large-Scale Flickr Photos.

Responses to Yahoo! Challenge: Video Segmentation

  • Julien Law-To, Gregory Grefenstette, Jean-Luc Garvain. VoxaleadNews: Robust Automatic Segmentation of Video into Browsable Content.
  • Gerald Friedland, Luke Gottlieb, Adam Janin. Joke-o-mat: Browsing Sitcoms Punchline by Punchline.

Responses to HP Challenge: Web Page MM Content

  • Wei Gong, Hangzai Luo, Jianping Fan. Extracting Informative Images from Web News Pages via Imbalanced Classification.
  • Tewson Seeoun, Choochart Haruechaiyasek, Toshiaki Kondo. Identifying Auxiliary Web Images Using Combination of Analyses.

Responses to CeWe Challenge: Tangible MM Products

  • Kan Ren, Risto Sarvas, Janko Calic. FreeEye - Intuitive Summarisation of Photo Collections.
  • Wei-Ta Chu, Chia-Hung Lin. Automatic Summarization of Travel Photos Using Near-Duplication Detection and Feature Filtering.
  • Pinaki Sinha, Hamed Pirsiavash, Ramesh Jain. Personal Photo Album Summarization.

Response to Radvision Challenge: Video Conference Experience

  • Jongeun Cha, Mohamad Eid, Ahmad Barghout, Abu Saleh Md. Mahfujur Tahman, Abdulmotaleb Ei Saddik. HugMe: Synchronous Haptic Teleconferencing.

Response to Accenture Challenge: Video Footage Analysis

  • Ming-Chun Tien, Yin-Tze Lin, Ja-Ling Wu. Sports Wizard.

This is it! more details about conference presentation to be posted soon.

What to Submit? Some Answers

We have been getting various queries asking about what kind of content is expected for the two-page submission to the Multimedia Grand Challenge. While there are no specific guidelines, there is one main thing to keep in mind:

  1. Convince us that your method is interesting, innovative, promising and that you will have something to show if we invite you to participate in October.

In other words, if the reviewers (including those from the relevant companies) are excited about your approach and your idea, that’s enough. Obviously, in two pages you cannot fit details of the implementation, the evaluation or the interface; focus on the high-level details of each. Of course, it will not hurt to show high-level results for a specific dataset if one is available.

Of course, if you provide a link to online demos or videos, even better.

That’s it! Submit your paper, make us excited, and do not forget the June 15th deadline is quickly approaching!

Evaluation Metrics & Example Test Data for HP Web Content Identification Challenge

The goal of the algorithms for the HP web content identiciation challenge is to retrieve or label all the informative multimedia content in web pages. The performance of the algorithms would be measured by comparing the automatically computed results with the manually labeled ground truth. Ground truth is generated by manually labeling all the informative multimedia content (i.e. images/video/flash objects) in a pre-selected set of the web pages (in various languages). The algorithm is expected to retrieve nearly all the informative multimedia content in the web pages.

 

The precision is defined as the number of informative images classified/labeled correctly by the algorithm divided by the total number of images labeled as informative by the algorithm. In other words, precision is the number of true positives divided by the sum of true positives and false positives. Recall is defined as the number of informative images classified/labeled correctly by the algorithm divided by the total number of informative images (which should have been labeled as informative). Recall is the number of true positives divided by the sum of true positives and false negatives. The final comparisons between the algorithms will be made by computing the F-measures using the precision and recall.

 

Here are examples of the types of web pages that we plan to use to evaluate the submissions:

 

* English

http://www.mapquest.com/maps?1c=Palo+Alto&1s=CA&2c=San+Francisco+&2s=CA (Driving directions)

 

http://www.buy.com/specialty_store_1/promotions/33379.html (shopping)

(Note, images of similar products or other product recommendations are considered informative)

 

http://edition.cnn.com/2009/TECH/05/27/ship.sinking.reef/index.html (news)

 

* Chinese

http://news.sina.com/oth/phoenixtv/502-104-103-108/2009-05-27/01323899156.html (entertainment articles)

 

http://www.china.travel/sym/lyhd/2009-05-14/274878.shtml (travel)

 

http://www.yahtour.com/destination/province.php?id=2238 (travel)

 

 

* Arabic

http://www.aljazeera.net/NR/exeres/5CC37A8B-39E7-4692-BCD9-2D8807ACE580.htm

 

http://www.marma.net/content.prt-CID=16303

 

* Korean

http://news.chosun.com/site/data/html_dir/2009/05/28/2009052800708.html (news)

 

http://blog.naver.com/honeykja/40045216645 (blog, recipe)

 

$3K prizes announced, Submission Site Open

We are delighted to announce that thanks to our generous sponsors (Google, HP Labs, Nokia and Yahoo!), the Multimedia Grand Challenge 2009 will offer a total of $3000 in prizes, including a first prize of $1500. See the Prizes section for (limited) details.

In other news, the submission site is now open, with submissions Due on June 15th! The submissions should be made in a 2-page ACM format, and must address one of the corporate challenges listed below. Details available on the SUBMIT page.

Submission, Prize, Sponsors all coming soon!

Stay tuned for information about submission and (most excitingly) a grand prize for the Grand Challenge coming very soon on this blog. In the meantime, Jay just posted about the Google Challenge on the Google Research Blog.

Posted: Dataset for Nokia Photo Location Challenge

We are still working on creating a ground truth set for Nokia’s Photo location & orientation Challenge. For the time being, we are glad to provide a larger data set (zip file), for which we unfortunately do not have the ground truth yet. This set also includes calibration grid images. All the images were taken with the same camera.


This set represents fairly realistic, but extremely challenging conditions. It is our hope that researchers will soon find methods that are able to fairly accurately reconstruct the camera poses from a set like this.  For the time being, though, we are not expecting submissions which can fully recover all the camera poses in this set.


The images were taken in the city of Lausanne, Switzerland, starting from Place Centrale, going around at Place Pepinet, continuing on Rue Centrale and taking a turn to Rue du Pont. From the fountain the pictures set continues on Escaliers du Marche towards the Cathedrale de Lausanne. There are good satellite and aerial images of the city publicly available (for example, from Live.com) , which may help you to visually inspect your results.


The image set can be downloaded here.

Full Paper Deadline Approaching

As we mentioned earlier, the Grand Challenge committee will give preference for submissions that are accompanied by other conference submissions. When we say “other conference submission” we primarily refer, of course, to full paper submissions.

With that in mind, we just wanted to remind everyone that the MM2009 full paper deadline is April 17th; April 10th is the deadline for paper Registration (abstract submission).

Other deadlines are May 8th for short paper submissions, and, of course, June 15th for the Grand Challenge.

Current TV Challenge: Media Production in the Age of Community

Community media is all around us.  News is broadcast everywhere: from websites to Facebook feeds.  In many cases, the conversation about news is as important as the news event itself.  News media providers are seeking ways to aid the production of content though the merger and analysis of video and social streams.  This raises many questions:
•    What kinds of social streams can be aligned in real time to live media?
•    What video content features align to social streams (such as what’s the relationship between a stadium video of camera flash bulbs and an onset of social status messages, like Twitter.com tweets about someone’s dress)?
•    How could social streams be used to find highlights and summarizations of events?
•    How is this video segment important to a community?  What deeper insight can the analysis of social streams add to news reportage?

application

We are seeking applications that produce insightful social media analysis of news and events, e.g. breaking new stories, political events (speeches, elections, press conferences, legislative voting), major media events (e.g. Oscars, Grammys, Pulitzer Prizes announcements, etc), etc.. These applications can be autonomous or semi-autonomous. In the latter case, the applications could require some minimal human intervention to ‘curate’ the production of new media. For a live broadcast scenario, the application could, for example, find relevant commentary from social sites aligned in real time. For a recorded broadcasts, applications can align the show to a set of time-delivered social comments and discover highlights. Given the plurality of social sources, performing video and text analysis at scale will be an issue for the live events, and perhaps even for recorded broadcasts.

Input—Any video stream of the type described above (or others) plus any combination of social sources (blog posts, Flickr Photos, Twitter tweets, Facebook Status messages, etc.). Content can be pulled from any number of public sources via public APIs or via any other feasible mechanisms. Live broadcasts can be ‘simulated’ for demonstration purposes.

Output—A filtering of social media aligned to the video stream. Once aligned, the social media should be analyzed to illuminate why a video segment is meaningful. There could be various paths for the analysis, perhaps based on categories. For example, a category of visual commentary includes information that relates to items and objects in the video: does this relate to the motorcade or the speech? Is it the speaker or their attire? What is the most insightful commentary about the identified objects? A topical commentary will show: Is this about a domestic or foreign issue? Is this about national security or health care? In another example, a sentiment analysis might show mood and reaction. For all analyses, being able to identify why an event is meaningful via the social commentary is key.

Evaluation

We will look for success in the following areas:

  • Efficacy of the filtration and categorization.
  • Speed of the application
  • Presentation novelty or attractiveness
  • Curator user Interface for the cases that require human curation
  • Quality of aggregation of community contributions over time, with best representative samples. I.e., for any segment, find representative topics and categories and the best sample social media excerpts for:
    • What did the majority of people say? (100-80%)
    • What did the core population say? (79% - 20%)
    • What did the outliers say? (< 20%)

Sample past and future events:
•    Election night: November 4th, 2008
•    Inauguration day: January 20th , 2009
•    Oscar night: February 22nd, 2009
•    Macworld
•    CES
•    Graduation Day, high school and college
•    American Idol finale
•    Baseball World Series


Sample past breaking news:
•    Hudson River plane crash, January 15th, 2009
•    Australian bushfires: early February 2009
•    Major stock market fluctuations (past and future, starting in Sept 2008)

Feel free to correspond with the challenge authors via the comments form below.

For private correspondence, consult the About page for contact details.