Preview

Content-Based Video Tagging

Powerful Essays
Open Document
Open Document
4180 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
Content-Based Video Tagging
Content-based Video Tagging for Online Video Portals∗
Adrian Ulges1 , Christian Schulze2 , Daniel Keysers2 , Thomas M. Breuel1
1 University
2 German

of Kaiserslautern, Germany

Research Center for Artificial Intelligence (DFKI), Kaiserslautern
{a ulges,tmb}@informatik.uni-kl.de,

{christian.schulze,daniel.keysers}@dfki.de

Abstract
Despite the increasing economic impact of the online video market, search in commercial video databases is still mostly based on user-generated meta-data. To complement this manual labeling, recent research efforts have investigated the interpretation of the visual content of a video to automatically annotate it. A key problem with such methods is the costly acquisition of a manually annotated training set.
In this paper, we study whether content-based tagging can be learned from user-tagged online video, a vast, public data source. We present an extensive benchmark using a database of real-world videos from the video portal youtube.com. We show that a combination of several visual features improves performance over our baseline system by about 30%.

1

Introduction

Due to the rapid spread of the web and growth of its bandwidth, millions of users have discovered online video as a source of information and entertainment. A market of significant economic impact has evolved that is often seen as a serious competitor for traditional TV broadcast.
However, accessing the desired pieces of information in an efficient manner is a difficult problem due to the enormous quantity and diversity of video material published. Most commercial systems organize video access and search via meta-data like the video title or user-generated tags (e.g., youtube, myspace, clipfish) – an indexing method that requires manual work and is time-consuming, incomplete, and subjective.
While commercial systems neglect another valuable source of information, namely the content of a video, research in content-based video retrieval strives to



References: [1] Deselaers T. and Keysers D. and Ney H., ‘Discriminative Training for Object Recognition Using Image Patches’, CVPR, pp.157-162, Washington, DC, 2005. Vol. 9, No. 8, pp.1280-1289, 1999. Pattern Anal. Mach. Intell., Vol. 20, No. 3, pp.226-239, 1998. [8] Li J. and Wang J., ‘Real-time Computerized Annotation of Pictures’, Intern. Conf. on Multimedia, pp.911-920, Santa Barbara, CA, 2006. [11] Feng S.L. and Manmatha R. and Lavrenko V., ‘Multiple Bernoulli Relevance Models for Image and Video Annotation’, CVPR, pp.1002-1009, Washington, DC, 2004. [14] Fei-Fei L. and Perona P., ‘A Bayesian Hierarchical Model for Learning Natural Scene Categories’, CVPR, pp.524-531, San Diego, CA, 2005. [15] Sivic J. and Zisserman A., ‘Video Google: A Text Retrieval Approach to Object Matching in Videos’, ICCV, pp.1470-1477, Washington, DC, 2003. Vol. 22, No. 12, pp.1349-1380, 2000. [17] Snoek C. et al., ‘The MediaMill TRECVID 2006 Semantic Video Search Engine’, TRECVID Workshop (unreviewed workshop paper), Gaithersburg, MD, 2006. [19] Vasconcelos N. and Lippman A., ‘Statistical Models of Video Structure for Content Analysis and Characterization’, IEEE Trans. Image Process., Vol. 9, No. 1, pp.3-19, 2000. Intern. Conf. on Multimedia, pp.421-430, Santa Barbara, CA, 2006.

You May Also Find These Documents Helpful

  • Good Essays

    Midijam System Overview

    • 617 Words
    • 3 Pages

    Kevin Nolan SN 20036163 / B.Sc. in Entertainment Systems(Hons), Dept of Computing, Maths and Physics, School of Science, Waterford Institute of Technology. S.Email: 20036163@mail.wit.ie…

    • 617 Words
    • 3 Pages
    Good Essays
  • Better Essays

    Cango Week 4 Analysis

    • 1862 Words
    • 8 Pages

    References: Class Videos. (2010, January 4). Cango Quicktime Videos [Video File]. Videos listed on http://www.devryu.net/ec/crs/default.learn?CourseID=…

    • 1862 Words
    • 8 Pages
    Better Essays
  • Satisfactory Essays

    Pt1420 Unit 1 Assignment

    • 303 Words
    • 2 Pages

    IBM Multimedia Analysis and Retrieval System [8]. The service enabled users to train new classifiers in December 2015.…

    • 303 Words
    • 2 Pages
    Satisfactory Essays
  • Satisfactory Essays

    assign1

    • 309 Words
    • 1 Page

    kinds of films, more information is recorded. For instance, each foreign film has a spoken…

    • 309 Words
    • 1 Page
    Satisfactory Essays
  • Satisfactory Essays

    PHYS1160 Notes

    • 534 Words
    • 2 Pages

    format material more generally as it is easier to keep up to date than the video…

    • 534 Words
    • 2 Pages
    Satisfactory Essays
  • Good Essays

    In his work Bundesen discusses 3 types of perceptual categories, namely a color category, a shape category and a location category \cite{tva}. Our image retrieval model is founded mainly on color analysis. Therefore, we can follow the notation proposed in \cite{tva} with the assumption that perceptual categories are represented solely by colors.…

    • 389 Words
    • 2 Pages
    Good Essays
  • Good Essays

    Online video is posed to be the leading worldwide medium for entertainment and information as it is already the most important source for the young target audience of 14-34 year-olds. This is according to an online multi-channel network of video content producers, Mediakraft Networks, who have partnered with British Pathè to create an online video series titled, The Great War. British Pathè is a curator of newsreel footage from around the world and contains over 85,000 films unrivalled in their historical and cultural significance. British Pathè provides a variety of archival footage from the First World War to be used in the production of the video series.…

    • 485 Words
    • 2 Pages
    Good Essays
  • Good Essays

    Use of text/titles: - Documentaries usually use words on screens to anchor images in time and space. It is a quick and cheap way of conveying information.…

    • 327 Words
    • 2 Pages
    Good Essays
  • Powerful Essays

    In the past decade there has been a substantial growth in computing technology that has ignited a new revolution of online media for consumers. The internet now provides a greater amount of innovation for the entertainment industry. These innovations have provided businesses to be able to market a profitable model to the masses that in effect broadens the availability of entertainment interests of the consumer. Many companies have taken on this endeavor and have become a common place name globally. The largest name in the online community, Google, understood that internet video was the next step in expanding their ever-growing online presence; however, entering a market where AOL, MSN, and Yahoo claimed to be the titans of the online world was not going to be easy without innovation.…

    • 4472 Words
    • 18 Pages
    Powerful Essays
  • Satisfactory Essays

    Intro to Sift

    • 1504 Words
    • 7 Pages

    David G. Lowe Computer Science Department 2366 Main Mall University of British Columbia Vancouver, B.C., V6T 1Z4, Canada E-mail: lowe@cs.ubc.ca…

    • 1504 Words
    • 7 Pages
    Satisfactory Essays
  • Powerful Essays

    Video Streaming

    • 2208 Words
    • 9 Pages

    Abstract— This web-based video streaming application was developed based-on Vidiscript v0.43 (a free Youtube Clone Script) and installed on lecturer 's laptop as a video streaming server (with free open source Linux Ubuntu 9.10 Karmic Koala operating system). This video streaming application can be accessed by students (using an Internet Browser on his/her Linux Ubuntu or Windows XP laptop) through a wireless ad hoc network (without a support by an access point). Vidiscript installation on Linux Ubuntu 9.10 need additional support from some free open source software, such as LAMP Server (Linux, Apache, MySQL, and PHP), ioncube loader for linux, encoder (transcoder) FFMPEG, MEncoder, and LAME. The site settings and account management of this application can be setup by the admin server (lecturer) so it can help and support a multimedia-based class activity. A lecture can upload some lecture video files and manage right access for his/her students. As a client, a student can see a streaming video and upload some video file (register student) to be shared by others.…

    • 2208 Words
    • 9 Pages
    Powerful Essays
  • Satisfactory Essays

    -When the club acquires new videos ,they are categorised , coded and lebelled .These details are then recorded in the inventory and the video catalogue is updated.Videos are categorised as comedy,general,horror,thriller or cartoon.…

    • 443 Words
    • 2 Pages
    Satisfactory Essays
  • Powerful Essays

    Bullet Screen Case Study

    • 1149 Words
    • 5 Pages

    While the use of natural language processing on “Bullet Screen” is beneficial to store and organize natural language, as authors provide users with search function to help users find the wonderful part of the video. According to Alexa, in January 2017, in all integrated authors site all over the world, Bilibili (http://search.bilibili.com/) ranked the 224th, while the number of its visitors ranked the 286th. It is the most active Chinese large-scale “Bullet Screen” video site. In this paper, research object is presented and related works about videos with “Bullet Screen” are detailed in 3. Research problems of the “Bullet Screen” retrieval system are discussed. Basic algorithms and the proposed ISB methods are demonstrated. Finally, the conclusions are provided. In fact, different types of “Bullet Screen” are applied in situations and have various characters. Subtitle is the result of the subtitle group (a small number of users) editing the video data. Most users use subtitle information rather than participate in the creation itself, which is similar to authorsb1.0. “Bullet Screen” is similar to Authorsb2.0, in which the users can be more interactive. In this process, the users can participate in the creation of the “Bullet Screen”. To some degree, texts “Bullet Screen” in live site have an impact on the broadcast itself. The anchor can communicate with the users through “Bullet Screen”. In this way, different “Bullet Screen” applications actually conform to the development of the Internet. Their development does not have a reciprocal relationship to each other. Whether it is subtitles, barrage or live barrage, it is not gone but applied in different situations. During the video viewing, the density of the “Bullet Screen” commentary is significantly correlated with the importance of the video…

    • 1149 Words
    • 5 Pages
    Powerful Essays
  • Good Essays

    You must be coming across the website, YouTube wherein videos can be uploaded and viewed with the help of the internet. Such techniques or processes are known as streaming of multimedia wherein different multimedia applications and elements are transferred from the provider to the viewer with the help of a particular medium or the internet. Different players or media applications are used and employed for this particular application via which different elements like text, images, audio and videos can be transferred, viewed or shared. Whenever you make use of the internet to obtain or send a multimedia element, you are making use of the streaming process. The streaming depends on the bandwidth and speed of the network you use.…

    • 471 Words
    • 2 Pages
    Good Essays
  • Powerful Essays

    Ooredoo

    • 936 Words
    • 3 Pages

    Multimedia is broadly divided into two categories linear and non linear. Linear active content progresses without navigational control for the viewer such as presentation of cinema. Non linear uses interactivity to control progress with a video game or computer based training.…

    • 936 Words
    • 3 Pages
    Powerful Essays