Preview

Data Mining Project on IMDB Website

Powerful Essays
Open Document
Open Document
1238 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
Data Mining Project on IMDB Website
Data Mining Project on IMDB website
ABSTRACT
The Internet Movie Database (IMDb) is an online database of information related to movies, television shows, stars, etc. We chose to do our project from 2008 to 2011 year’s movie database. We extracted data like Movie, Director, Star, Image Url, Studio from the IMDb website. For this extraction of data we used a tool named Mozenda. After the data extraction, the data was analyzed. For a particular star, his/her movie, director, studio with whom the star has worked was shown. A Graphical User Interface (GUI) for the same was developed. According to this GUI, when the user selects a Star his/her respective movies, directors, studios are displayed. A graph for the extracted data is also shown. For this a tool named NodeXL is used. This graph is having star and movie as the nodes and an edge is the relation between the star and the movie which shows that the star has worked in the movie and vice versa.

DATA EXTRACTION TOOL: MOZENDA

This tool was used to extract the web data. In the Mozenda agent builder, the url www.imdb.com was entered. The website page gets loaded in the agent builder. One can navigate through the pages from where to extract the data. We chose to extract data from January 2008 to April 2011. So the url for January 2008’s webpage (http://www.imdb.com/nowplaying/2008/1/) was entered. After the January 2008’s webpage is loaded, start new Agent from this page on the agent builder is clicked. As we have to extract the same set of data like movie name, director, image, studio for each movie, Create list of items on the agent builder is clicked. The movie names of the first two movies on the webpage are selected. Then a dialog box appears. A respective filed name like Movie is given. Same procedure is repeated for Director, Studio, Image Url. As we want to extract same type of data from multiple pages, Add list pager on the agent builder is clicked and then next month is clicked. Now the software

You May Also Find These Documents Helpful

  • Powerful Essays

    Turban, E., Rainer, K., & Potter, R. (2003). Introduction to Information Technology (8th ed.). New York: John Wiley & Sons, Inc. .…

    • 979 Words
    • 4 Pages
    Powerful Essays
  • Good Essays

    Ccld L3 Unit 5

    • 624 Words
    • 3 Pages

    In this homework you will research into different ways Information Technology and Computing are used to make Movies. The way the characters in Toy story come to life! The way Spiderman flies through the air! How do they do that?…

    • 624 Words
    • 3 Pages
    Good Essays
  • Powerful Essays

    Turban, E., Rainer. K., & Potter. R. (2003). Introduction to Information Technology. John Wiley and Sons, Inc.…

    • 1409 Words
    • 6 Pages
    Powerful Essays
  • Satisfactory Essays

    assign1

    • 309 Words
    • 1 Page

    Educational) for rent. Each film is uniquely identified by a Film ID. Each film is also…

    • 309 Words
    • 1 Page
    Satisfactory Essays
  • Good Essays

    Case of Movie Industry

    • 1117 Words
    • 5 Pages

    The developments in online movie distribution have come at a difficult time for the movie industry. At present day, the interest reduces barriers to entry, such as the need for a sales force, access to channels. It also provides a technology for driving business processes that makes other things easier to do.…

    • 1117 Words
    • 5 Pages
    Good Essays
  • Satisfactory Essays

    IMDb is known for listing every movie a star that has stared in a movie and give a…

    • 258 Words
    • 2 Pages
    Satisfactory Essays
  • Satisfactory Essays

    Syllabus

    • 627 Words
    • 3 Pages

    Chapter 2 The Internet and World Wide Web and Making use of the Web Pages 43 - 94 DB 2 (Word) Chapter 3 Application Software & Digital Video Technology Pages 95 - 134…

    • 627 Words
    • 3 Pages
    Satisfactory Essays
  • Better Essays

    contains graphics and a written text and analyze it according to the criteria that follow.…

    • 983 Words
    • 4 Pages
    Better Essays
  • Satisfactory Essays

    Databases. This article would be great to use in my paper because the experts has made…

    • 109 Words
    • 1 Page
    Satisfactory Essays
  • Powerful Essays

    Netflix Information System

    • 1867 Words
    • 8 Pages

    One of the most important technologies that support Netflix’s customer relationship management is its custom-built intelligent agent. An intelligent agent is artificial intelligence software that helps or acts on behalf of the user to perform repetitive-computer related tasks (Haag 224). In particular, Netflix uses a buyer agent, also known as a shopping bot. A buyer agent is an intelligent agent on a website that assists the consumer in finding a product or service that he or she wants (Haag 225). Netflix’ shopping bots use two techniques in order to predict customers’ DVD preferences: collaborative filtering and adaptive filtering. Collaborative filtering is when a customer is matched with a group of users who have similar tastes. Then, the customer is presented with common selections in that group (Haag 225). Adaptive filtering is when the consumer is asked to rate a product or situation and then monitored over time (Haag 226). Ultimately, Netflix will know what the customer likes and dislikes. By using a hybrid technique, Netflix is able to give…

    • 1867 Words
    • 8 Pages
    Powerful Essays
  • Good Essays

    Citizens’ personal information has always been actively sought by government authorities and by private businesses, and up until recently, has been kept exclusively by the institutions requesting the information. However, those days of confidentiality are over, as the world becomes increasingly structured upon the evolution of the Internet. Today, government authorities and private businesses have a multitude of ways to access personal information that is submitted through the World Wide Web, one of these methods being the surveillance and tracking of search requests through online search engines such as Google (Search Engine Privacy). The collection of personally identifiable data by search engines threatens…

    • 989 Words
    • 4 Pages
    Good Essays
  • Better Essays

    For the past one hundred plus years numerous people have escaped the daily grind of life via the movie cinema. In fact, the movie cinema has been around in both America and Europe since 1905 when the first nickelodeon theatres sprang into existence (Pellettieri, 2007). Viewing a movie at the local cinema was for many generations a rite of passage for weekend activity. As time has passed movie viewing venues have brought the theatre into our homes via video tapes and Digital Video Disk (commonly know as DVD). However, even with these home viewing venues the viewer still had to go out, or send out to rent the viewing material. But, with the 21st Century now upon us a new venue called online movie downloads has now arrived, and it is steadily becoming the future theatre for viewing film.…

    • 2177 Words
    • 9 Pages
    Better Essays
  • Good Essays

    Nowadays, movies, which are the most important entertainment of people, has spent much more money and time than before by a growing number of people. Different kinds of new movies play nearly everyday; and the way to watch a movie isn’t confined to the cinema. Along with the improvement of digital postproduction and digital effect is applied to the movies, they make people to be personally on the scene when you watch a movie. In the past twenty years, the changes of the ways to watch a movie and the movie technology have already influenced entertainment for people deeply.…

    • 665 Words
    • 3 Pages
    Good Essays
  • Good Essays

    A movie is something that everyone can sit down and enjoy. All types of ages are able to watch a movie. The movie that I evaluated was Project Almanac. This movie was about a boy named David, who is crazy smart who dreams of going to MIT. In the movie the main character David, stumbles upon secret plans of his late father's. In these plans it includes a device to build which turns out to be a time machine. David and his friends then get to work to build this so called time machine. After many trials and errors they get the time machine to actually work. There were many consequences to building the time machine. You would just have to watch the movie to find out what they were. The whole movie was based on camerawork from the main character's…

    • 951 Words
    • 4 Pages
    Good Essays
  • Satisfactory Essays

    Entities and Attributes

    • 469 Words
    • 2 Pages

    Baranes, A. (2010, Jan 2). Lecture 6: Entities and Attributes in an Online Database, Internet Movie Database. Retrieved from https://sites.google.com/site/principlesofinformationsystems/lecture-6-entities-and-attributes-in-an-online-database…

    • 469 Words
    • 2 Pages
    Satisfactory Essays