Preview

Bigdata

Better Essays
Open Document
Open Document
3484 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
Bigdata
Addressing the Challenge of Big Data & MDM in the Large Enterprise

Presented by:

Manish Sood, Founder & CEO, Reltio, Inc. manish@reltio.com October, 2012

Image: "Data Deluge," Brett Ryder, The Economist, Feb. 2010

Agenda 1. What is Big Data? 2. What is NoSQL vs. Relational DBs? 3. What is Hadoop (HDFS and MapReduce)? 4. MDM and Big Data – a Case Study

Confidential and Proprietary – please do not distribute without prior permission

2

Trend – Growing data sets
DATA VOLUME
Zettabyte

1.4 Zettabytes in Enterprise Data

2011

Machine To Machine

Exabyte

Petabyte

Interactions
Terabyte

Transactions
Mainframe PC Internet Mobile Machine

Time

Zettabyte = 1,000,000,000,000,000,000,000 Bytes Graph based on IDC and UC Berkeley Data Growth Estimates, Source: IDC & CosmoBC.com: http://techblog.cosmobc.com/2011/08/26/data‐storage‐ infographic/

Confidential and Proprietary – please do not distribute without prior permission

3

Trend – Information Connectivity

Information Connectivity

Internet of Things

Semantic Web Tagging Social Networks Text Files RDBMS Hypertext Blogs RDF Folksonomies User generated content

Web 1.0

Web 2.0

Web 3.0

1990

2000

2010

2020

Confidential and Proprietary – please do not distribute without prior permission

4

Trend – Data Complexity
Text files and Lists Majority of Webpages

Relational Databases

Performance

Social Networks

Internet of Things

Custom work

Data Complexity
Confidential and Proprietary – please do not distribute without prior permission 5

Characteristics of Big Data Velocity
Volume Variety Value

$
10’s of Billions of Daily Records From Terabytes to Petabytes Multi‐ Structured Business Insights

Big data is where the data volume, acquisition velocity, or data representation limits the ability to perform effective analysis using traditional relational approaches or requires the use of significant



Links: Inderpal Bhandari, VP & Chief Data Officer, Express Scripts October, 2012

You May Also Find These Documents Helpful

  • Satisfactory Essays

    All computers today have GB or TB. I was just in Wal-Mart today and saw a removable hard-drive with a 2TB capacity for only $150, and there was even a 3TB hard-drive. When I went over to staples and looked at all the computers I didn’t see any fewer than 750 GB of ROM and 4 GB of RAM. With technology in the world today expanding so quickly it is not farfetched to see hard-drives with 100 TB capacities in the near future. If you went by Moore’s Law, which I know is for transistors but I think goes along with many other things, I…

    • 420 Words
    • 2 Pages
    Satisfactory Essays
  • Powerful Essays

    References: Brown, B., Chiu, M., Manyika, J. (2011), Are you ready for the era of big data? Retrieved…

    • 1755 Words
    • 6 Pages
    Powerful Essays
  • Good Essays

    Week 6 Discussion 2

    • 582 Words
    • 3 Pages

    Any organization wishing to maintain a competitive advantage can benefit from big data management and analytical tools. When properly utilized, big data can increase efficiency, productivity, and predict future market conditions (Laudon, p. 231). As processors become faster and more affordable, big data management will become a necessary component of all organizations. The actual benefit from big data will lie in the ability to analyze and apply the vast amounts of information that are flooding databases at all times.…

    • 582 Words
    • 3 Pages
    Good Essays
  • Powerful Essays

    The compute framework of Hadoop is called Map Reduce. Map Reduce has been proven to the scale of…

    • 3076 Words
    • 13 Pages
    Powerful Essays
  • Powerful Essays

    Databases are everywhere now and impact our lives in a multitude of ways. It can accurately be said that “your life is in a database” or, more accurately, in multiple databases, and information about you (a retrieval of facts about you) is easily accessible. Your shopping history, credit history, medical history, even your driving history, is stored in one or more databases.…

    • 1190 Words
    • 5 Pages
    Powerful Essays
  • Powerful Essays

    Data

    • 1644 Words
    • 7 Pages

    The purpose of the report is to assist Aircraft Solutions (AS) in indentifying the most significant Information Technology (IT) security vulnerabilities. AS products and services are at the forefront of the industry and the protection of such is very important as they are an industry leader. The vulnerabilities that will be discussed are the firewall configuration, virtualization of their hardware assets and defining security policy regarding the timeliness of firewall configuration and updates.…

    • 1644 Words
    • 7 Pages
    Powerful Essays
  • Powerful Essays

    Du Preez, D. (2012a). Big data: hands on or hands off? 21 Feb 2012. Computing Feature, (n.d.). Retrieved from http://www.computing.co.uk/ctg/feature/2153789/-hands-hands/page/1…

    • 1730 Words
    • 7 Pages
    Powerful Essays
  • Good Essays

    Demchenko, Zhao, Grosso, Wibisono, & Laat (2012), have described the five primary characteristics of health care big data as five V’s: Volume, Velocity, Variety, Veracity, and Value. Volume refers to vast amounts of health-related data created and accumulated continuously. In 2011 alone, the U.S. healthcare system has reached 150 exabytes, and soon will reach the zettabyte (1021 gigabytes) scale and, not long after, the yottabyte (1024 gigabytes) (Raghupathi & Raghupathi, 2014). Velocity applies to the constant flow of new data accumulating at unprecedented rate, variety pertains to the level of complexity of the data, veracity measures includes questions of trust and uncertainty with regards to data and the outcome of analysis of that data, and value evaluate show how good the quality of the data is in reference to the intended results. (Herland, Khoshgoftaar, & Wald,…

    • 648 Words
    • 3 Pages
    Good Essays
  • Better Essays

    Cloud Bi

    • 1361 Words
    • 6 Pages

    Now, data can be in vast amounts, of which some might be useful and some might not be useful.…

    • 1361 Words
    • 6 Pages
    Better Essays
  • Better Essays

    Jacobs, Adam. "The Pathologies of Big Data." Communications Acm 19 June 2014: n. pag. Google Scholar. Web. 10 Sept. 2014.…

    • 1115 Words
    • 5 Pages
    Better Essays
  • Satisfactory Essays

    With that in mind, organizations should always cease to ensure that their data is eagerly managed. With the market changing, the process of data management is becoming more complex and the capacity of data to be managed is steadily increasing, this is sometimes referred to as “big data”. Big data is used in understanding organizations and their decision making process; when decisions are made, they are based on complex data transactions which have become difficult to the system that are using basic database and warehouse management systems (Vael, 2013). This causes many data management difficulties such as an increase in data, immature decision making, legal issues and data securing and integrity to name a few, but they can easily be reduced or resolved by the use of the following:…

    • 707 Words
    • 3 Pages
    Satisfactory Essays
  • Best Essays

    In this fast paced information age, there are many different sources on corporate networks and internet is collecting massive amounts of data, but there is a significant difference in this data compared to the conventional data, much of this data is semi-structured or unstructured and not residing in conventional databases. “Big data” is essentially a huge data set that scales to multiple petabytes of capacity; it can be created, collected, collaborated, and stored in real-time or any other way. However, the challenge with big data is that it is not easily handled using traditional database management tools. It typically consists of unstructured data, which includes text, audio and video files, photographs and other data (Kovar, 2012). The aim of this paper is to examine the concepts associated with the big data architecture, as well as how to handle, process, and effectively utilize big data internally and externally to obtain meaningful and actionable insights.…

    • 2200 Words
    • 9 Pages
    Best Essays
  • Powerful Essays

    Open-Data

    • 10067 Words
    • 41 Pages

    1. HM Revenue & Customs (HMRC) are fully committed to the transparency agenda, and transparency is a key principle for the Department.…

    • 10067 Words
    • 41 Pages
    Powerful Essays
  • Best Essays

    Data Warehousing and Olap

    • 2507 Words
    • 11 Pages

    In the 1990s, as businesses grew more complex, corporation spread globally, and competition became fiercer, business executives became desperate for information to stay competitive and improve the bottom line. Data warehousing technologies have been successfully deployed in many industries: manufacturing (for order shipment and customer support), retail (for user profiling and inventory management), financial services (for claims analysis, risk analysis, credit card analysis, and fraud detection), transportation (for fleet management), telecommunications (for call analysis and fraud detection), utilities (for power usage analysis), and healthcare (for outcomes analysis). This paper presents a roadmap of data warehousing technologies, focusing on the special requirements that data warehouses place on database management systems (DBMSs).…

    • 2507 Words
    • 11 Pages
    Best Essays
  • Good Essays

    Data in itself can be powerful, but also has many pitfalls if left to disparate databases and data collection routines. A collection of spreadsheets with account numbers entered into them can be view as a business liability. This same information in a database that can be queried, secured, organized and related to other data for analytical purposes becomes a power business tool. It takes “big data” and makes it business intelligence.…

    • 853 Words
    • 4 Pages
    Good Essays

Related Topics