Skip to main content

Do you know BIG Data? let's discuss about the types of it


What is Data?


The quantities, characters, or symbols on which operations are performed by a computer, which may be stored and transmitted in the form of electrical signals and recorded on magnetic, optical, or mechanical recording media.


Do you know BIG Data? let's discuss about the types of it


What is Big Data?


Big Data is also data but with a huge size. Big Data is a term used to describe a collection of data that is huge in volume and yet growing exponentially with time. In short such data is so large and complex that none of the traditional data management tools are able to store it or process it efficiently.


Examples Of Big Data


Following are some the examples of Big Data-


The New York Stock Exchange generates about one terabyte of new trade data per day.

Social Media


The statistic shows that 500+terabytes of new data get ingested into the databases of social media site Facebook, every day. This data is mainly generated in terms of photo and video uploads, message exchanges, putting comments etc.


A single Jet engine can generate 10+terabytes of data in 30 minutes of flight time. With many thousand flights per day, generation of data reaches up to many Petabytes.


Types Of Big Data


BigData' could be found in three forms:

  1. Structured

  2. Unstructured

  3. Semi-structured


Structured


Any data that can be stored, accessed and processed in the form of fixed format is termed as a 'structured' data. Over the period of time, talent in computer science has achieved greater success in developing techniques for working with such kind of data (where the format is well known in advance) and also deriving value out of it. However, nowadays, we are foreseeing issues when a size of such data grows to a huge extent, typical sizes are being in the rage of multiple zettabytes.

Examples Of Structured Data


An 'Employee' table in a database is an example of Structured Data


Employee_ID Employee_Name Gender Department Salary

2227

Sunny Karlekar

Male 

Finance

650000

3379

Ruchi Vaishnav

Female 

Admin 

650000

7363

Sanju Roy 

Male 

Admin 

500000

7509

Anil Patel

Male 

Finance 

500000

7679

Priya Malhotra

Female 

Finance 

550000




Unstructured


Any data with unknown form or the structure is classified as unstructured data. In addition to the size being huge, un-structured data poses multiple challenges in terms of its processing for deriving value out of it. A typical example of unstructured data is a heterogeneous data source containing a combination of simple text files, images, videos etc. Now day organizations have wealth of data available with them but unfortunately, they don't know how to derive value out of it since this data is in its raw form or unstructured format.


Examples Of Un-structured Data

The output returned by 'Google Search'


Do you know BIG Data? let's discuss about the types of it


Semi-structured


Semi-structured data can contain both the forms of data. We can see semi-structured data as a structured in form but it is actually not defined with e.g. a table definition in relational DBMS. Example of semi-structured data is a data represented in an XML file.


Examples Of Semi-structured Data


Personal data stored in an XML file-


<rec><name>Sunny Karlekar</name><sex>Male</sex><age>35</age></rec>
<rec><name>Ruchi Vaishnav</name><sex>Female</sex><age>41</age></rec>
<rec><name>Sanju Roy</name><sex>Male</sex><age>29</age></rec>
<rec><name>Anil Patel</name><sex>Male</sex><age>26</age></rec>
<rec><name>Jeremiah J.</name><sex>Male</sex><age>35</age></rec>


Comments