Big Data



What is Data?
data is information processed or stored by a computer. This information may be in the form of text documents, images, audio clips, software programs, or other types of data.
What is Big Data?
The data which are large in size is called Big Data. Normally we work on data of size KB(Text file, small work files),Size in MB (Large scale word documents, Excel, ppts), GB (Movies, Project Code) but if the data size is in Peta Bytes is called Big Data.
How is Big Data getting created in the memory?
Every time one person opens any application(apps) their mobile/web page, signs up online on a platform or even we search the information in the search engines(Google,…), so piece of data is gathered.
These data come from many source:
1. Social networking Sites
2. Stock exchange
3. E-commerce site
4. Telecom companies


Social networking Sites:
Face Book, Google, Linkedln, Instagram, Twitter….all these sites generates huge amount of data every day
2. Stock exchange:
The BSE/NSE stock exchange generates huge amount of data (tera bytes in size) of new trade data per day.

3. E-Commerce site: Amazon, Flipcart,Alibaba, Myntra,…..generates huge amount of data from the users..
4. Telecom Company: Like AirTel, Jio,Vodafone, BSNL, IDEA,…user trends and accordingly publish their plans and for this they save the data of its million users.
5. Share Market: stock exchanges across the world which generates huge amount of data through its daily transaction.
Data set: a collection of related sets of information that is maintained a separated elements of data, which can be manipulated as a unit by a computer.
(or) It is a collection of data, which is related to a specific person/object/subject.
Types of Big Data:
1. Structured Data
2. Semi-structured Data
3. Un-Structured Data

Structured Data:
Any data that can be stored, accessed and processed in the form of fixed format is called ‘Structured Data’. Example: Relational Data.
Example: Emp table in a database is an example of structured data.
Empid
Ename
Gender
Proj
Sal
2314
Naik
M
YTC
20000
2315
Sita
F
Esc
20000

2. Semi-Structured Data: XML (Extensible Markup Language) data:
.xml
<Emp><Name>rajendra</Name></emp>
4. Unstructured Data:
Any data which is not in the form of fixed format is called Unstructured data.
Example: word, pdf,Text, image, ….the out put which is returned by google search.
Big data Analytics:
Big data analytics is the process of checking large data sets. To get to know hidden information, which contains the hidden patterns, market trends, correlation and customer preferences.
Example: Netflix, Youtube. Here they use big data analytics for targeted users, advertising. Company collects huge data, which is the key to achieving the industry status.
Advantages of Big Data Analytics:
·       Using the information kept in the social networking site like Facebook, Twitter the marketing agencies are learning about the responses for their promotions, and other advertising media.
·       New strategies of your competition are noticed immediately:
·       Product services improves dramatically.
·       Time saving
·       Cost saving
·       They can keep up customer trends.
What are the technologies are using Big Data Analytics:
1. Hadoop
2. MongoDB
3. Spark
4. Cassandra….

Previous: Data Analysis

Comments

Popular Posts