Big Data
What is Data?
data is information processed or stored by a computer. This
information may be in the form of text documents, images, audio clips, software
programs, or other types of data.
What is Big Data?
The data which are large in size is called Big Data. Normally
we work on data of size KB(Text file, small work files),Size in MB (Large scale
word documents, Excel, ppts), GB (Movies, Project Code) but if the data size is
in Peta Bytes is called Big Data.
How is Big Data getting created in the memory?
Every time one person opens any application(apps) their
mobile/web page, signs up online on a platform or even we search the
information in the search engines(Google,…), so piece of data is gathered.
These data come from many source:
1. Social networking
Sites
2. Stock exchange
3. E-commerce site
4. Telecom companies
Social networking Sites:
Face Book, Google, Linkedln, Instagram, Twitter….all these
sites generates huge amount of data every day
2. Stock exchange:
The BSE/NSE stock exchange generates huge amount of data (tera
bytes in size) of new trade data per day.
3. E-Commerce site: Amazon, Flipcart,Alibaba, Myntra,…..generates
huge amount of data from the users..
4. Telecom Company: Like AirTel, Jio,Vodafone, BSNL, IDEA,…user
trends and accordingly publish their plans and for this they save the data of
its million users.
5. Share Market: stock exchanges across the world which generates huge amount
of data through its daily transaction.
Data set: a collection of related sets of information that is
maintained a separated elements of data, which can be manipulated as a unit by
a computer.
(or) It is a collection of data, which is related to a
specific person/object/subject.
Types of Big Data:
1. Structured Data
2. Semi-structured Data
3. Un-Structured Data
Structured Data:
Any data that can be stored, accessed and processed in the
form of fixed format is called ‘Structured Data’. Example: Relational Data.
Example: Emp table in a database is an example of structured
data.
Empid
|
Ename
|
Gender
|
Proj
|
Sal
|
2314
|
Naik
|
M
|
YTC
|
20000
|
2315
|
Sita
|
F
|
Esc
|
20000
|
2. Semi-Structured Data: XML (Extensible Markup Language) data:
.xml
<Emp><Name>rajendra</Name></emp>
4. Unstructured Data:
Any data which is not
in the form of fixed format is called Unstructured data.
Example: word,
pdf,Text, image, ….the out put which is returned by google search.
Big data Analytics:
Big data analytics is the process of checking large data
sets. To get to know hidden information, which contains the hidden patterns,
market trends, correlation and customer preferences.
Example: Netflix, Youtube. Here they use big data analytics
for targeted users, advertising. Company collects huge data, which is the key
to achieving the industry status.
Advantages of Big Data Analytics:
·
Using the information kept in the social networking site like
Facebook, Twitter the marketing agencies are learning about the responses for
their promotions, and other advertising media.
·
New strategies of your competition are noticed immediately:
·
Product services improves dramatically.
·
Time saving
·
Cost saving
·
They can keep up customer trends.
What are the technologies are using Big Data Analytics:
1. Hadoop
2. MongoDB
3. Spark
4. Cassandra….
Comments
Post a Comment