What is Big Data and How is it Used in the Real World?

by Jun 10, 2020Applications of IoT, Articles

What is Big Data ?

Big data is a large data source, ever-increasing and, in many cases, complex. It usually gathers data from various cheaply and widely available data sources. Some of these sources include daily transactions in small scale retail stores, sensors that determine a machine’s health and user behaviour on web sites. While managing such data might sound easy, the sheer number of data points is massive. Volume, Variety and velocity are the determining factors of Big Data, together they are termed the 3 Vs. How do you expect any computer to run complex analysis on such data? This is where the concept of big data arrives.

Tools Used in Big Data

Firstly, Big Data analytics requires a lot of computation to get it right. To setup these processes, we require tools that can guide us through the way. Here is a list of a few common tools in Big Data.

  • Apache Hadoop: Hadoop is a framework that allows you to first store Big Data in a distributed environment, so that, you can process it parallelly. In addition, it is an open-source framework written in Java (but is cross-platform).
  • Apache Spark: Spark is more like an alternative or successor of Hadoop. Essentially, they built Spark to overcome any drawbacks that Hadoop had.
  • Apache Cassandra: Cassandra is an open-source NoSQL DBMS. The main function was to manage large volumes of data. It employs CQL (Cassandra Query Language) to interact with the database.
  • Apache Storm: Storm is also an Apache product, a real-time framework to process data streams. It is free and open-source.
  • Apache Hive: Hive is a java based cross-platform data warehouse tool that facilitates data summarization, query, and analysis.

How Does Big Data Work ?

There are 3 main actions behind the working of Big Data – Integration, Management and Analysis. In other words, these are the names under which any Big Data process can be classified.

  • Integration

As mentioned earlier, the number of data points is what makes it “Big Data”. However, these large streams of data from ubiquitous sources have led to various problems. The volumes of data that we create are unimaginable. Quite often we deal with data in petabytes and sometimes even more. Hence, to make Big Data useful, we will have to get the data, process it and format it to suit the needs of the analysts.

  • Management

After getting your data from various sources, we need to store it. This storage is required if we want to access the data later. It is, most importantly, required by the computers to analyse the data. We can store the data locally if the resources are available. However, most people/companies prefer to use cloud services as it is easier to manage and they can just get new services when required.

  • Analysis

Now that we have all are required data stored, we can get to analysing it. The analysis is one of the major parts of the process as it allows the person to make sense of all the collected data. You can use it to make important market research and hence develop sales.

Uses of Big Data

The term Big Data was coined in the year 2005. However, this idea or concept has been in existence for longer. Most commonly Big Data has been used to make important business decisions based on several market factors. Apart from the aforementioned, there are many more use cases. Let us look into some of these use cases.

Source: datafloq
  1. Big data extracted from media, entertainment and social media

The data that is generated by millions of users on these sites is sent to databases for analysis. This analysis has plenty of benefits such as.

  • Optimizing recommendations to content
  • Generalising the interests of a user
  • Displaying more relatable advertisements

However, these benefits come with a few drawbacks as well which include privacy concerns.

  1. Big data generated by weather stations

Weather stations (both public and private) generate huge amounts of data on the local weather of every town in every country. This data can then be used to monitor and even predict the weather conditions. Other uses of this data include studies of global warming, prediction of natural disasters and to predict the availability of water in many places of the world. One such product launched in 1996 was IBMs Deep Thunder project. The aim of this project was mainly to improve the local weather forecasting using high-performance computing.

  1. Big Data in Banking security

With every passing day, the number of transaction increases at an unimaginable rate. However, the increase in the number of online transactions would most certainly result in an increase in the number of fraudulent transactions. The data generated by each transaction is extremely valuable to stop fraud from happening. The data collected on a normal day from a normal customer would be classified and stored. They can then train machine learning algorithms to match these patterns in the real world. If the algorithm detects any anomalies in these transactions, it would then flag them to be check by a person. The banks can prevent many different types of fraudulent activities with this principle.

Conclusion

I hope you all understood something new about the concept of Big Data. Although It is a pretty vast topic that has lots to discuss I have tried my best to cram in as much information as possible. Finally, if you have any question, comments or suggestions you can leave them in the comments section below.

Happy Learning !! 😃

Creating a multiplication Skill in Alexa using python

Written By Sashreek Shankar

Hey reader! I am Sashreek, a 16 year old programmer who loves Robotics, IoT and Open Source. I am extremely enthusiastic about the Raspberry Pi and Arduino hardware as well. I also believe in sharing any knowledge I have so feel free to ask me anything 🙂

RELATED POSTS

5 Booming Technologies in IoT to watch out for in 2022

5 Booming Technologies in IoT to watch out for in 2022

Introduction Internet of Things - IoT is one of the industries that has experienced an exponential rise in the past few years. With technology on the rise, we expect this field to grow even further in the coming years. It is one of the most important technologies...

Furtherance to SIM Technology: eSIM and embedded SIM

Furtherance to SIM Technology: eSIM and embedded SIM

eSIM (electronic SIM) and embedded SIM are two different terms. While both are under development and can be incorporated in IoT. They will result in more efficient SIM technology combined with the fast-growing and in-demand 5G network. Before going into the details...

The Internet of Nano Things (IoNT): Evolution of a new era

The Internet of Nano Things (IoNT): Evolution of a new era

Internet of Nano Things The internet of nano-things (IoNT) is a network that connects a collection of very small devices to transport data. The internet of nano-things is similar to the internet of things. The only difference is that the devices present inside it are...

10 Innovations in IoT Using 5G

10 Innovations in IoT Using 5G

5G usage cases typically depend on the improved speed and stability of 5G, as well as the reduced latency it provides, and they have the potential to disrupt both conventional and digital industries. And, in the coming months, years, and decades, 5G technology will...

What is Blockchain? How it can enhance IoT features?

What is Blockchain? How it can enhance IoT features?

In this article, we will learn about the “What is blockchain? How it can enhance IOT features?”. Before getting into the topic, lets brush up with basics about IOT and Blockchain. Blockchain refers to an encrypted, distributed, decentralized computer filing system...

IoT in the Education Sector

IoT in the Education Sector

Education in a literal sense means the process of receiving or giving systematic instruction, especially at a school or university, and with IoT, it is a more fun process. In simpler terms, it is an enlightening experience. Although traditional teaching may not have...

What is IoRT(Internet of Robotic Things)

What is IoRT(Internet of Robotic Things)

The IoT and robotics, two different fields, are coming together to create IoRT (Internet of Robotic Things). The IoRT is a concept in which intelligent devices can monitor the events happening around them, fuse their sensor data, use local and distributed intelligence...

Discover the Top 5 proven Use cases of IoT data analytics

Discover the Top 5 proven Use cases of IoT data analytics

Billions of connected IoT devices are generating a massive amount of data every second. Meanwhile, as the IoT is booming this data generation has exponential growth. This data needs to be analyzed in order to retrieve insights out of this data. Further, these insights...

Importance of Cybersecurity in IoT

Importance of Cybersecurity in IoT

The Internet of Things mainly refers to the everyday devices that have an internet connection and can communicate independently with the network and other devices. To improve our life, business, or the environment, we can use the information that is provided by these...

VIDEOS – FOLLOW US ON YOUTUBE

EXPLORE OUR IOT PROJECTS

IoT Smart Gardening System – ESP8266, MQTT, Adafruit IO

Gardening is always a very calming pastime. However, our gardens' plants may not always receive the care they require due to our active lifestyles. What if we could remotely keep an eye on their health and provide them with the attention they require? In this article,...

How to Simulate IoT projects using Cisco Packet Tracer

In this tutorial, let's learn how to simulate the IoT project using the Cisco packet tracer. As an example, we shall build a simple Home Automation project to control and monitor devices. Introduction Firstly, let's quickly look at the overview of the software. Packet...

All you need to know about integrating NodeMCU with Ubidots over MQTT

In this tutorial, let's discuss Integrating NodeMCU and Ubidots IoT platform. As an illustration, we shall interface the DHT11 sensor to monitor temperature and Humidity. Additionally, an led bulb is controlled using the dashboard. Besides, the implementation will be...

All you need to know about integrating NodeMCU with Ubidots over Https

In this tutorial, let's discuss Integrating NodeMCU and Ubidots IoT platform. As an illustration, we shall interface the DHT11 sensor to monitor temperature and Humidity. Additionally, an led bulb is controlled using the dashboard. Besides, the implementation will be...

How to design a Wireless Blind Stick using nRF24L01 Module?

Introduction Let's learn to design a low-cost wireless blind stick using the nRF24L01 transceiver module. So the complete project is divided into the transmitter part and receiver part. Thus, the Transmitter part consists of an Arduino Nano microcontroller, ultrasonic...

Sending Temperature data to ThingSpeak Cloud and Visualize

In this article, we are going to learn “How to send temperature data to ThingSpeak Cloud?”. We can then visualize the temperature data uploaded to ThingSpeak Cloud anywhere in the world. But "What is ThingSpeak?” ThingSpeak is an open-source IoT platform that allows...

Amaze your friend with latest tricks of Raspberry Pi and Firebase

Introduction to our Raspberry Pi and Firebase trick Let me introduce you to the latest trick of Raspberry Pi and Firebase we'll be using to fool them. It begins with a small circuit to connect a temperature sensor and an Infrared sensor with Raspberry Pi. The circuit...

How to implement Machine Learning on IoT based Data?

Introduction The industrial scope for the convergence of the Internet of Things(IoT) and Machine learning(ML) is wide and informative. IoT renders an enormous amount of data from various sensors. On the other hand, ML opens up insight hidden in the acquired data....

Smart Display Board based on IoT and Google Firebase

Introduction In this tutorial, we are going to build a Smart Display Board based on IoT and Google Firebase by using NodeMCU8266 (or you can even use NodeMCU32) and LCD. Generally, in shops, hotels, offices, railway stations, notice/ display boards are used. They are...

Smart Gardening System – GO GREEN Project

Automation of farm activities can transform agricultural domain from being manual into a dynamic field to yield higher production with less human intervention. The project Green is developed to manage farms using modern information and communication technologies....