Big Data and the Vs of data

Big Data and the Vs of data

In this article, we explore the term Big Data, its definition and its characteristics.

What is big data?

The term Big Data refers to a very large set of data that no traditional database management or information management tool can really work with.

The information coming from a multitude of sources are contained in the messages we send each other, the videos we post, the weather information, GPS signals, online purchase transaction records, and much more. This data is Big Data.

Big data is more and more an integral part of a company strategy in terms of data management and usage and we are going to see the issues that emerge from these massive volumes of data.

The 4 fundamental Vs

Big Data specialists, especially at IBM, define Big Data by the following four Vs: Volume, Variety, Velocity and Veracity. These four dimensions characterize and distinguish big data from ordinary data.

Aucun texte alternatif pour cette image

Volume

The main characteristic of big data is volume. The term is indeed taken directly from the immense mass of data generated on a daily basis.

According to IBM, an average of 2.5 quintillion bytes of data is created each day, or about 2.3 trillion gigabytes. This data is growing day by day with the constant addition of data sources. The rise of the Internet of thing (IoT) is proof of this.

Year after year, the amount of data increases dramatically. Over the whole of 2020, 40 zettabytes of data will be created, or 43 trillion gigabytes.

This data must be stored somewhere and the cloud is one of the available solutions.

Variety

Beyond the simple quantity, this data is also more diverse than ever. This phenomenon is linked to the diversification of the uses of the Internet and the digital media. The source of data, its format, but also the field to which it is linked are experiencing unprecedented variety.

New types of data originating from social media, machine-to-machine and smartphones add new dimensions to traditional transactional data. This therefore requires changes in data organization models that no longer fit into neat and easy-to-use structures (see Key-Value, Columnar, Document, Graph).

Velocity

The speed and directions from which data arrives in the enterprise is increasing due to interconnection and advancements in network technology, so sometimes it arrives faster than we can make sense of. The faster the data arrives and the more varied the sources, the more difficult it is to derive value from that data. Traditional calculation methods are getting limited and in some cases not working on data arriving at today's speeds!

Veracity

Finally, the veracity of the data is related to the amount of reliable data when key decisions need to be made on such large volumes and collected so quickly. It's hard to simply know that the data is in fact not spoofed, has not been corrupted, or is coming from an expected source - it could be, for example, from one of the thousands of security cameras, each producing several thousand video frames per hour.

Aucun texte alternatif pour cette image

To a lesser extent, there are other Vs also related to Big Data:

  • Value: for all the information that can be extracted from Big Data
  • Variability: for the stability of data models and links that can be made in these mountains of data.


With the development of the Internet of Things (IoT) and the ongoing digitalization in many areas of society, science and business, the amount of data is not diminishing. Big Data, as a generic term, describes the large volume of data - both structured and unstructured - that floods a business and to a larger extent the world on a daily basis.

Aucun texte alternatif pour cette image

Big data V typical characteristics like storage, accessibility, speed of execution, processing and analysis represent a challenge and an opportunity for many years to come.

It's also good to remember too that in many cases it's not the amount of data that matters, but what organizations do with the data that matters. Big data needs to be analyzed to gain insight that drives better decisions and supports business strategy.


Sources :

https://www.ibm.com/developerworks/library/bd-bigdatacloud/index.html

https://aws.amazon.com/fr/big-data/what-is-big-data/

To view or add a comment, sign in

More articles by Pierre Baudin 丁俊杰

  • Business Intelligence VS. Data Science

    Avec l’explosion des demandes d’analyses des données au sein des organisations, deux grands courants se détachent : la…

  • Data Warehouse vs Data Lake vs Data Mart: The Guide - Definition

    The world of data is changing very quickly and it is very easy to get lost in all the technical terms that flourish…

    2 Comments
  • Data Warehouse vs Data Lake vs Data Mart : le guide - Usage

    Les data warehouses, les datamarts et data lakes constituent les éléments incontournables des écosystèmes de données…

    1 Comment
  • How to succeed in your data project

    Managing data isn't always the funniest part of data science. It includes several essential steps that require time and…

  • Machine learning and types of learning

    To follow up on my first article on what machine learning is, how it works and its usefulness (see: what is machine…

  • Data Warehouse vs Data Lake vs Data Mart : le guide - Définition

    Le monde de la data évolue très rapidement et il est très facile de se perdre dans tous les termes techniques qui…

  • Data Trends for 2021

    The data science revolution that is dramatically influencing the way companies approach their business and their…

    2 Comments
  • Méthodologie pour réussir son projet data

    La gestion des données n’est pas toujours la partie la plus amusante de la science des données. Elle inclut plusieurs…

    2 Comments
  • Strategies for dealing with missing data

    To follow up on my previous article on missing data and how it is categorized (see: missing data: classification)…

  • Les grandes tendances de la data en 2021

    La révolution de la science des données qui influence de manière spectaculaire la façon dont les entreprises approchent…

Others also viewed

Explore content categories