Big data is an umbrella term for extremely large data sets that are difficult to process and analyze in a reasonable amount of time using traditional methods. Big data consists of structured, unstructured, and semi-structured data. It is commonly characterized by five Vs: volume, velocity, variety, veracity, and value.

Volume describes the massive scale of data sets that contain terabytes, petabytes, or exabytes of data.

Velocity describes the high speed at which massive amounts of new data are generated.

Variety describes the broad assortment of data types and formats that are generated.

Veracity describes the quality and integrity of the data in an extremely large data set.

Value describes the data's ability to be turned into actionable insights.

