Understanding Data

Simply put, according to Wikipedia, data means "known facts". When data is mentioned, what typically comes to mind is quantitative facts, i.e. numbers. However, data can also be words, images, sounds, or videos. In computing, data refers to information that has been translated into a form that a computer can process. Data in its most basic, unprocessed form is referred to as raw data.

Types of Data

A quick web search reveals many types of data, each depending on the context in which the data is used. This can be confusing to those new to the field. However, I have found it useful to know about as many types as possible, because this paints a more robust picture of what data is. Looking at many types gives you an idea of when each one applies and in what context, which makes the subject much easier to understand later on.

  • Big Data: This refers to data sets whose volume is so large that they cannot practically be stored and processed with a conventional database. The information is generated by both human and machine processes. Big data is the fuel that drives machine learning, and machine learning is in turn a building block of Artificial Intelligence.
  • Structured and Unstructured Data: These categories depend on whether the data conforms to an organization's predefined standards. Generally, structured data is data in a standardized format that relational databases can easily store and search. Most structured data is quantitative. Unstructured data, on the other hand, has no predefined format or organization, which makes it much more difficult to collect, process, and analyze. Most unstructured data is qualitative.
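
To make the structured versus unstructured distinction concrete, here is a minimal Python sketch. Everything in it is made up for illustration: the `orders` records, the `review` text, and the helper names `total_for` and `amounts_in` are assumptions, not part of any real data set or library. It only aims to show why predefined fields make data easy to query, while free text needs extra work before it can be analyzed.

```python
import re

# Structured data: every record has the same predefined fields,
# so a question becomes a simple lookup on a named field.
# (Hypothetical sample records, invented for this example.)
orders = [
    {"order_id": 1, "customer": "Ada", "amount": 120.50},
    {"order_id": 2, "customer": "Ben", "amount": 80.00},
    {"order_id": 3, "customer": "Ada", "amount": 42.25},
]


def total_for(customer, records):
    """Sum the 'amount' field over all records for one customer."""
    return sum(r["amount"] for r in records if r["customer"] == customer)


# Unstructured data: free text with no predefined fields.
# (Hypothetical sample text, invented for this example.)
review = "Ada here. Paid 120.50 last week and another 42.25 today, great service!"


def amounts_in(text):
    """Extract decimal numbers from free text using a regular expression."""
    return [float(match) for match in re.findall(r"\d+\.\d+", text)]


if __name__ == "__main__":
    print(total_for("Ada", orders))
    print(amounts_in(review))
```

With the structured records, the answer comes straight from the `customer` and `amount` fields. With the free text, the code has to guess which parts are amounts, and the simple pattern used here would also pick up any other decimal number in the text. That fragility is one reason unstructured data is harder to process.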

The Future of Data

The amount of big data keeps increasing, but soon simply having big data will matter less. Regardless of whether data is structured or unstructured, the most important factor will be having the most accurate data in order to gain an advantage in business or wherever the data is used. This advantage will lead to:

  • Better customer care and interactions
  • Larger market share
  • Higher profitability
  • Lower costs for companies that take advantage of an accurate data set

Also, high accuracy across the variety of data available will lead to better and more advanced algorithms. Rumor has it that Facebook is about to patent an algorithm that would let it infer its users' emotions from how they type (fast or slow) and how they swipe across the phone screen.