Data Analytics

What is it?

What Is Data Visualisation?

Not everyone can understand how a process works by reading words alone.  In a lot of cases, being able to see it pictorially and in practice enables someone to understand it further.  It becomes more than just theory. For example, in 1869, Charles Joseph Minard, a pioneer in his time in statistical analysis, depicted the downfall of Napoleon’s historic loss of the Russian invasion in 1812.  He did this by orchestrating numerical data on a map of the border between Russia and Poland.

Underneath all his work was the data visualisation process. Data visualisation is a technique usemd to create and analyse the way data is represented such as through the use of graphs and tables.  When data is shown in the most efficient way to a user, it becomes more accessible and easier to understand and use for further research and hypotheses. 

With the ever growing amount of data that is being collected and used, being able to show it effectively is extremely important.  It is an expanding challenge and data science techniques are needed in order to provide solutions.

Effective graphics take advantage of pre-attentive processing and attributes and the relative strength of these attributes. For example, since humans can more easily process differences in line length than surface area, it may be more effective to use a bar chart (which takes advantage of line length to show comparison) rather than pie charts (which use surface area to show comparison).

Image


Effectiveness

Effective Data Visualisation

Ideally, effective data visualisation should:

  1. Show the data
  2. Induce the viewer to think about the substance rather than about methodology, graphic design, the technology of graphic production or something else
  3. Avoid distorting what the data has to say
  4. Present many numbers in a small space
  5. Make large set of data easy to understand
  6. Encourage the eye to compare different pieces of data
  7. Reveal the data at several levels of detail, from a broad overview to the fine structure
  8. Serve a reasonably clear purpose: description, exploration, tabulation or decoration
  9. Be closely integrated with the statistical and verbal descriptions of a data set.


Not actually applying the principles to a data set may lead to incorrect or inaccurate tables or graph.  This then leads to the incorrect conclusion being made from the data, leading to further errors in key decision making and further hypotheses.


Use cases

Displaying sets of data

Data Presentation Architecture 

Data presentation architecture (DPA) is a process that sets out to identify, find, manipulate, format and present data in such a way as to optimally communicate meaning and proper knowledge. It is critical to business intelligence.

DPA is a skill set that includes determining what data is to be chosen, on what schedule and in what what it is to be presented, not just the the most effective way to present the data. Data visualisation techniques are an element of DPA.

The objective of DPA is to provide knowledge in the most efficient and effective way as possible.  This is done by reducing and removing noise, complex and unnecessary data and providing relevant and complete data which displays its meaning.  By doing so can improve understanding and enable better key decisions to be made.

DPA performs the following tasks to provide knowledge from the data:

Creating effective delivery mechanisms for each audience member depending on their role, tasks, locations and access to technology

Defining important meaning (relevant knowledge) that is needed by each audience member in each context

Determining the required periodicity of data updates (the currency of the data)

Determining the right timing for data presentation (when and how often the user needs to see the data)

Finding the right data (subject area, historical reach, breadth, level of detail, etc.)

Utilising appropriate analysis, grouping, visualisation, and other presentation formats.