The data used by the research team was obtained from Imperial College London YouGov Covid 19 Behaviour Tracker Data Hub hosted at Github. The research mainly utilised survey conducted on January 2021. The field vac-1: If a Covid-19 vaccine were made available to me this week, I would definitely get it: of the survey is used to prepare the data visualisation.
The Task
a. Critic the Data Visualisation
Clarity
There is no indication of how many respondents are there for each country so it might be mislead the reader thinking that the number of respondents of each country are the same.
Since both of the original visualisations are bar charts (stacked and horizontal respectively), having different X-axis scale might mislead readers. The horizontal chart for “Strongly Agree” on the right with X-axis range from 0% to 60% looks much longer than the stacked bar chart on the left with x-axis range from 0% to 100% even though they should be of the same size.
The legend indicated “Vac 1” which is the alias for the survey question “If a Covid-19 vaccine were made available to me this week, I would definitely get it.”, but there is no mention of it anywhere. This might confuse readers on the meaning of “Vac 1”.
The stacked bar chart on the left is not easy to quickly see the proportions of disagree, agree, and neutral as there is no central guide to guide the readers.
The horizontal bar chart on the right did not take into considerations of uncertainty.
Aesthetics
There is no specific meaning to the colour scheme chosen and might confuse readers. For example, red is usually used for negative connotation and green is usually used for positive connotation, which should be used for disagree and agree respectively. However, red is used for neutral and green is used for “disagree” which is of a negative connotation.
Wrong spelling of “Vaccine” in the left chart title.
The legend was not labeled clearly and fully as only 1 and 5 are shown as “Strongly Agree” and “Strongly Disagree” respectively. 2, 3, and 4 were not labeled and were left to readers to derive.
The sorting order of the legend also differs from the actually presented order of the data in the visualisation. The order in the visualisation starts from strongly disagree on the left to strongly agree on the right, while the legend sorted in the opposite way from top to down. This is harder for the readers to compare and reference.
b. Alternative Graphical Presentation
Proposed Visualisation
Proposed Visualisation
Clarity
Add a tooltip to indicate the number of responses for each selected elements of the graph, and add a dynamic title to show total number of respondents.
Since we are changing the horizontal bar chart to a scatter plot with confidence interval, there is no need to sync the x-axis range as there is no similar element (bars) to compare, and thus, no confusion.
The proposed visualisation will be able to dynamically change the survey question and display the full question, rather than the alias e.g. Vac 1, in a dynamic title. This will make sure the readers know what is the actual context for the visualisation.
A diverging stacked bar with neutral response being centralised is used. This chart allows readers to quickly see the proportion of agree and disagree in a glance.
Uncertainty line is added to the proposed scatter chart to indicate the standard of errors at selected confidence interval.
Aesthetics
Red is used to indicate negative connotation (i.e. strongly disagree and disagree with the prior of a darker shade) and green is used to indicate positive connotation (i.e. strongly agree and agree with the prior of a darker shade.). Grey which is a neutral colour by nature is used for the “neutral” response.
Correct the spelling, if required.
The number 1 to 5 are taken out of the legend and replaced with the actual meaning. i.e. strongly agree, agree, neutral, disagree, and strongly disagree.
The legend will be sorted according to the order in the visualisation, from strongly agree to strongly disagree, so that it is easier for readers to reference.
We are provided with data sets from 29 countries. For the purpose of this makeover, we are using only the same data sets from the same 14 countries used i.e. Australia, Canada, Denmark, Finland, France, Germany, Italy, Japan, Netherlands, Norway, Singapore, South Korea, Sweden, and United Kingdom.
The data provided also contains many more columns or fields that we need. Tableau Public might takes too long to load if the dataset is too big so we will trim the data down to those that we need. All the data cleaning is done via Excel.
All Fields
We are only extracting the following fields:
RecordNo
endtime
household_children
age
gender
household_size
employment_status (4 data sets have 7 of these columns - take all of them)
vac_1
vac2_1
vac2_2
vac2_3
vac2_6
vac_3
Selected Fields
As mentioned in Step 2, there are 4 data sets with employment_status “unpivoted” with the answer in different columns and boolean value (1 or 0).
The 4 data sets are:
Denmark
Finland
Norway
Sweden
Unpivoted Employment Status
An Excel formula is used to combine the 7 columns into 1 column and mapped into the proper employment status using another formula as shown.
Reference Sheet for mapping.
Once the data sets are cleaned, we will import the data set into Tableau.
Select the following columns and click pivot.
vac_1
vac2_1
vac2_2
vac2_3
vac2_6
vac_3
Rename the Pivot Column and Pivot Value to Questions and Score_raw respectively.
Remove the initial data set from the board and double-click on Union. Select Manual Union and drag in all the data sets.
Go to the last column called Table. Rename the column to Country and create alias as follows.
Data is now prepped for visualisation.
Creating the Diverging Stacked Bar Chart
Create a calculated field called “Score” with the following formula.
Score - Calculated Field
Create calculated field called “Answer” with the following formula.
3. Click on the menu button of the Answer field and click Sort. Sort Answer by Score.
Sort Answer
Create a calculated field called Negative Scores with the following formula.
Negative Scores - Calculated Field
Create a calculated field called Total Negative Scores with the following formula and click on Default Table Calculation. Compute using Answer.
6. Create a calculated field called Total Scores with the following formula and click on Default Table Calculation. Compute using Answer.
7. Create a calculated field called Gantt Start using the following formula.
Gantt Start - Calculated Field
Create a calculated field called Percent of Total Sizing using the following formula.
Percent of Total Sizing - Calculated Field
Create a calculated field called Gantt Percent using the following formula and click on Default Table Calculation. Compute using Answer.
Gantt Percent - Calculated Field
Drag Gantt Percent to Columns and Country to Rows.
Rows and Columns
Go to Marks and drag Answer to Color, Percent of Total Sizing to Size, and Number of Records to Detail as shown below. Change chart type from Automatic to Gantt Bar.
Marks
Click on the legend on the top right and click Sort. Sort Answer by Field Score (Minimum) in Descending Order.
Legend Sort
Right click on the Null value in the Legend and click Exclude.
Legend - Exclude Null
Under Marks, click on Color > Edit Colors… and choose appropriate colours for each answer as shown below.
Strongly Disagree - Dark Red
Disagree - Red
Neutral - Light Grey
Agree - Green
Strongly Agree - Dark Green
15. Right-click on the X-axis and click Edit Axis… Change to Fixed and change Fixed Start to -1 and Fixed End to 1.
Right-click on the X-axis and click Add reference line. Select Constant and change value to 0. Change the line to a dashed line and select dark grey colour.
17. Double-click on the Title and change title to Opinions to Selected Question.
18. In preparation to the dynamic dashboard later, add the following fields to Filters.
Questions
gender
employment_status
age
endtime
household_size
household_children
Filter
Right-click on each of the filters and Apply to Worksheets > All Using This Data Source.
20. Add Number of Records to Details. Double-click on Tooltip and enter the following.
Tooltip
And the first visualisation: Diverging Stacked Bar Chart is done.
Diverging Stacked Bar Chart
Creating the Scatter Plot with Standard Error Line
Create a calculated field called “Score 1” using the formula below.
Score 1
Create a calculated field called “% Score 1” using the formula below.
% Score 1
Create a calculated field called “Prop” using the formula below.
Prop
Right-click on Dimension and click Create Parameter. Call it Confidence Interval and add in all the different Confidence Intervals with their respective Z score. Following the settings as follow. Right-click the Confidence Interval parameter and click Show Parameter Control.
Parameter - CI 1Parameter - CI 2Parameter - CI 3
Create a calculated field called “Prop SE” using the formula below.
Prop SE
Create a calculated field called “Prop_Margin of Error” using the formula below.
Prop_ SE_Margin of Error
Create a calculated field called “Prop_Lower Limit” using the formula below.
Prop_Lower Limit
Create a calculated field called “Prop_Upper Limit” using the formula below.
Prop_Upper Limit
Drag Measure Values to Columns and Country to Rows.
Cols and Rows
Remove all from Measure Values except “Prop_Upper Limit” and “Prop_Lower Limit” as shown.
Measure Values
Go to Marks, Change Chart Type to Line, and Crtl+Drag Measure Names to Color, Details and Path.
Marks
Drag Prop to Columns to the right of Measure Values. Right-click in it > Select Dual Axis > Select Synchronize Axis.
Prop to Columns
Right-click on Country and Sort. Sort by Prop in Descending Order.
Sort country
Adjust the size of the line and circle accordingly.
Change colours of Prop, Prop_Lower Limit, and Prop_Upper Limit to Dark Green (Colour of Strongly Agree). Change Colour Opacity of Prop_Lower Limit and Prop_Upper Limit to 24%.
Prop Colours
Right-click on the X-axis at the top and uncheck Show Header.
Hide Axis
Go to Analysis > Totals > Show Column Grand Totals > Column Grand Totals to Top.
Grand Totals
Right click on X-axis > Format > Change to Percentage (1 decimal point).
Format Axis
Double-click on Title and change to the following.
Title
Go to Marks > AGG(Prop) > Change to Circle type chart > Drag AGG(Prop) to Label.
Label
And the second visualisation: Scatter Plot with Standard Error Line is done.
Scatter Plot with Standard Error Line
Dynamic Title
Create a New Worksheet. Go to Marks and drag Questions, Score 1, Number of Records, and Prop into Detail. Change chart type to Polygon.
Marks
Double-click on the title and change to the following.
Format Title
Right-click on AGG(Prop) > Format > Change to Percentage (1 decimal point).
And you have a dynamic title for the dashboard later.
Dynamic Title
Creating the Dashboard
Create a New Dashboard. Drag both visualisations created onto the dashboard. Diverging Stacked Bar Chart on the left and Scatter Plot with Standard Error Line on the right, as shown below.
Dashboard 1
Right-click on the Diverging Stacked Bar Chart > Filter > Select all the filters that we have prepared earlier as shown below.
Dashboard 2
Format the Filter according to your preference and line them neatly on top of the 2 visualisations. Drag the dynamic title created earlier below the filters.
Double-click on the Dashboard Title and format it to your preference, or follow the format below.
Dashboard 5
And the dashboard is done.
Dashboard 6
e. Major Observations
Many are probably worried about potential side effects of vaccine that are created in about 1 short years (vaccine usually takes many more years to create) and are waiting for at least a year later before they can fully accept the vaccine.
This can be seen in the huge increase in the proportion of those who agree or strongly agree to the question “If a Covid-19 vaccine were made available to me a year from now, I definitely intend to get it” from that of the question “If a Covid-19 vaccine were made available to me this week, I would definitely get it.”.
The responses to the question “I am worried about the potential side effects of a COVID-19 vaccine.” are also steering more towards the Agree and Strongly Agree side, which confirmed our observation above.
Animated Transition - Vaccine made within a week versus a year.
Responses for Question
“If a Covid-19 vaccine were made available to me a year from now, I definitely intend to get it”.
Responses for Question
“If a Covid-19 vaccine were made available to me this week, I would definitely get it.”.
The responses from the above questions also give us insights on which are the countries that are most open to Covid-19 vaccination at the moment. The result consistently rank (1) United Kingdom and (2) Denmark as the top countries who has the highest proportion of Strongly Agree for Covid-19 vaccination-related questions.
Which is also not surprising that they are the countries least worried about the potential side effects of Covid-19 vaccination.
Responses for Question
“If a Covid-19 vaccine were made available to me a year from now, I definitely intend to get it”.
Responses for Question
“If a Covid-19 vaccine were made available to me this week, I would definitely get it.”. Responses for Question
“If I do not get a Covid-19 vaccine when it is available, I will regret it.”.
The older you are, the more open you are to Covid-19 vaccination and would not mind getting the vaccination if made available within a week. The tipping point is at around age 40. This could be due to the heavier responsibilities that come with age especially around age 40 when people probably got married with kids and further down, being a grandparent and wanting to see your grandchildren grows up.
Animated Transition - Vaccine made available in a week against Age