Iron Viz 2015 Entry (First Public Viz)

I have been challenged by Paul Banoup to create a viz for Tableau's annual IronViz The viz can be about any topic, but all the data has to come from Wikipedia. I had been looking at Data from Wikipedia anyway out of interest a few weeks ago and thought I could do something with the shipwreck data on there.

The Data:

 

The Data Cleaning:

Copy data from WIKI "list of shipwrecks of the United Kingdom" paste in to excel, (exculding England). Repeat for "list of shipwrecks of Endland" and add the two together. 
Then clean up Null rows and add a coloumn with Area List is split into area sections. 
To get the data for location I had to be split location by "/" to get a usable longitude and Latitude section, split this again by ";" this gives you a longitude field and latitude field (clean up any special charictors). A number of rows do not have coordinates, for these I found the locations mentioned in the notes field in wiki and used that instead. The cut and paste of the list removed the exact link infomation to each ship's Wiki Page. To recreated the link with the ship name, I checked each of these and replaced with correct link if there were more than one (the royal Navy particulally like to reused the same name for it's ships).

 

The Tableau Charts:

Connect the data to tableau

I started with a map and placed longitude and latitude. This quickly showed that one Wreck was out of place (a ship called the Amsterdam that sank off Hastings, the wiki coordonates are actually the city Amsterdam!) I corrected this in the data by using the wiki coordinates for Hastings instead. then change the mark type to X's.

The sunk date data was a bit tricky, tableau (and excel) do not recognise dates pre 1900 and the format of the dates was not consistant anyway. I therefore left the dates as a string and grouped them manually into the following buckets ("period sunk"): 
pre 1800 
1800 - 1913 
World war I 
Between World Wars 
World War II 
Post WWII

I could then create a simple bar chart with these groups and the "count of records" renaming to "count of Shipwrecks". then used these Period's to colour the bar chart and the map X's.

 

The Tableau Dashboard:

First I added the map and the bar to a dashboard, the added a Web page to the dash board with a default of "list of shipwrecks of the United Kingdom" .
Next I created an action so that when you hover over the coordinates it changes the Web Page to the url for the Ship link from wiki . I also added an action to the bar chart so the map is filtered on the "period Sunk" and a couple of filter dropdowns ("Area" and "Ship"). then I created a "reset button" (to return the web page back to the "list of shipwrecks of the United Kingdom" start page). I did this using a pie chart and URL action! 
Finally I added some text annotating the bar chart and an image of the Wiki logo to emphisies the data source
 

The End Result:

ShipwrecksoftheUnitedKingdom

Update 21st March

Paul Banoup had also told me that I will not be able to resist tinkering with the dashboard, he was right! I enhanced the data again from the wiki pages of the ships to include Year the ship sunk, Navy and type of ship. this enabled me to add a bar chart with the number of shipwreck per year (as both Excel adn Tableau are not good with dates pre 1900, but a year can be a number) and a horizontal stacked bar with the flags of the navy's country with actions (again the flags are copied from wiki entries for each country)

Hope you like the final result as final submission is tomorrow! - ShipwrecksoftheUnitedKingdom