{"id":1,"date":"2014-08-29T17:50:49","date_gmt":"2014-08-29T17:50:49","guid":{"rendered":"http:\/\/www.karlhartkopf.com\/blog\/?p=1"},"modified":"2019-03-13T02:59:05","modified_gmt":"2019-03-13T02:59:05","slug":"time-com-makes-beautiful-incorrect-maps-by-compounding-flawed-data-with-silly-mistakes","status":"publish","type":"post","link":"http:\/\/www.karlhartkopf.com\/blog\/2014\/08\/29\/time-com-makes-beautiful-incorrect-maps-by-compounding-flawed-data-with-silly-mistakes\/","title":{"rendered":"Time.com Makes Beautiful Incorrect Maps By Compounding Flawed Data With Silly Mistakes"},"content":{"rendered":"<p><span style=\"font-family: Verdana, Arial, Helvetica, sans-serif;\">Using big open data, Time.com has produced and published <a title=\"Safest Counties \/ Most Dangerous Counties\" href=\"http:\/\/time.com\/safest-counties\/\" target=\"_blank\" rel=\"noopener\">&#8220;How Safe Am I From a Natural Disaster&#8221;<\/a> in which it presented\u00a0a &#8220;disaster index&#8221; for each county in the United States based on National Weather Service (NWS) and National Oceanic and Atmospheric Administration (NOAA) data. \u00a0While the idea is exciting, the\u00a0resulting piece doesn&#8217;t recognize the\u00a0flaws in the source data and\u00a0compounds those flaws by making some mistakes of their own in the process.<\/span><\/p>\n<p><span style=\"font-family: Verdana, Arial, Helvetica, sans-serif;\">Issues with this interactive map and dataset\u00a0are best illustrated by <a href=\"http:\/\/time.com\/safest-counties\/#22071\" target=\"_blank\" rel=\"noopener\">examining Time.com&#8217;s data for Orleans Parish, Louisiana (LA)<\/a>, which gets you the table below *:<\/span><\/p>\n<div id=\"attachment_8\" style=\"width: 371px\" class=\"wp-caption alignnone\"><a href=\"http:\/\/www.karlhartkopf.com\/blog\/wp-content\/uploads\/2014\/08\/Orleans-Parish-New-Orleans-County-Safety-Time-Magazine.jpg\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-8\" class=\" wp-image-8\" src=\"http:\/\/www.karlhartkopf.com\/blog\/wp-content\/uploads\/2014\/08\/Orleans-Parish-New-Orleans-County-Safety-Time-Magazine-222x300.jpg\" alt=\"New Orleans - Orleans Parish Natural Disaster Data Detail From Time.com Retrieved 10:30 am EST August 29, 2014\" width=\"361\" height=\"487\" srcset=\"http:\/\/www.karlhartkopf.com\/blog\/wp-content\/uploads\/2014\/08\/Orleans-Parish-New-Orleans-County-Safety-Time-Magazine-222x300.jpg 222w, http:\/\/www.karlhartkopf.com\/blog\/wp-content\/uploads\/2014\/08\/Orleans-Parish-New-Orleans-County-Safety-Time-Magazine.jpg 640w\" sizes=\"auto, (max-width: 361px) 100vw, 361px\" \/><\/a><p id=\"caption-attachment-8\" class=\"wp-caption-text\">New Orleans &#8211; Orleans Parish Natural Disaster Data Detail From Time.com Retrieved 10:30 am EST August 29, 2014<\/p><\/div>\n<p><span style=\"font-family: Verdana, Arial, Helvetica, sans-serif;\">Choosing\u00a0Orleans Parish (parishes are Louisiana&#8217;s equivalent of counties) for comparison is intentional, given that over <a title=\"Hurricane Katrina Death Toll\" href=\"http:\/\/en.wikipedia.org\/wiki\/Hurricane_Katrina#Impact\" target=\"_blank\" rel=\"noopener\">1,500 people died during Hurricane Katrina<\/a>\u00a0in 2005, most of them residents of New Orleans, which shares\u00a0the same boundary as Orleans Parish.\u00a0 In addition to being one of the five deadliest hurricanes in history, Hurricane Katrina was the costliest ever natural disaster in the United States causing over $100 billion of\u00a0damage. <\/span><\/p>\n<p><span style=\"font-family: Verdana, Arial, Helvetica, sans-serif;\">Yet Orleans Parish does not appear on Time.com&#8217;s top ten list of &#8220;Most Dangerous Counties&#8221;, with Time.com ranking it as the 266th most dangerous county in the United States while ranking Ocean County, NJ the <a href=\"http:\/\/time.com\/safest-counties\/#34029\" target=\"_blank\" rel=\"noopener\">1st most dangerous in the nation<\/a>. \u00a0Ocean County&#8217;s most dangerous ranking is assigned in\u00a0spite of the highest number of confirmed deaths from a single event over the last 65 years was <a href=\"http:\/\/www.reuters.com\/article\/us-storm-sandy-deaths-idUSBRE8AF0ZX20121116\" target=\"_blank\" rel=\"noopener\">37 dead from Hurricane Sandy in 2012<\/a>\u00a0(btw, those deaths are not found this Time.com project), with\u00a0few other deaths beyond those. Furthermore, Time.com indicates only property damage on August 28, 2005 and nothing for subsequent days after Hurricane Katrina.<\/span><\/p>\n<p><span style=\"font-family: Verdana, Arial, Helvetica, sans-serif;\">To figure out why this seemingly inexplicable result exists, let&#8217;s start by identifying the source data from which the mapping and analysis was generated.\u00a0 Time.com claims that &#8220;The disaster index is calculated from all records in the National Oceanic and Atmospheric Administration storm events database, which includes earthquakes and tornados [sic] from 1950 and 42 other event types since 1996.&#8221;\u00a0 Here is a <a title=\"National Oceanic and Atmospheric Administration (NOAA) National Climatic Data Center (NCDC) Storm Events Database Data Sources\" href=\"http:\/\/www.ncdc.noaa.gov\/stormevents\/details.jsp?type=collection:\">summary of NOAA source descriptions<\/a>:\u00a0<\/span><\/p>\n<ol>\n<li><span style=\"font-family: Verdana, Arial, Helvetica, sans-serif;\">National Weather Service (NWS) Storm Prediction Center which includes Tornadoes (1950-1992), Thunderstorm Winds (1955-1992) and Hail (1955-1992)<\/span><\/li>\n<li><span style=\"font-family: Verdana, Arial, Helvetica, sans-serif;\">National Weather Service (NWS) Weather Offices keyed Storm Data files sent to National Climatic Data Center (NCDC) in Word Perfect 5.0 format on 3.5&#8243; floppy diskettes and imported into more unified data files.\u00a0 They note that &#8220;These data had many inconsistencies in the spelling of event types and were standardized into the 48 current event types at NCDC in 2013.&#8221;<\/span><\/li>\n<li><span style=\"font-family: Verdana, Arial, Helvetica, sans-serif;\">National Weather Service (NWS) Paradox Database Files: I can&#8217;t do any better job describing how dirty the data was than letting NOAA&#8217;s description speak for itself; &#8220;Beginning in January 1996, the NWS began using Storm Data for forecast verification purposes and the NWS decided to collect the data at the NWS Headquarters in Silver Spring, MD. They selected Borland\/Corel Paradox format for their database and supplied the NCDC with the raw data files, which were then used for the Storm Data publication and inclusion into the Storm Events Database. From 1996-1999, the event type field was a free-text field so there were many, many variations of event types. Most of the events were standardized into the 48 current event types in 2013. In 2000 the NWS added a drop-down selector for Event Type on the data entry interface, which standardized the Event Type values sent to NCDC.&#8221;<\/span><\/li>\n<li><span style=\"font-family: Verdana, Arial, Helvetica, sans-serif;\">National Weather Service (NWS) Windows SQL Server Database Exported Comma-Separated Text (CSV) Files: Beginning in October 2006, the NWS supplied NCDC a comma separated (CSV) text files that NCDC import into its own database for the Storm Data publication and Storm Events Database.<\/span><\/li>\n<\/ol>\n<p><span style=\"font-family: Verdana, Arial, Helvetica, sans-serif;\">Time.com&#8217;s first failure was not to prominently note NOAA&#8217;s own overt recognition of messy data, data often collected and recorded without rigorous field specifications and definitions. While the NOAA data descriptions note\u00a0that the NWS and NOAA cleaned them up somewhat, the data remains messy.<\/span><\/p>\n<p><span style=\"font-family: Verdana, Arial, Helvetica, sans-serif;\">Delving into how messy this data gets us answers on the faulty specifics. In the Time.com Orleans Parish data table produced by the Time.com Safest Counties tool (shown above in &#8220;New Orleans &#8211; Orleans Parish Natural Disaster Data Detail&#8221;), property damage is listed as only on August 28, 2005, with no deaths at all *** and only direct costs for August 28th and 29th, in spite of over 50 levees breaching causing billions of dollars worth of flood damage and hundreds of deaths.<\/span><\/p>\n<p><span style=\"font-family: Verdana, Arial, Helvetica, sans-serif;\"><a title=\"Storm Events Database Extract of Orleans Parish Events between 01\/01\/1994 and 05\/31\/2014\" href=\"http:\/\/1.usa.gov\/1qNz8xB\">Pulling New Orleans storm event data directly from the National Weather Service&#8217;s Storm Events Database<\/a>, Table 1: Hurricane Katrina Late August NWS &#8211; NOAA Storm Events Entries (abbreviated) contains two faulty entries of interest:<\/span><\/p>\n<p><span style=\"font-family: Verdana, Arial, Helvetica, sans-serif;\">Table 1: Hurricane Katrina Late August NWS &#8211; NOAA Storm Events Entries (abbreviated)<\/span><\/p>\n<div style=\"overflow-x: auto;\">\n<table style=\"height: 114px;\" width=\"740\">\n<tbody>\n<tr>\n<td width=\"78\">EVENT_ID<\/td>\n<td width=\"120\">CZ_NAME_STR<\/td>\n<td width=\"117\">BEGIN_LOCATION<\/td>\n<td width=\"147\">BEGIN_TIME<\/td>\n<td width=\"120\">DEATHS_DIRECT<\/td>\n<td width=\"183\">\u00a0DAMAGE_PROPERTY_NUM<\/td>\n<\/tr>\n<tr>\n<td>5475622<\/td>\n<td>ORLEANS (ZONE)<\/td>\n<td>8\/28\/2005<\/td>\n<td>Hurricane (Typhoon)<\/td>\n<td>0<\/td>\n<td>\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 3,560,000,000<\/td>\n<\/tr>\n<tr>\n<td>5475650<\/td>\n<td>ORLEANS (ZONE)<\/td>\n<td>8\/29\/2005<\/td>\n<td>Storm Surge\/Tide<\/td>\n<td>0<\/td>\n<td>\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 17,900,000,000<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<p><span style=\"font-family: Verdana, Arial, Helvetica, sans-serif;\">It is now clear why Time.com reported no deaths; the lack of deaths was in the Weather Service&#8217;s database! ***<\/span><\/p>\n<p><span style=\"font-family: Verdana, Arial, Helvetica, sans-serif;\">The next mistake was Time.com&#8217;s and perhaps the more significant one; exclusively using the number of claims as opposed to deaths, damages, crop damage or a weighted assessment of multiple criteria. Using this single, significantly flawed database compounded the problem and led to very problematic conclusions.<\/span><\/p>\n<p><span style=\"font-family: Verdana, Arial, Helvetica, sans-serif;\">There are a few lessons for data analysts and data team journalists:<\/span><\/p>\n<ul>\n<li><span style=\"font-family: Verdana, Arial, Helvetica, sans-serif;\">Test your own tool alongside real world facts and make sure the stories they tell are similar.<\/span><\/li>\n<li><span style=\"font-family: Verdana, Arial, Helvetica, sans-serif;\">Go back to the original data and compare multiple records from that source to your final product. Choose high and low outliers AND additional random cases to insure processing and display accuracy.<\/span><\/li>\n<li><span style=\"font-family: Verdana, Arial, Helvetica, sans-serif;\">Before going live \/ public, have someone inside AND outside your organization review your final product.\u00a0 Familiarity breeds blindness.<\/span><\/li>\n<\/ul>\n<p><span style=\"font-family: Verdana, Arial, Helvetica, sans-serif;\">And really, any readers of journalism that contains data from large data sets should cast a critical eye on the source and quality of data.<\/span><\/p>\n<p><span style=\"font-family: Verdana, Arial, Helvetica, sans-serif;\">Big-data driven, computer-assisted reporting is here to stay, and that is as it should be.\u00a0 We can discover new truths\u00a0about our world through the analysis of data which in many cases has existed for decades. That data was just waiting for the tools, computing power and new-found excitement for vigorous and efficient science, business, and government.\u00a0 While the problems revealed by this study can be minimized by good data practice, they won&#8217;t be eliminated and we should accept that. \u00a0After all, in this budget-pinched, high-speed news world, the answer to the\u00a0question &#8220;Would you rather something be reported and corrected, or not be reported at all?&#8221; should almost certainly be to accept the former and fight the later.<\/span><\/p>\n<p><span style=\"font-family: Verdana, Arial, Helvetica, sans-serif;\">Data driven journalism requires the same level of reader awareness and skepticism as does traditional journalism, and perhaps even a bit more data checking.\u00a0 &#8220;Don&#8217;t believe everything you read&#8221; doesn&#8217;t magically go away once a map, chart or table of numbers appears, be it in your local newspaper or on the Internet.<\/span><\/p>\n<p><span style=\"font-family: Verdana, Arial, Helvetica, sans-serif;\">* Update November 24, 2014 &#8211; In addition to a\u00a0few small corrections made in the body of the article, what follows is a clarification Time.com&#8217;s project\u00a0title. \u00a0Though the title of the piece at the top of their interactive map website is &#8220;How Safe am I From a Natural Disaster&#8221;, the page title for the piece embedded in the page&#8217;s code but not displayed directly on the page is the &#8220;Most Dangerous Place to Live in America: Natural Disasters&#8221;. The feature&#8217;s URL of\u00a0<a title=\"Safest Counties\" href=\"http:\/\/time.com\/safest-counties\/\" target=\"_blank\" rel=\"noopener\">http:\/\/time.com\/safest-counties\/<\/a>\u00a0and &#8220;How Safe am I&#8230;&#8221; title do\u00a0suggest that they intended to indicate lower numbered counties as\u00a0&#8220;safer&#8221; and higher numbered counties as\u00a0more &#8220;dangerous&#8221;.<\/span><\/p>\n<p><span style=\"font-family: Verdana, Arial, Helvetica, sans-serif;\">This piece is\u00a0part of Time.com&#8217;s\u00a0bigger project\u00a0called &#8220;<a title=\"Time.com The Answers Issue\" href=\"http:\/\/time.com\/the-answers-issue\/\" target=\"_blank\" rel=\"noopener\">Time, The Answers Issue<\/a>&#8220;, the\u00a0subtitle of which is &#8220;Everything You Never Knew You Needed to Know&#8221;. \u00a0Two final notes. \u00a0I am not sure anymore whether to\u00a0call the organization &#8220;Time Magazine&#8221;, &#8220;Time.com&#8221; or just &#8220;Time&#8221; given that while there ARE still print issues, much or all\u00a0content is also available online (and more?). While Time, Inc.\u00a0still refers to regularly released\u00a0collections of articles as &#8220;issues&#8221;, it isn&#8217;t always clear if these collections are &#8220;magazines&#8221; anymore given\u00a0the interactive features they contain and the fact that they may have no physical analog. And just for fun,\u00a0the\u00a0title of the piece at &#8220;The Answers Issue&#8221; portal uses yet another title variant and weblink;\u00a0&#8220;<a title=\"How Safe Are You From a Natural Disaster\" href=\"http:\/\/time.com\/?page_id=3174898\" target=\"_blank\" rel=\"noopener\">How Safe Are <strong>You<\/strong> From a Natural Disaster<\/a>?&#8221;<\/span><\/p>\n<p><span style=\"font-family: Verdana, Arial, Helvetica, sans-serif;\">*** Update January 15, 2016 &#8211; NOAA has updated and corrected the database which now indicates\u00a0638 deaths for August 28, 2005 in Orleans Parish. I am not sure how long it took to NOAA to fix the Katrina entry since my original August 2014 publication date, but even if they fixed it immediately that meant that their entry was wrong for ten years prior to this post&#8217;s publication.<\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Using big open data, Time.com has produced and published &#8220;How Safe Am I From a Natural Disaster&#8221; in which it presented\u00a0a &#8220;disaster index&#8221; for each county in the United States based on National Weather Service (NWS) and National Oceanic and Atmospheric Administration (NOAA) data. \u00a0While the idea is exciting, the\u00a0resulting piece doesn&#8217;t recognize the\u00a0flaws in [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[2,6,5,3],"tags":[18,17,15,16,14,12,13],"class_list":["post-1","post","type-post","status-publish","format-standard","hentry","category-data","category-geographic-information-systems-gis","category-hazard-mitigation","category-planning","tag-big-data","tag-computer-assisted-journalism","tag-hurricane-katrina","tag-interactive-mapping","tag-louisiana","tag-new-orleans","tag-orleans-parish"],"_links":{"self":[{"href":"http:\/\/www.karlhartkopf.com\/blog\/wp-json\/wp\/v2\/posts\/1","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/www.karlhartkopf.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/www.karlhartkopf.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/www.karlhartkopf.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"http:\/\/www.karlhartkopf.com\/blog\/wp-json\/wp\/v2\/comments?post=1"}],"version-history":[{"count":9,"href":"http:\/\/www.karlhartkopf.com\/blog\/wp-json\/wp\/v2\/posts\/1\/revisions"}],"predecessor-version":[{"id":60,"href":"http:\/\/www.karlhartkopf.com\/blog\/wp-json\/wp\/v2\/posts\/1\/revisions\/60"}],"wp:attachment":[{"href":"http:\/\/www.karlhartkopf.com\/blog\/wp-json\/wp\/v2\/media?parent=1"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/www.karlhartkopf.com\/blog\/wp-json\/wp\/v2\/categories?post=1"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/www.karlhartkopf.com\/blog\/wp-json\/wp\/v2\/tags?post=1"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}