Version 0.2
Contributors
Contributors to this book include:
David Banisar, Article 19
Caelainn Barr, EU Data Journalist
Mariano Blejman, Hacks/Hackers
Marianne Bouchart, Data Journalism Blog
Liliana Bounegru, European Journalism Centre
Brian Boyer, Chicago Tribune
Jane Park, Creative Commons
Paul Bradshaw, Birmingham City University, City University London
Lucy Chambers, Open Knowledge Foundation
Helen Darbishire, Access Info Europe
Steve Doig, Cronkite School of Journalism
David Erwin, New York Times
Lisa Evans, Guardian Datablog
Tom Fries, Bertelsmann Stiftung
Duncan Geere, Wired.co.uk
Rich Gordon, Northwestern University
Jonathan Gray, Open Knowledge Foundation
Ted Han, DocumentCloud
Kate Hudson, Open Journalism
Francis Irving, ScraperWiki
Lizzie Jackson, Ravensbourne College
Nicolas Kayser-Bril, Data Journalist
John Keefe, New York Public Radio
Friedrich Lindenberg, Open Knowledge Foundation
Lorenz Matzat, OpenDataCity
Aidan McGuire, ScraperWiki
Philip Meyer, University of North Carolina at Chapel Hill
Cynthia O'Murchu, Financial Times
Aron Pilhofer, New York Times
Anthony Reuben, BBC
Simon Rogers, Guardian Datablog
Amanda Rossi, freelance journalist
Fabrizio Scrollini, London School of Economics
Adam Thomas, Source Fabric
Andrew Vande Moere, infosthetics.com
Sascha Venohr, Zeit Online
Jerry Vermanen, De Stentor
César Viana, Estácio de Sá University
Farida Vis, University of Leicester
Coordinators
European Journalism Centre
Open Knowledge Foundation
Table of contents
The Data Journalism Handbook
Contributors
Table of contents
0. Preface
0.1 The purpose of this book
0.2 Add to this book
0.3 Share this book
1. Introduction
1.1 What is data journalism?
1.2 Why is it important?
2. Introducing data journalism in the newsroom
2.1 Changes in the newsroom
2.2 How is it done: journo-developers vs. coders for hire
3. Types of outcomes/projects and case studies
3.1 Data powered stories
3.2 Data served with stories
3.3 Data driven applications
4. Working on the data story
4.1 Step 1: Getting your data
4.1.1 Where does data live?
4.1.2 Asking for data
4.1.3 Getting your own data
4.2 Step 2: Understanding your data
4.2.1 Data literacy
4.2.2 Working with data tips
4.2.3 Tools and techniques for analysing data
4.2.4 Harnessing external expertise
4.3 Step 3: Finding a story in your data
4.3.1 From datasets to stories - approaches
4.4 Step 4: Delivering your data project
4.4.1 Serving data with stories
4.4.2 Visualising data
4.4.3 Data driven applications
5. Engagement, outreach and community
6. How to make data journalism sustainable
6.1 Measuring impact
6.2 Business models
7. Appendix
7.1 Further resources
7.2 Glossary
Project hashtag: #ddjbook Overview of progress: http://bit.ly/ssiDYe (as of Sunday 6 November). More recent updates in text below in yellow highlight.
Questions? Want to contribute? Get in touch: Liliana Bounegru (bounegru@ejc.net) Lucy Chambers (lucy.chambers@okfn.org)
0. Preface
0.1 The purpose of this book
Overview: Explain what this book does and doesn't aim to do.
Authors: Lucy Chambers, Liliana Bounegru
Length: 0.5-1 page
1. Introduction
1.1 What is data journalism?
Overview: Define and describe data journalism and how it is different from other forms of journalism.
Authors: Paul Bradshaw, Jonathan Gray, Aron Pilhofer, Jerry Vermanen, Philip Meyer, Duncan Geere, David Anderton, Federica Cocco, Brian Boyer, JV Chamary, [Heather Brooke], [Simon Rogers], [Richard Gordon]
Length: 4 pages (with quotes from different people)
Editor: Liliana Bounegru (European Journalism Centre)
Peer-reviewer: Jonathan Gray (Open Knowledge Foundation)
UPDATE: First draft of chapter finished.
STILL NEED: Peer-review.
UPDATE: Pending input from Justin Arenstein
STILL NEED: Input from Justin Arenstein
What was your approach? (exploratory vs. hypothesis approach)
What techniques and tools did you use?
How did you present the data powered story?
What is the potential of data powered stories?
Why should journalists/newsrooms be interested in producing such projects?
What were the challenges in producing these stories?
What tips and advice would you give to journalists who want to work on similar projects?
Please include relevant links, videos and images.
Authors: Steve Doig, Cynthia O'Murchu, Caelainn Barr, Sascha Venohr, Amanda Rossi
Length: 1.5-3 pages per example
UPDATE: Ready for review
EDITOR: Lucy/Kat
Overview: Give and describe successful examples of data driven applications you worked on, and describe how you produced them. The aim is to give journalists and newsroom decision-makers who might be interested in data journalism a sense of the potential of data driven applications and of how they could go about producing them.
What data did you use and how did you obtain it?
What prompted you to start this project?
What did the project aim to achieve?
How long did you work on the project?
How many people worked on it?
What was the cost of the project?
What skills were necessary for this project? (domain knowledge, coding, research, visualisation, etc.)
What was your approach?
What techniques and tools did you use?
How did you present the outcome?
What is the potential of such projects?
Why should journalists/newsrooms be interested in producing such projects?
What were the challenges in producing these projects?
What tips and advice would you give to journalists who want to work on similar projects?
Include relevant links, videos and images.
Authors: Aron Pilhofer, Matt Stiles
Length: 1.5-3 pages per example
UPDATE: Needs doing!
STILL NEED: Guardian, NYT, BBC
EDITOR: Lucy/Kat
Authors: Jonathan Gray, Brian Boyer
Length: 1-3 pages (with links and examples)
Social data services
Overview: An overview of community driven websites which aim to help you find the data you need - such as GetTheData.org and TheDataHub.org - and their function in enabling collaboration around datasets
Authors: Jonathan Gray
Length: 0.5-1 page (with links and examples)
Research data
Overview: An overview of sites to find research data
Authors:
Length: 0.5-1 page (with links and examples)
UPDATE: Great input and notes from Brian Boyer/Chicago Tribune, Jane Park/Creative Commons, John Keefe/WNYC, Chrys Wu/HacksHackers.
STILL NEED: Needs to be written up and expanded.
EDITOR: Friedrich
technical ability, etc. (case study approach with lessons learned from each project presented)
Authors: Steve Doig (Cronkite School of Journalism), Lisa Evans (Guardian), Richard Gordon (Medill School of Journalism), Lizzie Jackson (Ravensbourne College), Amanda Rossi (freelance journalist), JV Chamary (BBC), Fabrizio Scrollini (London School of Economics), Ted Han (DocumentCloud), Claire Miller (Wales Online)
Length: 9 pages
Editor: Liliana Bounegru (European Journalism Centre)
Peer-reviewer:
UPDATE: Input mainly from Steve Doig (Cronkite School of Journalism) and Claire Miller (Wales Online)
STILL NEED: Input from Friedrich Lindenberg on types of errors to look for when working with scraped / extracted / manipulated data
4.2.3 Tools and techniques for cleaning and analysing data
Overview: An overview of different types of tools for analysing and working with datasets, with examples of how they can be used and how journalists have used them.
Authors: Liliana Bounegru, Lucy Chambers, Claire Miller
Length: 1-2 pages per case study
UPDATE: Needs doing!
STILL NEED: Input from Friedrich.
EDITOR: Friedrich.
Overview: Explaining how to find stories in datasets (various approaches), including examples and case studies. Also looking at the broader role of data journalists in the newsroom, how they work with other journalists, etc.
Authors: Caelainn Barr, Claire Miller
Length: 0.5-1 page per approach/case study
UPDATE: Ready for Review
EDITOR: Lucy
Overview:
Roles of visualisation in journalism: what function(s) visualisations play in reportage (what journalists use visualisations for): (1) to find stories, (2) to tell a story.
Tools, tutorials and good examples of using visualisations to find stories
When do you need to visualise a dataset to explore it and find a story? When don't you need to?
How do you go about discovering a story? What tools do you use? What protocol do you follow? What clues do you follow, and what do you pay attention to? (lessons, tips, advice)
Examples of how to explore a dataset with a visualisation tool, with a step-by-step description of the protocol followed to find the story.
Tools, tutorials and good examples of using visualisations to tell stories
When do you need to visualise a story and when don't you need to?
What types of visualisations are good for presenting what types of stories?
How do you go about visualising a story? What tools do you use? What steps do you take? (lessons, tips, advice)
What makes a good visualisation, and what makes a bad one?
Examples of good and bad use of visualisations to tell a story, with an explanation of what makes each a good or bad case.
Note: The aim of this chapter is not to show journalists how to produce a data visualisation, but to explain when a visualisation could be useful in their work, what visualisations could help them with, and how to assess the quality of a visualisation. It should also familiarise them with the vocabulary so they know what to ask designers for, and introduce visualisation tools for non-experts and show how to use them.
Authors: Sarah Cohen (Knight Professor of the Practice of Journalism and Public Policy, Sanford), Geoff McGhee, David Erwin (New York Times), Aron Pilhofer (New York Times), Farida Vis (University of Leicester), Kate Hudson (openjournalism.ca), Lulu Pinney (infographics specialist), Mariano Blejman (Hacks/Hackers)
Length: 1-2 pages per case study
Editor: Liliana Bounegru (European Journalism Centre)
UPDATE: Good start! STILL NEED: Needs expanding and editing, and more examples. EDITOR: Liliana Bounegru (EJC)
4.4.4 Telling Stories through Social Media
UPDATE: Pending content from Luca Dello Iacovo
STILL NEED: Above content
EDITOR: Lucy Chambers
UPDATE: Case study on measuring impact from Sascha Venohr (Zeit Online). Excellent input from Mirko Lorenz (Deutsche Welle) on making the case for data journalism: facts to keep in mind when thinking about sustainability and business models for data journalism. Case studies from Lorenz Matzat (OpenDataCity), Mark Hunter on Kaas & Mulvad (pending permission), Clement Renaud on OWNI (in progress).
STILL NEED: More case studies on sustainability, business models and measuring the impact of data journalism from Guardian, NYT, Chicago Tribune, etc.
EDITOR: Liliana Bounegru (European Journalism Centre)
7. Appendix
7.1 Further resources
Overview: Lists of links, resources, examples and other bits and pieces that don't fit in the handbook
Authors: Everyone!
Length: 5 pages
7.2 Glossary:
Link: 5.2 Glossary
UPDATE: Needs doing!
STILL NEED: Lots of ideas from everyone.
EDITOR: Jonathan