vrijdag 7 juli 2017

Tanzania data journalism training


Tanzania Media Foundation
Twitter: @newTMF

Training Tanzania's next Data Journalism Leaders

TMF and EJC continue to deepen their commitment to data journalism in Tanzania by implementing their first workshop targeting the country's need for data journalism trainers. Kicking off 17 July in Dar es Salaam and led by former Utrecht professor, Peter Verweij, the programme features sessions led by participants themselves and dives deep into the local context of controversial statistics and cybercrime laws recently passed by the government.

During the workshop, the newly published Swahili language version of EJC's Data Journalism Handbook is being released and distributed to journalists across Tanzania and Kenya both digitally and hardcopy.

donderdag 6 juli 2017


Are you still using Whatsapp? That is old school and there are good reasons to switch to another service. Telegram Messenger for example. After Facebook bought Whatsapp, the chances that your privacy was endangered, were increasing. Of course Facebook created a possibility for users to opt out, that is Facebook had no access to the whatsapp data…. For the moment. The first reason for me to look into Telegram was, that it is completely cloud based. You login to your Telegram account with a mobile phone, a tablet, or your laptop, using an app or just the browser. Secondly Telegram offers more possibilities for including files to your message. Not only pics but audio, video, or text files can be attached. Nice but that was not compelling for me to switch. That was the possibility or using bots and also creating your own bot.

How does it work? Installing the messenger service is a piece of cake. Download the app and subscribe. A pin code is end to your phone to login; done. In your contact you can see who is already using Telegram. Others you can send an invitation.

maandag 26 juni 2017


Creating interactive graphics is vital to data journalism stories. In my first blog post on this subject I explored the possibilities of D3, .JavaScript , R and plotly. If you want to avoid D3  and JavaScript completely and only  make use of Python, plotly has developed an interesting new library called Dash. I have been digging into this possibility using data about Dutch municipalities.
From an analysis in R I know that there is correlation between the value of the houses  and the average income for municipalities. Checking the partial correlation  and using political party as an intervening variable, the correlation does not change dramatically. Can we produce an interactive graph showing this conclusion?

vrijdag 16 juni 2017


Working with the new version of Tableau Public 10.3 makes working with data a lot easier.  Here are some of the most important improvements:
  1. Pdf are always a pain in the rear and cracking the file can sometimes be hard, using one of the web services like Pdf to Excel, or using Tabula. Now Tableau is able to open pdf's and connect them immediately to a worksheet.
  2. If you don't want to work with Excel, there is always Google sheets. But getting the sheets into Tableau, exporting to an .xls format was needed. Now we can import Google sheets directly in Tableau.
  3. Excel has its limitations for statistical analysis. R has much more tools under the hood, but making visualization is limited, especially for online. Starting Tableau 10.3 .Rdata can directly imported into the worksheets of Tableau.
  4. Making maps with Tableau had important limitations because one had to rely on the maps provided bu Tableau. My solutions was to produce the map in QGIS and export the map to Google FT(Fusion Tables). And here it is: Tableau reads the shape files(.shp) and makes beautiful maps. Adding data to the map is now problem: choose of 4 different database joins between your map and your data.

zaterdag 10 juni 2017


 Since the beginnings of data journalism in the nineties of the last century, then called CARR or Computer Assisted Research and Reporting, techniques for  analyzing and visualizing data have improved enormously. One of the central tools in te nineties was the spreadsheet, standardized by Microsoft Excel. Spreadsheets are still much used for analysis though moving into the area of advanced data journalism: using for example R for deeper statistical analysis or D3 for creating better interactive graphics creates various new challenges. Then you often will engage in different types of coding: I got struck between Python (for R) or JavaScript (for D3). Does a data journalists need to learn all these programming languages or is there an easier and faster solution?
Looking at journalism practice the answer is:  step on the steep learning  curve and start with learning how to code. Here is some help. Paul Bradshaw starts next year an MA in Data Journalism at the Birmingham School of Media. Studying   Coding and computational thinking being applied journalistic ally (I cover using JavaScript, R, and Python, command line, SQL and Regex to pursue stories)” is one of the elements of this new MA, writes Bradshaw on his blog.
Looking into the market, there is really demand for data journalist with coding skills. Here is a job listing from the Economist. One of the preferred qualities include: A good understanding of data analytics and Coding skills (JavaScript and Python), or a background in data journalism, are a plus.
In the following I will argue that a basic understanding of coding is very helpful, but new services on the web help data journalists to avoid getting stuck in coding.

dinsdag 16 mei 2017


Playing around with scalable vector graphics (.svg), that is text files describing shapes like cubes, rectangles etc. Important for creating charts using D3, data driven documents. For manipulating .svg I use Inkscape, an open source alterbative for Adobe Illustrator. Available for Windows, Mac and Linux at: https://inkscape.org/en/

dinsdag 9 mei 2017



Bought a 32 GB USB drive….that is a complete hard drive! In stead of dual booting Linux from the hard drive, you can boot from the USB in persistent mode, and saving your work and settings. In reading/writing or booting speed is not really different from dual booting. This Philips USB 3.0 Circle 32 GB (compatible with USB 2.0) reads at 55MB/s and writes 10Mb/s. In order to speed up you can run the whole thing in RAM.


Booting from USB is with UEFI a bit more difficult. Generally Ubuntu live start from UEFI with secure boot on. However installing a module in the kernel that is not signed (for example bcmwl-kernel-source, for wireless) is blocked. Let’s disable secure boot. Perfect working, however if you want to start a persistent live Ubuntu then you need to start from UEFI with CSM, because the Ubuntu .iso is a hybrid, and Compatibility Support Module (CSM) provides legacy BIOS compatibility. 

So start up the system with: disable secure boot and fast boot, and enable UEFI with CSM.



To make the USB booting persistent use mkusb. Choose the image, select the drive, tag UEFI and persistent. However when working on HP, mine is HP Elite 820, choose MSDOS partition table not the GUID partition table (GPT), which is standard for UEFI, because HP likes MSDOS better, it seems.

Now I am not only booting into Ubuntu from a portable hard drive, which I can carry along and use it on any machine.