Visualizing book production

Applying data viz techniques to study how a project evolves over time

Data visualization is one of the hot topics of the last year or two. So what does this offer publishing and book production?

Open data activists in particular have been lobbying governments for access to databases which they use to create infographics and visualisations for campaigns. It’s not a new science of course, it was here long before the net (for some background on contemporary practice see the wonderful books by Edward Tufte) but the net is made of data and a good mechanism for transporting it. The net is a good medium for scraping and re-presenting data in more palatable forms.

Prior to the net visualising book production has been tricky since the information either wasn’t recorded or is embedded in ungainly ‘record changes’ data in word processing files. So the few examples were preceded with some forensic historical research. One pre-Word example is the wonderful Traces by Processing inventor Ben Fry. Traces visualises the production of The Origin of Species over time to illustrate Darwin’s evolving thesis. Rather appropriately it is a  nice visualisation of the evolution of a book.

This is where the web steps in and changes the game. Online book production platforms enable you to store and retrieve historical data and use it as you like. You can record and access information quite easily. Information such as who is actively working on a book and when, how much they changed, what they changed, who else was online, word counts over time, is all available. If we could access and process this information in chunks it could perhaps help us to make books better.

Just to show you where this might go, the following are simple prototypes Juan Barquero and I put together using real data from the online book production platform Booktype. The nice thing about Booktype is that it already has all this data recorded in the history for each book so we could write a visualisation application and then look back over the history of many books. So we made a simple API (Application Programming Interface) and Juan put together a few demos using the JavaScript visusalisation library D3JS. The following are some images taken from these trials:

channel1

circles5

And you can try some if these prototypes here (all using real book production data):

  1. Visualisation of book production over time. Clicking around changes the zoom level. Clicking on names removes them from the timeline.
  2. These graphics show the activity of contributors. Clicking on the first graph shows the breakdown of activities by a specific contributor.
  3. Very basic remapping of the same data as #2 above. With this example you can sort by actions or contributor  Clicking on the graph zooms in.

These are just a few simple and raw examples that are of course very far away from being production-ready. However they serve as interesting prototypes for thinking about how this might look and what we might learn from such techniques in a production environment.

tags: , , , , , , , , ,