giCentre bookScraper visualization

giCentre » msc courses information science school of informatics city university london

This prototype application was developed with colleagues from The Times to show how interactive visualization of the bookScraper collection may help us identify trends and anomalies in the data set. Doing so may give us ideas about the collection as we aim to explore the data and gain insights from it.

The application is designed to support analysis of the titles according to their content and the use of particularly important words and how this compares between authors. It is indicative of the kind of visualization work that we do at the giCentre.

Vocabulary TreeMap

Our interactive 'treemap' allows us to explore the 100 most important words in each of the publications. These words are represented by rectangles sized according to the number of times they occur in a document.

The largest rectangles occur to the top left showing the authors who use the most words, the books with most words for each author and the most popular words in each book at different levels of the hierarchy. Publications with more words cover a larger area - as do authors with larger total word counts.

« Click the image to run the application.

Visualization developed by Dr. Aidan Slingsby
with Dr. Jason Dykes, Dr. Jo Wood, Jonathan Richards and Julian Burgess


You can learn more about processing, visualizing and using digital information through our Masters programmes and modules available through flexible modes of study.