• Visual Explorations of Sample Size

     

    Drawing conclusion based on small samples is obviously problematic. At the same time, I also wonder whether the rise to prominence of "Big Data" can lead organisations to blindly collect as much data as possible rather than think logically about how…

  • Aspects of Datasets - Part 2

    This is the second (and final) article looking at key aspects of datasets. Having previously covered relevance, accuracy, and precision, here we will consider consistency, completeness and size.

    Consistency

    On the 23rd of September 1999, NASA's Mars Climate…

  • Too Big Data: Coping with Overplotting

    Scatter plots are a wonderful way of showing (apparent) relationships in bivariate data. Patterns and clusters that you wouldn't see in a huge block of data in a table can become instantly visible on a page or screen. With all the hype around Big Data…