Last month The Data Lab was very pleased to partner with data visualisation guru Andy Kirk on a one-day workshop at the G&V Royal Mile Hotel in Edinburgh, as part of our mission to bring leading data experts to the Scottish community. My colleague Caterina Constantinescu and I were in attendance, along with around forty […]
Data-Driven Design of Abstract Art
Post by Matthew Higgs, Data scientist at The Data Lab An Example of Negative-Time Evaluation in The Intention Economy Motivation Every prototype you build is an investment of time and/or money, and it’s only when you start getting feedback on a prototype that you can start to assess whether it was a good/bad investment. What […]
Martina Pugliese, Data Science Lead at Mallzee
In this interview we welcome Martina Pugliese, Data Science Lead at the “Tinder for clothes” app company Mallzee.
ScotlandIS Digital Technology Awards 2018
Application is now open for the Digital Technology Awards 2018! The Digital Tech Awards on 26 April always attract fierce competition from all corners of our industry and provide an amazing opportunity to showcase your company. Be in with a chance to win one of their coveted awards. The Data Lab are proud to be […]
Running R remotely: some options and tips
Why would you need to do this? Say, for instance, you are dealing with sensitive data that should not leave a specific system, or quite simply that you are away on a work retreat – but your laptop is far less powerful than your work desktop computer which you left behind – so you want […]
Excel-like functionality with Python pandas: The Data Lab takes the Pepsi Challenge!
Happy Birthday Excel! I would posit that the world’s most used data science software is the ubiquitous Microsoft Excel. Released for Windows in November 1987, this month marks its 30th anniversary. In that time I’d imagine it has been employed by all manner of people across near all industries: from the fund manager tracking his […]
Analysis of Gaelic Station Names: An exploration of inter-language similarity measures for place-names and the design of rural scores.
Motivation Most of modern Scotland was once Gaelic-speaking and a policy change in 2010 means Gaelic names appear alongside English names on almost all station signs across Scotland’s railway. I live in Glasgow and often travel out into the highlands and over time I hypothesised: H1: The Gaelic and English names of a station become more similar […]
Snakes and Ladders (Part 3 of 3): Analysing the classic children’s game
To recap the analysis from our previous article, we have now shown that the advantage to Player 1 in snakes and ladders is minimal (amounting to less than 6 extra wins out of every 1,000 games). In this post we look at visualising some results, focusing in particular on the distribution of game lengths and the […]
Dealing with many dimensions in historical data: Tracking cooperation & conflict patterns over space and time in R
For this post, I’ve managed to find some extremely interesting historical event data offered by the Cline Center on this page. As you will see, this dataset can be quite challenging because of the sheer number of dimensions you could look at. With so many options, it becomes tricky to create visualisations with the ‘right’ level of granularity: […]
Matt Higgs, Data Scientist at The Data Lab
First in a new series of Interviews with Data Scientists, we speak to Matt Higgs of The Data Lab.


