I recently had an issue with a long-running web process that I needed to substantially speed up due to timeouts. The delay arose because the system needed to fetch data from a number of URLs. The total number of URLs varied from user to user, and the response time for each URL was quite long (circa […]

Read More →

This post is about creating Python Flask web pages that your Flask application can update asynchronously at any point, without any user interaction. We’ll be using Python Flask and the Flask-SocketIO plug-in to achieve this. In short, the final result is hosted on GitHub. What I want to achieve here is a […]

Read More →

FAST TRACK: There is some Python code at the bottom of this post that allows you to scrape bike availability from bike schemes… SLOW TRACK: As a recent aside, I was interested in collecting Dublin Bikes usage data over a long time period for data visualisation and exploration purposes. The Dublinbikes scheme was launched in […]

Read More →

Self-Organising Maps (SOMs) are an unsupervised data visualisation technique that can be used to visualise high-dimensional data sets in lower-dimensional (typically two-dimensional) representations. In this post, we examine the use of R to create a SOM for customer segmentation. The figures shown here use the 2011 Irish Census information for the greater Dublin […]

Read More →

“Is there any online reading or courses I can do to get into data analysis?” At my workplace, I often get asked the question above. It is usually posed by people with a finance background who are working as management consultants. In this post I propose a learning path for such people to “get […]

Read More →

The Web Summit, Europe’s largest technology-industry conference, was held in Dublin this week. An annual event since 2010, the Web Summit attracted over ten thousand visitors from over 90 countries. The Web Summit puts Ireland on the international startup and internet scene. With speakers like Elon Musk (founder of PayPal, SpaceX, and Tesla) and representatives from […]

Read More →

As a practicing data scientist, I regularly need to present and distribute the results of analyses, or to provide descriptive statistics on data sets. Ideally these results can be presented in the form of interactive graphics, standalone applications, or continually updating dashboards. There is a range of excellent dashboard- and visualisation-building software […]

Read More →

Last weekend, I had the pleasure of attending a Dublin startup weekend called “Dublin Launch48”. The premise of the weekend was to develop good business ideas from inception to realisation over the course of 48 hours. At the end of the weekend, you (and your newly formed team) have to present your work, explain how you came […]

Read More →