In this post, geocoded data for all property price sales in Ireland from 2012-2017 is available. Data is sourced on the Irish Property Price Register and geocoded using the Google geocoding script in Python. All of the GPS latitude/longitude coordinates are further tied to census small area and electoral division boundaries.

Read More →
merging tutorial for pandas part 1

Merging and Joining data sets are key activities of any data scientist or analyst. In this tutorial, we explore the process of combining datasets based on common columns quickly and easily with the Python Pandas library and it’s fast merge() functionality. Finally conquer merging and become a master with this 2-part tutorial.

Read More →

Geocode your addresses for free with Python and Google For a recent project, I ported the “batch geocoding in R” script over to Python. The script allows geocoding of large numbers of string addresses to latitude and longitude values using the Google Maps Geocoding API. The Google Geocoding API is one of the most accurate geocoding […]

Read More →

Pandas Data Selection There are multiple ways to select and index rows and columns from Pandas DataFrames. I find tutorials online focusing on advanced selections of row and column choices a little complex for my requirements. Selection Options There’s three main options to achieve the selection and indexing activities in Pandas, which can be confusing. The three selection cases and […]

Read More →
How often do you actually get wet going to work? Using pandas, python, and some graphs, we find out.

How often do you get wet cycling to work? Cycling in Ireland is taking off. The DublinBikes scheme is a massive success with over 10 million journeys, there’s large increases in people cycling in Irish cities, there’s a good cyclist community, and infrastructure is slowing improving around the country. However, Ireland is a rainy place! It turns out that […]

Read More →
Cyclist in the rain. Blog about python scraping data from wunderground rainfall data.

The most recent post on this site was an analysis of how often people cycling to work actually get rained on in different cities around the world. You can check it out here. The analysis was completed using data from the Wunderground weather website, Python, specifically the Pandas and Seaborn libraries. In this post, I will […]

Read More →

[Short version] The S3 ingestion script for Amazon applications provided by Logentries will not work for the gzip compressed log files produced by the Elastic Beanstalk log rotation system. A slightly edited script will work instead and can be found on Github here.[/Short Version]   Logentries is a brilliant startup originating here in Dublin for collecting […]

Read More →