merging tutorial for pandas part 1

Merging and Joining data sets are key activities of any data scientist or analyst. In this tutorial, we explore the process of combining datasets based on common columns quickly and easily with the Python Pandas library and it’s fast merge() functionality. Finally conquer merging and become a master with this 2-part tutorial.

Read More →

Geocode your addresses for free with Python and Google For a recent project, I ported the “batch geocoding in R” script over to Python. The script allows geocoding of large numbers of string addresses to latitude and longitude values using the Google Maps Geocoding API. The Google Geocoding API is one of the most accurate geocoding […]

Read More →

There are multiple ways to select and index rows and columns from Pandas DataFrames. I find tutorials online focusing on advanced selections of row and column choices a little complex for my requirements. Selection Options There’s three main options to achieve the selection and indexing activities in Pandas, which can be confusing. The three selection cases and methods covered in […]

Read More →
How often do you actually get wet going to work? Using pandas, python, and some graphs, we find out.

How often do you get wet cycling to work? Cycling in Ireland is taking off. The DublinBikes scheme is a massive success with over 10 million journeys, there’s large increases in people cycling in Irish cities, there’s a good cyclist community, and infrastructure is slowing improving around the country. However, Ireland is a rainy place! It turns out that […]

Read More →
Cyclist in the rain. Blog about python scraping data from wunderground rainfall data.

The most recent post on this site was an analysis of how often people cycling to work actually get rained on in different cities around the world. You can check it out here. The analysis was completed using data from the Wunderground weather website, Python, specifically the Pandas and Seaborn libraries. In this post, I will […]

Read More →

[Short version] The S3 ingestion script for Amazon applications provided by Logentries will not work for the gzip compressed log files produced by the Elastic Beanstalk log rotation system. A slightly edited script will work instead and can be found on Github here.[/Short Version]   Logentries is a brilliant startup originating here in Dublin for collecting […]

Read More →

This is a very quick post to help some people out on installation problems with Office for Mac 2016. On excited installation of Excel 2016 on my Macbook, the following error threatened to ruin the day: “An unknown error has occurred, the error code is: 0xD0000006” Seemingly unfound on the internet, the solution, oddly enough was to ensure […]

Read More →