python

Amazon Elastic Beanstalk – Logging to Logentries from Python Application

[Short version] The S3 ingestion script for Amazon applications provided by Logentries will not work for the gzip compressed log files produced by the Elastic Beanstalk log rotation system. A slightly edited script will work instead and can be found on Github here.[/Short Version]   Logentries is a brilliant startup originating here in Dublin for collecting …

Amazon Elastic Beanstalk – Logging to Logentries from Python Application Read More »

Summarising, Aggregating, and Grouping data in Python Pandas

Pandas – Python Data Analysis Library I’ve recently started using Python’s excellent Pandas library as a data analysis tool, and, while finding the transition from R’s excellent data.table library frustrating at times, I’m finding my way around and finding most things work quite well. One aspect that I’ve recently been exploring is the task of …

Summarising, Aggregating, and Grouping data in Python Pandas Read More »

Parallel programming allows you to speed up your code execution - very useful for data science and data processing

Using Python Threading and Returning Multiple Results (Tutorial)

Threading in Python is simple. It allows you to manage concurrent threads doing work at the same time. The library is called “threading”, you create “Thread” objects, and they run target functions for you. You can start potentially hundreds of threads that will operate in parallel. Speed up long running tasks by parallelising and threading computation where you can.

Scraping Dublin City Bikes Data Using Python

FAST TRACK: There is some python code that allows you to scrape bike availability from bike schemes at the bottom of this post… SLOW TRACK: As a recent aside, I was interested in collecting Dublin Bikes usage data over a long time period for data visualisation and exploration purposes. The Dublinbikes scheme was launched in …

Scraping Dublin City Bikes Data Using Python Read More »