datascience

Python Pandas read_csv – Load Data from CSV Files

CSV (comma-separated value) files are a common file format for transferring and storing data. The ability to read, manipulate, and write data to and from CSV files using Python is a key skill to master for any data scientist or business analysis. In this post, we’ll go over what CSV files are, how to read CSV files into Pandas DataFrames, and how to write DataFrames back to CSV files post analysis.

Using iloc, loc, & ix to select rows and columns in Pandas DataFrames

Pandas Data Selection There are multiple ways to select and index rows and columns from Pandas DataFrames. I find tutorials online focusing on advanced selections of row and column choices a little complex for my requirements. Selection Options There’s three main options to achieve the selection and indexing activities in Pandas, which can be confusing. The three selection cases and …

Using iloc, loc, & ix to select rows and columns in Pandas DataFramesRead More »

Summarising, Aggregating, and Grouping data in Python Pandas

Pandas – Python Data Analysis Library I’ve recently started using Python’s excellent Pandas library as a data analysis tool, and, while finding the transition from R’s excellent data.table library frustrating at times, I’m finding my way around and finding most things work quite well. One aspect that I’ve recently been exploring is the task of …

Summarising, Aggregating, and Grouping data in Python PandasRead More »