This post provides an introduction to “word embeddings” or “word vectors”. Word embeddings are real-number vectors that represent words from a vocabulary, and have broad applications in the area of natural language processing (NLP). We examine training, use, and properties of word embeddings models, and look at how and why you should look to use word embeddings over older bag-of-words techniques in your data science and language modelling tasks.
In this post, geocoded data for all property price sales in Ireland from 2012-2017 is available. Data is sourced on the Irish Property Price Register and geocoded using the Google geocoding script in Python. All of the GPS latitude/longitude coordinates are further tied to census small area and electoral division boundaries.
A lot of modern-day games, especially the ones being developed for mobile, are built on business models revolving around data. Understanding how the audience thinks and responds with a product, as well as knowing how retention works in gaming, are both important in paving the way for the future of gaming.
Merging and Joining data sets are key activities of any data scientist or analyst. In this tutorial, we explore the process of combining datasets based on common columns quickly and easily with the Python Pandas library and it’s fast merge() functionality. Finally conquer merging and become a master with this 2-part tutorial.