Bucketing Continuous Variables in pandas
Bucketing Continuous Variables in pandas In this post we look at bucketing (also known as binning) continuous data into discrete chunks to be used as ordinal categorical variables. We'll start…
Bucketing Continuous Variables in pandas In this post we look at bucketing (also known as binning) continuous data into discrete chunks to be used as ordinal categorical variables. We'll start…
Parallel Coordinates in Matplotlib In this post we will be exploring the Auto data set from ISLR, which can be found here. Parellel coordinates is a method for exploring the…
Basic Language Processing with NLTKIn this post, we explore some basic text processing using the Natural Language Toolkit (NLTK). We will be grabbing the most popular nouns from a list…
Bar Charts in MatplotlibBar charts are used to display values associated with categorical data. The plt.bar function, however, takes a list of positions and values, the labels for x are…
Scatter Matrices using pandasUsing pandas we can create scatter matrices to easily visualise any trends in our data. Pandas uses matplotlib to display scatter matrices. We start with our imports…
Scatter Charts in MatplotlibWe start by importing matplotlib and display all visuals inline, using the ggplot style sheet. In [1]: import matplotlib.pyplot as plt %matplotlib inline plt.style.use('ggplot') Scatter plots take 2…
Line Charts in MatplotlibWe start by importing matplotlib and display all visuals inline In [1]: import matplotlib.pyplot as plt %matplotlib inline We'll use the ggplot style in this example for aesthetics…
Correlation in PythonCorrelation values range between -1 and 1. There are two key components of a correlation value: magnitude - The larger the magnitude (closer to 1 or -1), the…
Interacting with the Twitter API using pythonTwitter has a RESTful API to retrieve Tweets for certain queries for use in your applications. This post will show how we can authorise…
Djikstra's algorithm is a path-finding algorithm, like those used in routing and navigation. We will be using it to find the shortest path between two nodes in a graph. It…