## K-means Clustering in Python

K-means clustering is an algorithm that aims to partition $n$ observations into $k$ clusters. There are 3 steps: Initialisation - $k$ initial "means" (centroids) are generated…
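As a rough illustration of the idea, here is a minimal pure-Python sketch. The excerpt only names the initialisation step, so the assignment and update steps below follow the standard Lloyd's algorithm and are an assumption about what the full post covers. The function name `kmeans`, its parameters, and the sample points are hypothetical.

```python
import random


def kmeans(points, k, iterations=10, seed=0):
    """Sketch of k-means on a list of equal-length numeric tuples."""
    rng = random.Random(seed)
    # Initialisation: pick k distinct observations as the starting centroids
    centroids = rng.sample(points, k)
    clusters = []
    for _ in range(iterations):
        # Assignment (assumed step): attach each point to its nearest centroid
        clusters = [[] for _ in range(k)]
        for p in points:
            distances = [
                sum((a - b) ** 2 for a, b in zip(p, c)) for c in centroids
            ]
            clusters[distances.index(min(distances))].append(p)
        # Update (assumed step): move each centroid to the mean of its cluster
        for i, cluster in enumerate(clusters):
            if cluster:  # an empty cluster keeps its previous centroid
                centroids[i] = tuple(
                    sum(dim) / len(cluster) for dim in zip(*cluster)
                )
    return centroids, clusters


# Hypothetical sample data: two well-separated groups of three points
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 0.5), (8.0, 8.0), (9.0, 9.5), (8.5, 9.0)]
centroids, clusters = kmeans(points, k=2)
```

A fixed iteration count keeps the sketch short; a fuller version would probably stop once the centroids no longer move.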

## Isomap for Dimensionality Reduction in Python

• Post category:Python

Isomap (Isometric Feature Mapping), unlike Principal Component Analysis, is a non-linear feature reduction method. We will explore the data set used by the original authors…

## Principal Component Analysis in Python

• Post category:Python

Principal component analysis (PCA) is an unsupervised statistical technique used for dimensionality reduction. It turns possibly correlated features into a set of linearly uncorrelated…
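To make the "correlated in, uncorrelated out" claim concrete, here is a small sketch using NumPy's singular value decomposition. The synthetic data and all variable names are assumptions for illustration, and SVD is only one common way to compute PCA; it is not necessarily the approach the full post takes.

```python
import numpy as np

# Hypothetical data: the second feature is strongly correlated with the first
rng = np.random.default_rng(0)
x = rng.normal(size=200)
data = np.column_stack([x, 2 * x + rng.normal(scale=0.5, size=200)])

# Centre each feature, then decompose; the rows of vt are the principal directions
centered = data - data.mean(axis=0)
u, s, vt = np.linalg.svd(centered, full_matrices=False)

# Project the centred data onto the principal directions
components = centered @ vt.T

# Variance captured by each component, largest first
explained_variance = s ** 2 / (len(data) - 1)

# Off-diagonal entries of this covariance matrix are numerically zero
component_cov = np.cov(components, rowvar=False)
```

Keeping only the first column of `components` would reduce the data to one dimension while retaining most of its variance.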

## Fibonacci Implementation in Python

• Post category:Python

In this post, we're going to take a look at how to implement the Fibonacci sequence in Python. The Fibonacci sequence is the series of numbers you…
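One possible implementation is a simple iterative loop, sketched below. The function name `fibonacci` and the choice to start the sequence at 0 and 1 are assumptions; some definitions start at 1 and 1 instead.

```python
def fibonacci(n):
    """Return the first n Fibonacci numbers, assuming the sequence starts 0, 1."""
    sequence = []
    a, b = 0, 1
    for _ in range(n):
        sequence.append(a)
        # Each new term is the sum of the two before it
        a, b = b, a + b
    return sequence


print(fibonacci(10))  # [0, 1, 1, 2, 3, 5, 8, 13, 21, 34]
```

The iterative form runs in linear time, whereas a naive recursive version recomputes the same terms many times over.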

## Bucketing Continuous Variables in pandas

• Post category:Python

In this post we look at bucketing (also known as binning) continuous data into discrete chunks to be used as ordinal categorical variables. We'll start…
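As a quick sketch of what bucketing can look like, the example below uses `pd.cut` to map a numeric column onto labelled, ordered bins. The ages, bin edges, and labels are made up for illustration, and the full post may use a different function or data set.

```python
import pandas as pd

# Hypothetical continuous data
ages = pd.Series([3, 17, 25, 42, 68, 90])

# Bin edges define the intervals (0, 12], (12, 19], (19, 64], (64, 120]
bins = [0, 12, 19, 64, 120]
labels = ["child", "teen", "adult", "senior"]

# pd.cut returns an ordered categorical, so the buckets can be compared and sorted
age_group = pd.cut(ages, bins=bins, labels=labels)
print(age_group.tolist())
```

Because the result is an ordered categorical, comparisons such as `age_group > "teen"` respect the bucket order rather than alphabetical order.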

## Parallel Coordinates in Matplotlib

• Post category:Python

In this post we will be exploring the Auto data set from ISLR, which can be found here. Parallel coordinates is a method for exploring the…

## Basic Language Processing with NLTK

• Post category:Python

In this post, we explore some basic text processing using the Natural Language Toolkit (NLTK). We will be grabbing the most popular nouns from a list…

## Bar Charts in Matplotlib

• Post category:Python

Bar charts are used to display values associated with categorical data. The `plt.bar` function, however, takes a list of positions and values; the labels for x are…

## Scatter Matrices using pandas

• Post category:Python

Using pandas, we can create scatter matrices to easily visualise any trends in our data. Pandas uses matplotlib to display scatter matrices. We start with our imports…

## Scatter Charts in Matplotlib

• Post category:Python

We start by importing matplotlib and displaying all visuals inline, using the ggplot style sheet: `import matplotlib.pyplot as plt`, `%matplotlib inline`, `plt.style.use('ggplot')`. Scatter plots take 2…