Scatter Plots are one of the most powerful and widely used techniques for the task of data analysis and visualization. In this article, I’ll explain how to plot a scatter plot with the Python programming language by using Matplotlib.
Introduction to Scatter Plots
Large amounts of multidimensional data occur in many important application areas such as analyzing telephone service usage, sales, and monitoring server performance. A Data Analyst wants to know how much one attribute in the dataset is affected by another. In these type of cases, the scatter plot is one of the most powerful tools for data analysis.
It is still true that scatter plots are one of the most common ways to visualize multidimensional data. By using a scatter plot, we can identify the relationship between two attributes, groups of points and outliers.
Scatter Plot with Python using Matplotlib
Now in this section, I will take you through how to plot a scatter plot with Python by using Matplotlib. I will first use numerical data generated by using Numpy to plot a scatter plot and then I will use a real-time dataset to plot a scatter plot with Python.
The code below explains how to draw a random scatter plot with points of many colours and sizes. To avoid the overlapping results, I’ll also use the alpha keyword to increase the transparency level:
Now let’s analyse on a realtime data. So in the code below, I will use the housing prices and I will plot the data to analyze the median prices and the population. The idea is to visualize the densely populated areas with bigger circles and the areas with high prices with darker circles and vice versa:
So I hope you liked this article on how to visualize scatter plots with Python programming language by using the matplotlib library. Feel free to ask your valuable questions in the comments section below.