Word Cloud from a Pandas DataFrame in Python

A word cloud is a data visualization technique that shows the most used words in large font and the least used words in small font. It helps to get an idea about your text data, especially when working on problems based on natural language processing. If you want to learn how to visualize a word cloud from a pandas DataFrame, this article is for you. In this article, I will present a tutorial on how to visualize a word cloud from a pandas DataFrame in Python.

Word Cloud from a Pandas DataFrame in Python

A pandas DataFrame is used to store the data that you use when working on a data science task. Sometimes your dataset contains a column with textual information such as opinions or reviews of people about a product. To understand how most people think about the product, you can visualize a word cloud of that column. So to view a word cloud from a pandas DataFrame in Python, you need to have the wordcloud library installed in your Python environment. You can install this Python library using the pip command as mentioned below:

  • pip install wordcloud

Now let’s see how to visualize a word cloud from a pandas DataFrame in Python. For this task, I will first import all the necessary Python libraries and a dataset with textual information:

from wordcloud import WordCloud
from wordcloud import ImageColorGenerator
from wordcloud import STOPWORDS
import matplotlib.pyplot as plt
import pandas as pd
data = pd.read_csv("https://raw.githubusercontent.com/amankharwal/Website-data/master/spam.csv")
print(data.head())
view raw word cloud.py hosted with ❤ by GitHub
  label                                               text
0   ham  Go until jurong point, crazy.. Available only ...
1   ham                      Ok lar... Joking wif u oni...
2  spam  Free entry in 2 a wkly comp to win FA Cup fina...
3   ham  U dun say so early hor... U c already then say...
4   ham  Nah I don't think he goes to usf, he lives aro...

There are only two columns in this dataset where the text column contains textual data. So below is how you can visualize a word cloud from the text column of this dataset using Python:

text = " ".join(i for i in data.text)
stopwords = set(STOPWORDS)
wordcloud = WordCloud(stopwords=stopwords, background_color="white").generate(text)
plt.figure( figsize=(15,10))
plt.imshow(wordcloud, interpolation='bilinear')
plt.axis("off")
plt.show()
view raw word cloud2.py hosted with ❤ by GitHub
Word Cloud from a Pandas DataFrame

So this is how you can easily visualize a word cloud from any column of your dataset using Python.

Summary

Sometimes your dataset contains a column with textual information such as opinions or reviews of people on a certain product. To understand how most people think about the product, you can visualize a word cloud of that column. I hope you liked this article on how to visualize a word cloud from a DataFrame in Python. Feel free to ask your valuable questions in the comments section below.

Default image
Aman Kharwal
Coder with the ♥️ of a Writer || Data Scientist | Solopreneur | Founder
Articles: 1170

Leave a Reply