International Pop Music

A Social Network Visualization

Python (Pandas, Spotipy) + Gephi

View Code Here

For this short project, I explored the relationships between the Spotify Top 50 playlists in countries around the world through social network analysis. I did this with the goal of better understanding relationships in international musical tastes.

I used Spotify’s API and the Spotipy module to get the top charts for each country where Spotify is available. I cleaned and compiled the data in Python, and used Gephi for analysis and visualization.

This data is from 06/18/20, and the exact relationships below almost certainly do not hold on other days. In the future, I hope to sample more days in order to paint a more complete picture.

  • Each edge (the line connecting two nodes) represents at least one shared song on the two countries’ Spotify Top 50s.
  • The thickness of an edge is proportional to the number of songs the two countries had in common, ranging from 1 to 48.
  • Text size is proportional to a node’s degree, or number of connections a node has to other nodes.
  • Countries that are the same color have been grouped into the same community using Louvain Community Detection. Essentially, these countries have the most in common with each other.
  • Those in the spaghetti-like middle are highly connected to other countries. I know its a bit of a mess so I’d appreciate any advice how to clean that up!

Some Take-aways:

  • Malta stands out as being unexpectedly dissimilar from other countries. Despite the data being from June, many of its Top 50 were Christmas songs.
  • The Spanish-speaking world is extremely musically connected.
  • Countries that do not widely use the Latin Alphabet tend to be less connected.