Analyzing IMDb Top Celebrities Collaborative Relationship

{% assign lvl = page.url | append:‘X’ | split:'/' | size %} {% capture relative %}{% for i in (3..lvl) %}../{% endfor %}{% endcapture %}

Overview

As an international student in Canada, I found I am only familiar with a few number of TV show/movie stars. As a result, I decided to write a post concerning the relationship between tv/movie celebrities. Instead of focusing on the rumors or hearsay on gossip magzines, this post is more about working relationships and career achievements.

I get the top 200 celebrities from IMDb Most Popular Females/Males on Aug 2017. This ranking is based on IMDb STARmeter which does not mean the acting skills of the stars but the level of public interest in the person. The working relationships are extracted from IMDb Datasets located in the AWS S3 bucket.

All source code has been uploaded to this github repository.

Strongly connected celebrities

Working relationship can be easily demonstrated by graphs. In this post, I use D3.js and Typescript to draw interactively Force-Directed Graph. D3.js is a really cool visualization tool which can be hosted in github.io with Jekyll Blog. while learning Javascript, I found Typescript is rather interesting. It provides not only plenty of object-oriented syntactic sugars, but also static analysis which helps me learn D3.js faster.

In the following graph, I tried to find strongly connected celebrities' groups by applying a classic community detection algorithm - Louvain Method. This algorithm divides the top celebrities into groups such that each set of person is densely connected internally and sparsely connected between groups.

TV stars’relationship network: (You can click-and-drag the nodes around. The name of celebrities should emerge when hovering over the circles. Different groups are distinguished by different color. The radius of circles stand for their number of connections.)

For TV show stars, there are a few densely connected groups:

Game of Throne:

  • Emilia Clarke
  • Nikolaj Coster-Waldau
  • Peter Dinklage
  • Lena Headey
  • Kit Harington
  • Aidan Gillen
  • Rose Leslie
  • Sophie Turner
  • Natalie Dormer

Vikings:

  • Katheryn Winnick
  • Travis Fimmel

Riverdale:

  • Cole Sprouse
  • Mädchen Amick

Big Little Lies:

  • Shailene Woodley
  • Reese Witherspoon
  • Shailene Woodley

Guardians of the Galaxy:

  • Chris Pratt
  • Zoe Saldana

Movie stars’relationship network:

For movie stars, they are more likely to collaborate with varous of people.

Centers Among Stars

In this section, I am going to compare the “degree” of top celebrities.

Reference

Yuhui Lin
Yuhui Lin
Software Engineer
Next
Previous