Analysis
There are four CSV files, one on characters, species, planets, starships and vehicles. Now you are not going to be doing any ground breaking statistical work here as the context of these data sets are pretty niche to die hard Star Wars fans. Like, I'm not sure who will care that the Bantha-II cargo skiff has a one day supply of consumables. None the less these are good data sets to be used for basic stats (finding mean, standard deviation, correlation etc). You can definitely find many attributes that are categorical as well. One thing I did noticed is that with most of the sets there was always one or two things that could be used to talk about outliers. Like Jabba the Hutt in the Character's dataset or the rotational period of planets in the planet data set
Sample Questions
- When you consider the length of a vehicle compared to the number of crew it holds, are there any outliers?
- What is the standard deviation of the _______ attribute in the _______ data set?
- Find your favourite character. Pick and attribute and describe how your character compares to the others.
May the fourth be with you! Who has the most lines in the original Star Wars trilogy and what are their 20 top words?#dataviz #MayThe4thBeWithYou #MayTheFourthBeWithYou pic.twitter.com/WarvwX2XOf
— Neil Kaye (@neilrkaye) May 4, 2021
Downloads
- Original Data - https://www.kaggle.com/jsphyg/star-wars
- Entire folder
- Characters (CSV, Google Sheets, CODAP)
- Species (CSV, Google Sheets, CODAP)
- Planets (CSV, Google Sheets, CODAP)
- Starships (CSV, Google Sheets, CODAP)
- Vehicles (CSV, Google Sheets, CODAP)
No comments:
Post a Comment