video games icon

Video Games Python Library

From the CORGIS Dataset Project

By Austin Cory Bart acbart@vt.edu
Version 2.0.0, created 4/13/2017
Tags: video games, games, publishers, console, gaming

Overview

This dataset requires special permission to use.

Originally collected by Dr. Joe Cox, this dataset has information about the sales and playtime of over a thousand video games released between 2004 and 2010. The playtime information was collected from crowd-sourced data on “How Long to Beat”.

Some more information can be found here.

<Cox, Joe. “What makes a blockbuster video game? An empirical analysis of US sales data.” Managerial and Decision Economics 35.3 (2014): 189-198.>

Explore Structure

Each row represents $MISSING_FIELD.

Index Type Example Value
0 dict { }
1 dict (same structure)
2 dict (same structure)
... ... ...
Key Type Example Value Description
"Title" str "Super Mario 64 DS" The full title of this game.
"Features" dict { }
"Metadata" dict { }
"Metrics" dict { }
"Release" dict { }
"Length" dict { }
Key Type Example Value Description
"Handheld?" bool True Whether this is a hand-held game.
"Max Players" int 1 The maximum number of players that can play this game.
"Multiplatform?" bool True Whether this game is available on multiple platforms.
"Online?" bool True Whether this game supports online play.
Key Type Example Value Description
"Genres" str "Action" The list of genres that this game belongs to.
"Licensed?" bool True Whether this game was based off a previously licensed entity.
"Publishers" str "Nintendo" The list of publishers who created this game
"Sequel?" bool True Whether this game is a sequel to another game.
Key Type Example Value Description
"Review Score" int 85 a typical review score for this game, out of 100.
"Sales" float 4.69 The total sales made on this game, measured in millions of dollars.
"Used Price" float 24.95 A typical "used" price for this game (i.e. previously returned and sold), measured in dollars. It is unclear where this number originally came from.
Key Type Example Value Description
"Console" str "Nintendo DS" The name of the console that this particular game was released for. Note that the dataset contains multiple copies of the same game, released for different consoles.
"Rating" str "E" The ESRB rating for this game, either E (for Everyone), T (for Teen), or M (for Mature).
"Re-release?" bool True Whether this game is a re-release of an earlier one.
"Year" int 2004 The year that this game was released.
Key Type Example Value Description
"All PlayStyles" dict { }
"Completionists" dict { }
"Main + Extras" dict { }
"Main Story" dict { }
Key Type Example Value Description
"Average" float 22.716666666666665 The mean time that players reported completing the game in any way, in hours. This is the average over all the other categories.
"Leisure" float 31.9 The slowest time that players reported completing the game in any way, in hours. This is the minimum over all the other categories.
"Median" float 24.483333333333334 The median time that players reported completing the game in any way, in hours. This is the median over all the other categories.
"Polled" int 57 The number of players that reported completing the game in any way. This is the count over all the other categories.
"Rushed" float 14.3 The fastest time that players reported completing the game in any way, in hours. This is the maximum over all the other categories.
Key Type Example Value Description
"Average" float 29.766666666666666 The mean time that players reported completing everything in the game, in hours.
"Leisure" float 35.03333333333333 The slowest time that players reported completing everything in the game, in hours.
"Median" float 30.0 The median time that players reported completing everything in the game, in hours.
"Polled" int 20 The number of players that reported completing everything in the game.
"Rushed" float 22.016666666666666 The fastest time that players reported completing everything in the game, in hours.
Key Type Example Value Description
"Average" float 24.916666666666668 The mean time that players reported completing the main game and major extra parts of the game, in hours.
"Leisure" float 29.966666666666665 The slowest time that players reported completing the main game and major extra parts of the game, in hours.
"Median" float 25.0 The median time that players reported completing the main game and major extra parts of the game, in hours.
"Polled" int 16 The number of players that reported completing the main game and major extra parts of the game.
"Rushed" float 18.333333333333332 The fastest time that players reported completing the main game and major extra parts of the game, in hours.
Key Type Example Value Description
"Average" float 14.333333333333334 The mean time that players reported completing the main game storyline, in hours.
"Leisure" float 18.316666666666666 The slowest time that players reported completing the main game storyline, in hours.
"Median" float 14.5 The median time that players reported completing the main game storyline, in hours.
"Polled" int 21 The number of players that reported completing the main game storyline.
"Rushed" float 9.7 The fastest time that players reported completing the main game storyline, in hours.

Downloads

Download all of the following files.

  1. video_games.py
  2. video_games.data

Usage

import video_games
video_game = video_games.get_video_game()

Documentation

get_video_game()
Returns a list of dictionaries representing video game.