music icon

Music CSV File

From the CORGIS Dataset Project

By Ryan Whitcomb
Version 2.0.0, created 5-18-16
Tags: music, songs, artists, creativity, media


This library comes from the Million Song Dataset, which used a company called the Echo Nest to derive data points about one million popular contemporary songs. The Million Song Dataset is a collaboration between the Echo Nest and LabROSA, a laboratory working towards intelligent machine listening. The project was also funded in part by the National Science Foundation of America (NSF) to provide a large data set to evaluate research related to algorithms on a commercial size while promoting further research into the Music Information Retrieval field. The data contains standard information about the songs such as artist name, title, and year released. Additionally, the data contains more advanced information; for example, the length of the song, how many musical bars long the song is, and how long the fade in to the song was.

<Thierry Bertin-Mahieux, Daniel P.W. Ellis, Brian Whitman, and Paul Lamere.
The Million Song Dataset. In Proceedings of the 12th International Society
for Music Information Retrieval Conference (ISMIR 2011), 2011.>


Download the following file:

Key Descriptions

Key List of... Comment Example Value
artist.familiarity Float A measure of 0..1 for how familiar the artist is to listeners. 0.581793766
artist.hotttnesss Float A measure of the artists's popularity, when downloaded (in December 2010). Measured on a scale of 0 to 1. 0.401997543 String A unique ID for this artist. "ARD7TVE1187B99BFB1"
artist.latitude Float The home location's latitude of this artist. 0.0
artist.location Integer Unknown. 0
artist.longitude Float The home location's longitude of this artist. 0.0 String The name of the artist. "Casual"
artist.similar Float Unknown. 0.0
artist.terms String The term most associated with this artist. "hip hop"
artist.terms_freq Float The frequency of this term. 1.0 Integer The ID of the release (album) on the service 300848 Integer Unknown value 0
song.artist_mbtags Float Unknown field. 0.0
song.artist_mbtags_count Float Number of tags for the artist on mbtags. 0.0
song.bars_confidence Float Confidence value (between 0 and 1) associated with each bar. 0.643
song.bars_start Float Average start time of each bar, measured in bars. 0.58521
song.beats_confidence Float Average confidence interval of the beats. 0.834
song.beats_start Float Average start time of each beat, measured in beats. 0.58521
song.duration Float Duration of the track in seconds. 218.93179
song.end_of_fade_in Float Time of the end of the fade in, at the beginning of the song. 0.247
song.hotttnesss Float A measure of the song's popularity, when downloaded (in December 2010). Measured on a scale of 0 to 1. 0.60211999 String A uniquely identifying number for the song. "SOMZWCG12A8C13C480"
song.key Float Estimation of the key the song is in. Keys can be from 0 to 11. 1.0
song.key_confidence Float Confidence value (between 0 and 1) of the key estimation. 0.736
song.loudness Float General loudness of the track -11.197
song.mode Integer Estimation of the mode the song. 0
song.mode_confidence Float Confidence value (between 0 and 1) of the mode estimation. 0.636
song.start_of_fade_out Float Start time of the fade out, in seconds, at the end of the song. 218.932
song.tatums_confidence Float Confidence value (between 0 and 1) associated with each tatum. 0.779
song.tatums_start Float Average start time of each tatum, measured in tatums. 0.28519
song.tempo Float Tempo in BPM. 92.198
song.time_signature Float Time signature of the song, i.e. usual number of beats per bar. 4.0
song.time_signature_confidence Float Confidence of the time signature estimation 0.778
song.title Integer Name of the song. 0
song.year Integer Year when this song was released, according to 0