From the CORGIS Dataset Project
By Austin Cory Bart acbart@vt.edu
Version 2.0.0, created 4/2/2016
Tags: classics, books, texts, text, book, classic, english, shakespeare, literature, novel, language, composition, writing, author, publication, words
Project Gutenberg (PG) is a volunteer effort to digitize and archive cultural works, to ‘encourage the creation and distribution of eBooks’. It was founded in 1971 by Michael S. Hart and is the oldest digital library. This dataset is a collection of the 1000 most popular books on Project Gutenberg, as determined by download counts. Each book has information about its authorship, publication date, Library of Congress classification, and a few other fields. Each book also has some simple computed statistics based on common metrics such as sentiment analysis, Flesch-Kincaid reading level, and average sentence length.
https://www.gutenberg.org/ebooks/search/?sort_order=downloads
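To give a feel for the computed statistics mentioned above, here is a minimal sketch of how average sentence length and the Flesch-Kincaid grade level could be calculated for a piece of text. It uses the standard grade-level formula, 0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59. The function names, the regular-expression tokenization, and the vowel-group syllable counter are illustrative assumptions; the dataset's own values may have been produced with a different tool, so results need not match exactly.

```python
import re


def count_syllables(word):
    """Estimate syllables as the number of vowel groups in the word.

    This is a rough heuristic (an assumption for this sketch), not the
    method used to build the dataset.
    """
    groups = re.findall(r"[aeiouy]+", word.lower())
    return max(1, len(groups))


def text_statistics(text):
    """Return simple readability statistics for a block of English text."""
    # Treat runs of ., ! or ? as sentence boundaries; drop empty pieces.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    # A word is a run of letters, optionally with one internal apostrophe.
    words = re.findall(r"[A-Za-z]+(?:'[A-Za-z]+)?", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")

    syllables = sum(count_syllables(w) for w in words)
    average_sentence_length = len(words) / len(sentences)
    # Standard Flesch-Kincaid grade level formula.
    grade = (0.39 * average_sentence_length
             + 11.8 * (syllables / len(words))
             - 15.59)
    return {
        "sentences": len(sentences),
        "words": len(words),
        "syllables": syllables,
        "average_sentence_length": average_sentence_length,
        "flesch_kincaid_grade": grade,
    }


if __name__ == "__main__":
    sample = "The cat sat. The dog ran fast."
    for name, value in text_statistics(sample).items():
        print(name, value)
```

Very short, simple text can produce a grade level below zero, since the formula is calibrated for ordinary prose rather than for tiny samples.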