Throughout the years I've had to create different datasets for different things. A lot of them are pretty boring,
but here's a few that are cool.
Description: Datasets covering the USDA Pomological Watercolor Collection:
The USDA Pomological Watercolor Collection documents fruit and nut varieties developed by growers or introduced by USDA plant explorers around the turn of the 20th century. Technically accurate paintings were used to create lithographs illustrating USDA bulletins, yearbooks, and other series distributed to growers and gardeners across AmericaIncludes both a
.csv
file containing metadata for each painting as well as a directory of images for all 7,584 paintings.
Link: https://github.com/jwilber/USDA_Pomological_Watercolors
Description: Covers of all National Geographic Magazines, (1960-2018).
Description: All Bob Ross paintings (and a few from his son) featured in the TV Show 'The Joy of Painting'.
Includes both a .csv
file containing metadata for each painting as well as a directory of images for all 411 paintings.
Description: Dataset covering the music used in skateboarding videos from 1989 to 2018. Data scraped from skatevideosite.com.
Link: https://github.com/the-pudding/data/tree/master/skate-music