- This dataset is about the people who use energy in Spain, whose energy production and grid was sampled for this dataset. There is no identifiable information on the individual level, meaning that there is little personal risk with this dataset.
- The energy usage and sources of energy production of the people of Spain are what is being recorded by the data set. This addresses all of our questions in Assignment 1.
- This data was collected between 2015 – 2019, meaning that the data is recent and therefore relevant. It is cross-sectional since the data was collected from several cities in Spain. This dataset contains real-time data.
- The data is collected from the five largest cities in Spain: Madrid, Barcelona, Valencia, Seville, and Bilbao. It could possibly be extrapolated that the energy usage would be similar in the surrounding European countries with similar populations and weather as these five cities, and it is certainly possible that larger generalizations about predicting energy usage could be used for non-European locations.
- The data was collected by ENTSOE, a public portal for Transmission Service Operator (TSO) data and is publicly available.
This work was partially sponsored by NSF.
NSF IUSE #1935076
CUE Ethics: Collaborative Research: Open Collaborative Experiential Learning (OCEL.AI): Bridging Digital Divides in Undergraduate Education of Data Science
01/01/2020 – 6/30/2021, $ 350,000
Copyright © 2020 OCEL.AI.