What are the best datasets for building a data visualisation portfolio?
Datasets for data visualization examples, teaching and research - Cross Validated
Where to find data for data visualization?
Suggest some dataset for data analysis and visualization(beginner)
Can I combine multiple datasets for analysis or visualization?
What is a dataset in data visualization?
What is an example of a dataset?
Videos
Hi,
I have recently developed an interest in data visualisation and EDA. I am currently trying to figure out what are some great datasets that I can put on my portfolio.
There are large number of databases available on internet. Depending on the subject, you can get different sources.
For example, in Human Development subject area you can have data sources at (http://hdrstats.undp.org/):
http://hdrstats.undp.org/en/tables/default.html
For Climate change observation, there is a web with high resolution climate data at (http://www.ipcc-data.org/), for example:
http://www.ipcc-data.org/obs/cru_ts2_1.html
Both examples, contains real data, used in published scientific papers, with large quantity of data. Time related and/or space related data. Visualization possibilities of those data are endless.
I like to use the Anscombe data sets (also available in R) to show the importance of plotting when doing regressions. If you aren't familiar, you get the same regression line and diagnostics from all four data sets, even though the sets themselves all look quite different. You can take the plots below and turn them into residual plots to illustrate problems that you might look for in the residuals after performing a regression.
