Tag Archives: Sample Data

Public Data for Practice

Published / by shep2010

It is no secret that i will frequently use my blogs as a resource for me to collect and store my own thoughts and to remember where i put things, and i kinda figure if i need to learn something and write it down, others can probably benefit from it.  This particular post will be a living post as i am always finding new public data sources i need to remember.   Some off my links will be duplicated in other links. Some of these will be required for some future blogs of R training scripts.

The Equality of Opportunity Project

Science and Engineering Doctorates

United States Education Data (Maintained on the USDA site.. )
https://data.ers.usda.gov/reports.aspx?ID=18243

Social Security Data Files by Title
https://www.ssa.gov/policy/data_title.html

Florida Data
http://www.floridacharts.com/FLQuery/Population/PopulationRpt.aspx

Florida Election Watch
http://enight.elections.myflorida.com/

US Bureau of Labor and Statistics
https://www.bls.gov/data/

Google Public Data
http://www.google.com/publicdata/

The New York Independent System Operator (power grid) http://mis.nyiso.com/public/

Generically Awesome Public Datasets
https://github.com/caesar0301/awesome-public-datasets

Amazon Public Data Sets
http://aws.amazon.com/datasets/

Check out the Data Section
https://trello.com/b/rbpEfMld/data-science

Kaggle Datasets
https://www.kaggle.com/datasets

UCI Machine Learning Repository
http://archive.ics.uci.edu/ml/

Yahoo Datasets
http://webscope.sandbox.yahoo.com/#datasets

New York Public Library
http://www.nypl.org/research/collections/digital-collections/public-domain