I am looking for some large public datasets, in particular:
Large sample web server logs that have been anonymized.
Datasets used for database performance benchmarking.
Any other links to large public datasets would be appreciated. I already know about Amazon's public datasets at: http://aws.amazon.com/publicdatasets/
-
Well for the web server logs you could always just generate them for the format you need. If you are going to test code against it etc. it will have to be tailored to the fields you want to store/parse.
For the datasets used for database performance benchmarking, you'll probably want to look at a tool that can generate data for you. Red Gate has a great one for not too much money.
-
Just a thought:
- USGS Geographic Names database
- USDA PLANTS checklist
- Any one of the many state GIS repositories e.g. NH's GRANIT
-
Google Fusion Tables has a few.
-
Find some Public Open Datasets at Edigitalz
0 comments:
Post a Comment