| bryguy ( @ 2008-04-29 11:52:00 |
very large dataset
I've got a problem for work and I thought perhaps one of my readers out there might have some insight.
I need to come up with a very large dataset of publicly available data that is related to some topic high schoolers might find intriguing (as much as that's possible :) ). My first thought was the freedb project which is a free implementation of the cddb database that Gracenote stole from the users a few years back, but I was disappointed to find that the whole dataset clocks in at under 600 megabytes. I need something that's at least a terabyte and ideally much bigger. It has to be free (not just as in cost, but as in freedom i.e. no copyright/trademark restrictions), it can't raise any privacy concerns, and it has to be pg-13.
If you have any suggestions and or pointers I will be in your debt.
I've got a problem for work and I thought perhaps one of my readers out there might have some insight.
I need to come up with a very large dataset of publicly available data that is related to some topic high schoolers might find intriguing (as much as that's possible :) ). My first thought was the freedb project which is a free implementation of the cddb database that Gracenote stole from the users a few years back, but I was disappointed to find that the whole dataset clocks in at under 600 megabytes. I need something that's at least a terabyte and ideally much bigger. It has to be free (not just as in cost, but as in freedom i.e. no copyright/trademark restrictions), it can't raise any privacy concerns, and it has to be pg-13.
If you have any suggestions and or pointers I will be in your debt.