---
- infochimps_schema:
    #
    # Fundamental description
    #
    # Name for This Dataset (free text) - in Titlecase
    name: "Population and Area: 1790 to 2000 (Statistical Abstract 2008 Table 0001)"
    # A unique_name_in_identifier_form
    uniqid: statab2008_0001_populationandarea1790to2000
    # Freeform tags describing the real-world concepts, space separated (put phrases in ""s)
    # Think: "If I were searching for this, what might I type in?"
    tags: 'population "population density" area growth'
    # Name the larger collection this dataset belongs to.
    # (Name it the same as the dataset if it's a collection of one)
    collection: Statistical Abstract of the United States 2008
    # collection_unique_name_in_identifier_form
    coll_uniqid: statisticalabstract_2008
    # Tags for the collection itself
    coll_tags: "us united states usa america census demographics government"
    # List each format this dataset is available in. The hashes will be filled
    # in automagically; just put a line like the ones below for each format. Use
    # 'flat' for fixed-width text files, and generally the extension for most others.
    formats:
      csv: {}
      yaml: {}
      xls: {}
    #
    # Contributors
    #
    # People/organizations who created or prepared the dataset. Include links
    # and citations wherever possible. This gives credit where it's deserved,
    # and allows people to trace the provenance of the data.
    #
    contributors:
      - name: US Census Bureau
        # We should probably be using the wikipedia citation structure.
        # Textile's fine too, as shown here.
        cite: |-
          U.S. Census Bureau, Statistical Abstract of the United States: 2008 (127th Edition)
          Washington, DC, 2007; "http://www.census.gov/statab/www/":http://www.census.gov/statab/www
        # The page where you got this. Link to an HTML page.
        url: http://www.census.gov/statab/www
        role: source
        desc: ""
        uniqid: gov.census/statab
      - name: Philip (flip) Kromer
        url: http://infochimp.org/flip
        role: converted
        uniqid: org.infochimp/flip
    #
    # Free-form descriptive notes
    #
    # The identifier will be titleized (_ to spaces, words Capitalized) and
    # appear as a header.
    #
    # An anchor link is created for each note, so if you'd like to link back
    # to a different note field use
    #   "link text":#note_identifier
    # for example "(usage overview)":#usage
    #
    notes:
      # Free-form (textile) description of the dataset.
      desc: |-
        For every US census from 1790 to 2000: the resident population, its density
        (persons per square mile of land area), its absolute and percentage increase
        over the preceding census, and the total, land, and water area.
      # Describe the collection as a whole
      collection_desc: |-
        The Statistical Abstract of the United States is the standard summary of statistics
        on the social, political, and economic organization of the United States. It is also
        designed to serve as a guide to other statistical publications and sources. The latter
        function is served by the introductory text to each section, the source note appearing
        below each table, and Appendix I, which comprises the Guide to Sources of Statistics,
        the Guide to State Statistical Abstracts, and the Guide to Foreign Statistical Abstracts.

        This volume includes a selection of data from many statistical sources, both government
        and private. Publications cited as sources usually contain additional statistical detail
        and more comprehensive discussions of definitions and concepts. Data not available in
        publications issued by the contributing agency but obtained from the Internet or
        unpublished records are identified in the source notes. More information on the subjects
        covered in the tables so noted may generally be obtained from the source.

        Although emphasis in the Statistical Abstract is primarily given to national data, many
        tables present data for regions and individual states and a smaller number for
        metropolitan areas and cities. Appendix II, Metropolitan and Micropolitan Statistical
        Areas: Concepts, Components, and Population, presents explanatory text, a complete
        current listing and population data for metropolitan and micropolitan areas defined as
        of December 2005.
        Statistics for the Commonwealth of Puerto Rico and for island areas of the United
        States are included in many state tables and are supplemented by information in
        Section 29. Additional information for states, cities, counties, metropolitan areas,
        and other small units, as well as more historical data are available in various
        supplements to the Abstract.
      #
      # Any technical stuff you'd need to know. If this is long, break it up
      # into an overview (here) and other free-form notes; link to each with
      #   "link text":#anchor_tag
      #
      usage: |
        The Statistical Abstract files are distributed by the US Census Bureau as Excel
        files. These files mix data with notes and references, put multiple tables on one
        sheet, and, worst of all, have table headers that aren't easily matched to their
        rows and columns, so they will be difficult to parse in bulk.
      #
      # Please be careful to include the exact text of any license or
      # request for restrictions accompanying this dataset. If they ask that it
      # be included as a file please also inject that into the payload.
      #
      # Here's some info about copyright and collections of facts:
      #   http://blog.infochimps.org/2008/04/02/good-neighbors-and-open-grazing/
      rights: |-
        All US Census Bureau materials, regardless of the media, are entirely in the public
        domain. There are no user fees, site licenses, or any special agreements etc for the
        public or private use, and or reuse of any census title. As tax funded product, it's
        all in the public record.
        Some of our products, however, are special cases. [...] The Statistical Abstract has
        some data covered by copyright law. Check the table's footnotes to determine if the
        data are covered by copyright law.
      # Eventually we have to track dimensionality, but for now
      # fill in a 'table: [rows, columns]' note if you feel like it.
      shape: "table: [23, 13]"
      #
      # Don't fill this unless it's free to do so; this will eventually be
      # autogenerated.
      #
      snippet: |
        | Census Date | Census data notes | Resident population - Number | Resident population - Number (notes) | Resident population - Per square mile of land area | Resident population - Increase over preceding census - Number | Resident population - Increase over preceding census - Percent | (Resident Population Increase - notes) | Area - Total (square miles) | (Area - Total notes) | Area - Land | (Area - Land notes) | Area - Water |
        | 1790-08-02 | | 3929214 | | 4.5 | (X) | (X) | | 891364 | | 864746 | | 24065 |
        | 1800-08-04 | | 5308483 | | 6.1 | 1379269 | 35.1 | | 891364 | | 864746 | | 24065 |
        | 1810-08-06 | | 7239881 | | 4.3 | 1931398 | 36.4 | | 1722685 | | 1681828 | | 34175 |
        | 1820-08-07 | | 9638453 | | 5.5 | 2398572 | 33.1 | | 1792552 | | 1749462 | | 38544 |
        | 1830-06-01 | | 12866020 | | 7.4 | 3227567 | 33.5 | | 1792552 | | 1749462 | | 38544 |
        | 1840-06-01 | | 17069453 | | 9.8 | 4203433 | 32.7 | | 1792552 | | 1749462 | | 38544 |
        | 1850-06-01 | | 23191876 | | 7.9 | 6122423 | 35.9 | | 2991655 | | 2940042 | | 52705 |
        | 1860-06-01 | | 31443321 | | 10.6 | 8251445 | 35.6 | | 3021295 | | 2969640 | | 52747 |
        | 1870-06-01 | \2 | 39818449 | \2 | 11.2 | 8375128 | 26.6 | | 3612299 | | 3540705 | | 68082 |
        | 1880-06-01 | | 50189209 | | 14.2 | 10370760 | 26 | | 3612299 | | 3540705 | | 68082 |
        | 1890-06-01 | | 62979766 | | 17.8 | 12790557 | 25.5 | | 3612299 | | 3540705 | | 68082 |
        |\13=. ... __snip__ ... |
        | 1990-04-01 | \4 | 248718302 | | 70.3333567101 | 22176103 | 9.78895018142 | \5 | 3717796 | | 3536278 | \5 | 181518 |
        | 2000-04-01 | \6 | 281424603 | | 79.5560425357 | 32706301 | 13.1499373938 | | 3794083.06 | | 3537438.44 | | 256644.62 |
      #
      # You can enter any note you like: footnotes, in this example.
      #
      footnotes_and_symbol_explanations: |
        h2. Notes (pg 2)

        # Data for 1790 to 1980 cover inland water only. Data for 1990 comprise Great Lakes, inland, and coastal water. Data for 2000 comprise Great Lakes, inland, territorial, and coastal water.
        # Revised to include adjustments for underenumeration in southern states; unrevised number is 38,558,371 (10.9 per square mile).
        # Total population count has been revised since the 1980 census publications. Numbers by age, race, Hispanic origin, and sex have not been corrected.
        # The April 1, 1990, census count includes count question resolution corrections processed through December 1997, and does not include adjustments for census coverage errors.
        # Data reflect corrections made after publication of the results.
        # Reflects modifications to the Census 2000 population as documented in the Count Question Resolution program.
      # The notes can be big or small, no matter:
      statistical_abstract_table_number: 1
    #
    # Field listing
    #
    fields:
      -
        # Name for this field (free text) - in Titlecase
        name: Census Date
        # Identifier for this field.
        uniqid: census_date
        # The real-world concept this field embodies:
        tags: date
        # The units of this field's real-world concept. Remember, the tags
        # describe the *concept*, the units describe its *representation*. Enter
        # any "Frink-understandable units":http://futureboy.homeip.net/frinkdata/units.txt
        units: date.iso
        # And this says how it will be represented in the computer.
        # Any of the Kwalify simple datatypes: http://kuwata-lab.com/kwalify/ruby/users-guide.html
        # (str int float number text bool date time timestamp scalar seq map any)
        datatype: date
      # If it's not really clear what to do, just make up something sensible,
      # as here with the tags and units "_note"
      - name: Census data notes
        tags: _note
        units: _note
        datatype: str
        uniqid: census_data_notes
      - name: Resident population - Number
        tags: country persons
        units: persons
        datatype: int
        uniqid: resident_population_number
      #
      # Simple equations are fine. Use anything "Frink digs":http://futureboy.homeip.net/frinkdata/units.txt
      - name: Resident population - Per square mile of land area
        tags: country numberdensity:persons-area
        units: persons / mile^2
        datatype: float
        uniqid: resident_population_per_square_mile_of_land_area
      - name: Resident population - Increase over preceding census - Number
        tags: country rate:persons-time
        units: persons / year
        datatype: int
        uniqid: resident_population_increase_over_preceding_census_number
      #
      # If appropriate, describe a percent change as (unit/unit)% -- this
      # divides out in the mathematical sense but retains the information about
      # what's being examined.
      - name: Resident population - Increase over preceding census - Percent
        tags: country rate:pct_persons-time
        units: (persons/persons)% / year
        datatype: float:2.1
        uniqid: resident_population_increase_over_preceding_census_percent
      - name: Area - Land
        tags: country area
        units: mile^2
        datatype: int
        uniqid: area_land
      - name: Area - Water
        tags: country area
        units: mile^2
        datatype: int
        uniqid: area_water
    #
    # We don't use these yet, but we will eventually.
    # See the HOWTO
    #   http://help.infochimps.org/help/show/HOWTO+Schema
    # for criteria
    ratings:
      interesting:
        rating: 3
        by: initial
        story: ""
      authoritative:
        rating: 3
        by: initial
        story: ""
      comprehensive:
        rating: 3
        by: initial
        story: ""
      accurate:
        rating: 3
        by: initial
        story: These files have not been checked for conversion errors.
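    #
    # Worked check of the derived population fields (a sketch, not part of the
    # original schema). The figures are the 1790 and 1800 rows of the snippet; that
    # density is taken over land area rather than total area is an assumption read
    # off the field name "Per square mile of land area".
    #   increase (number)  = 5308483 - 3929214       = 1379269 persons
    #   increase (percent) = 1379269 / 3929214 * 100 = 35.1 (rounded)
    #   density            = 5308483 / 864746        = 6.1 persons / mile^2 (rounded)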