Cancer--Estimated New Cases, and Survival Rates: 1987-1989 to 1996-2003 (Statistical Abstract 2008 Table 0171)

Size: 4.7 KB (approx) Downloaded: 0 times
Available in: csv, yaml, and xls Category: demographics/us

About

The Statistical Abstract of the United States is the standard summary of statistics on the social, political, and economic organization of the United States. It is also designed to serve as a guide to other statistical publications and sources. The latter function is served by the introductory text to each section, the source note appearing below each table, and Appendix I, which comprises the Guide to Sources of Statistics, the Guide to State Statistical Abstracts, and the Guide to Foreign Statistical Abstracts. This volume includes a selection of data from many statistical sources, both government and private. Publications cited as sources usually contain additional statistical detail and more comprehensive discussions of definitions and concepts. Data not available in publications issued by the contributing agency but obtained from the Internet or unpublished records are identified in the source notes. More information on the subjects covered in the tables so noted may generally be obtained from the source.

Although emphasis in the Statistical Abstract is primarily given to national data, many tables present data for regions and individual states and a smaller number for metropolitan areas and cities. Appendix II, Metropolitan and Micropolitan Statistical Areas: Concepts, Components, and Population, presents explanatory text, a complete current listing and population data for metropolitan and micropolitan areas defined as of December 2005. Statistics for the Commonwealth of Puerto Rico and for island areas of the United States are included in many state tables and are supplemented by information in Section 29. Additional information for states, cities, counties, metropolitan areas, and other small units, as well as more historical data are available in various supplements to the Abstract.

Fields

name  type  units  tags

Credits

US Census Bureau source http://www.census.gov/statab/www

U.S. Census Bureau, Statistical Abstract of the United States: 2008 (127th Edition) Washington, DC, 2007; http://www.census.gov/statab/www/

Philip (flip) Kromer converted http://infochimp.org/flip
U.S. National Institutes of Health, National Cancer Institute,

referenced on dataset section Data (#1)

Usage Notes

[none]

Rights Info

All US Census Bureau materials, regardless of the media, are entirely in the public domain. There are no user fees, site licenses, or any special agreements etc for the public or private use, and or reuse of any census title. As tax funded product, it’s all in the public record. Some of our products, however, are special cases. [...] The Statistical Abstract has some data covered by copyright law. Check the table’s footnotes to determine if the data are covered by copyright law.

File structure

The Statistical Abstract files are distributed by the Census Bureau as Excel files. These files have data mixed with notes and references, multiple tables per sheet, and, worst of all, table headers that aren’t easily matched to their rows and columns. The Excel files in this collection are untouched copies of the Census originals, with the following exceptions:

  1. A few files had extraneous characters in the title. These were corrected to be consistent. A few files have a sheet of crufty gibberish in the first slot. The sheet order was shuffled but no data were changed.

    The tables that were changed:

    0166 0257 0362 0429 0445 0446 0459 0461 0462 0464 0465 0466 0467 0469 0479 0480 0481 0482 0483 0484 0485 0486 0487 0559 0628 0629 1144 1227 1231

  2. The first four files have been restructured to allow full comprehension of the table. If you’d like to help clean up the data, follow along with what’s there.

The CSV files, and the payload portions of the YAML files, have not been processed beyond extracting an array (one entry per Excel sheet) of 2-D arrays (each sheet’s cells).
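As a rough illustration of that layout, the sketch below reads one sheet's cells into a 2-D list and picks out the rows that look like data. The sample text, the title row, and the helper names (`load_sheet`, `shape`, `data_rows`) are hypothetical; only the six numbers come from this table. The "all cells after the first are numeric" rule is a heuristic, not something the files guarantee.

```python
import csv
import io

# Hypothetical miniature of one sheet's cells: a title row, a header row,
# and two data rows whose numbers are taken from this table's snippet.
SAMPLE_CSV = """\
Table 171 (hypothetical title row),,,
Site,Total,Male,Female
All sites,1445,767,678
Lung,213,115,99
"""

def load_sheet(text):
    """Read CSV text into a 2-D list of cell strings (one sheet)."""
    return [row for row in csv.reader(io.StringIO(text))]

def shape(sheet):
    """Return [row count, widest row], in the style of the Shape field."""
    return [len(sheet), max((len(r) for r in sheet), default=0)]

def is_number(cell):
    """True if the cell parses as a number once thousands commas are removed."""
    try:
        float(cell.replace(",", ""))
        return True
    except ValueError:
        return False

def data_rows(sheet):
    """Keep rows whose cells after the first are all numeric (a heuristic)."""
    return [r for r in sheet if len(r) > 1 and all(is_number(c) for c in r[1:])]

sheets = [load_sheet(SAMPLE_CSV)]  # an array of sheets, each a 2-D array
print(shape(sheets[0]))
print(data_rows(sheets[0]))
```

Real sheets mix notes, headers, and several tables, so a filter like this is only a starting point; check its output against the original Excel file.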

Some metadata (title, footnotes, symbols, and sources) has been copied (without altering the imported stream) into the appropriate slot in this schema. This metadata identification was purposefully kept strict and simple, and the original files are somewhat irregular, so it’s possible that some metadata fields were missed.

These files have been tagged by hand and received cursory inspection, but you’re advised to check against the originals before you go launching any Mars rovers.

Footnotes

Notes (pg 2)

  1. Estimates provided by American Cancer Society (www.cancer.org) are based on rates from the National Cancer Institute’s SEER program.
  2. Includes other sites not shown separately.
  3. Survival rates for female only.
  4. All types combined.
  5. Invasive cancer only.

U.S. National Institutes of Health, National Cancer Institute, For more information: http://www.seer.cancer.gov/

Headnotes

[1,445 represents 1,445,000. The 5-year relative survival rate, which is derived by adjusting the observed survival rate for expected mortality, represents the likelihood that a person will not die from causes directly related to their cancer within 5 years. Survival data shown are based on those patients diagnosed while residents of an area listed below during the time periods shown. Data are based on information collected as part of the National Cancer Institute’s Surveillance, Epidemiology and End Results (SEER) program, a collection of population-based registries in five states (Connecticut, Hawaii, Iowa, New Mexico, Utah) and four metropolitan areas (Atlanta, Detroit, San Francisco-Oakland, and Seattle-Puget Sound)]
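The two conventions in the headnote can be sketched in a few lines. The unit conversion follows directly from "1,445 represents 1,445,000". The survival function uses the common textbook form, observed survival divided by expected survival; the headnote does not spell out the exact adjustment SEER applies, so treat that formula as an assumption. The function names and the 54.0 and 90.0 inputs are hypothetical and are not taken from the table.

```python
def thousands_to_count(value_in_thousands):
    """Convert a table value reported in thousands to a plain count."""
    return int(round(value_in_thousands * 1000))

def relative_survival(observed_pct, expected_pct):
    """Relative survival in percent: observed survival / expected survival.

    Assumed textbook form; the source only says the observed rate is
    'adjusted for expected mortality'.
    """
    if expected_pct <= 0:
        raise ValueError("expected survival must be positive")
    return 100.0 * observed_pct / expected_pct

print(thousands_to_count(1445))                 # 1445000
# Hypothetical inputs, for illustration only:
print(round(relative_survival(54.0, 90.0), 1))  # 60.0
```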

Shape

table: [30, 12]

Snippet

                      Cases,[1] 2007 (1,000)   White                                  Black
Site                  Total   Male   Female    1987-89  1990-92  1993-95  1996-2003   1987-89  1990-92  1993-95  1996-2003
All sites [2]         1445    767    678       57.7     62.4     63.4     67          43.6     48.2     52.8     57
Lung                  213     115    99        13.8     14.5     15.1     15.7        11.2     10.8     13       12.5
Breast [3]            181     2      178       85.3     86.7     87.9     90.3        71.2     71.7     72.8     77.9
Colon and rectum      154     79     75        61.1     63.1     61.5     65.9        53.3     53.8     52.9     55.7
... snip ...
Source: U.S. National Institutes of Health, National Cancer Institute,
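When a data row is flattened to space-separated text (for example `Breast 3 181 2 178 85.3 ...`), the footnote marker sits between the site name and the first value and is easy to mistake for data. The sketch below reads such a row from the right, assuming eleven value columns (three case counts, then four survival periods each for White and Black). The column labels and `parse_row` are illustrative names, not fields defined by the dataset.

```python
import re

# Illustrative column labels (an assumption, not the dataset's schema).
COLUMNS = [
    "total", "male", "female",
    "white_1987_89", "white_1990_92", "white_1993_95", "white_1996_2003",
    "black_1987_89", "black_1990_92", "black_1993_95", "black_1996_2003",
]

def parse_value(token):
    """Map the '(X)' not-applicable symbol to None, anything else to float."""
    return None if token == "(X)" else float(token)

def parse_row(line):
    """Split a flattened row into site name, footnote marker, and values."""
    tokens = line.split()
    values = [parse_value(t) for t in tokens[-len(COLUMNS):]]
    label = tokens[:-len(COLUMNS)]
    footnote = None
    # A lone trailing digit after the site name is treated as a footnote marker.
    if len(label) > 1 and re.fullmatch(r"\d", label[-1]):
        footnote = int(label.pop())
    return {"site": " ".join(label), "footnote": footnote,
            **dict(zip(COLUMNS, values))}

row = parse_row("Breast 3 181 2 178 85.3 86.7 87.9 90.3 71.2 71.7 72.8 77.9")
print(row["site"], row["footnote"], row["male"], row["black_1996_2003"])
```

Reading from the right keeps multi-word site names such as "Colon and rectum" intact, but it assumes every row has all eleven values, so rows with missing cells need separate handling.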

Symbols

Notes (pg 2)

  • (X) Not applicable.

Tablenum

0171

Year

2008

History

Uploaded by (admin) Modified by (admin)