All Datasets

Check out our new MySpace data collection.


Infochimps makes it easy to share or sell data online

Add a Dataset
or Request Data if you can't find it!
order by: score | title | recently edited

Word List - 100,000+ official crossword words (Excel readable) *****

Moby Project Word Lists | Added by Infochimps almost 2 years ago

113,809 official crosswords A list of words permitted in crossword games such as Scrabble™. Compatible with the first edition of the Official Scrabble Players Dictionary™. Since this list has all forms: -ing, -ed, -s, and so on of words, it makes a good addition when building a custom spell …

Linguistics » Word Lists

Measuring Worth: Interest Rates - US & UK 1790-2007 *****

Twelve interest rate series for the United Kingdom and the United States | Added by mrflip 5 months ago

What Was the Interest Rate?
U.K. U.S.
Short-Term: Ordinary Funds, Contemporary Series
Short-Term: Ordinary Funds, Consistent Series
Short-Term: Surplus Funds, Contemporary Series
Short-Term: Surplus Funds, Consistent Series
Long-Term: Contemporary Series
Long-Term: Consisten …

Economics | Economics » Finance

Retrosheet: Game Logs (play-by-play) for Major League Baseball Games *****

A record of major league games played from 1871-2008 | Added by Infochimps 8 months ago

The game logs contain a record of major league games played from 1871-2008. At a minimum, it provides a listing of the date and score of each game. Where our research is more complete, we include information such as team statistics, winning and losing pitchers, linescores, attendance, starting pit …

Sports » Baseball

Twitter Census - Conversation Metrics: One year of URLs, Hashtags, Smileys usage (Smiley Counts) *****

Occurrence counts of tweet tokens: hashtags, URLs, & smileys by hour or month | Twitter Census | Added by MonkeywrenchConsultancy 4 months ago

This data comes from a scrape of the Twitter social network conducted by the Monkeywrench Consultancy. The full scrape consists of 35 million users, 500 million tweets, and 1 billion relationships between users.

This dataset is a corpus of tokens collected from tweets sent between March 2006 a …

Computers » Social Networks

Texas Assessment of Knowedge and Skills (TAKS Exams) 2003-2007 (2003-2007 Test Data) *****

Texas Assessment of Knowledge and Skills (TAKS) Scores by Year, Student, Grade and School | Added by mrflip 6 months ago

Texas Assessment of Knowledge and Skills (TAKS) Scores by Year, Student, Grade and School.

Student IDs are anonymized, but consistent from year to year. Some of the data is scrubbed to comply with NEPA privacy regulations.

Otherwise, this offers the answer to every question on the Texas stat …

Social Sciences » Education

MySpace User Activity Stream: User count by lat/long *****

MySpace Real-Time Stream | Added by MonkeywrenchConsultancy 5 days ago

This data is derived from the MySpace real-time stream API. It contains all users in our dataset, around 11 million, with well-formed latitude/longitude.

Computers » Social Networks | Geography

MySpace User Activity Stream: User count by zip code *****

MySpace Real-Time Stream | Added by MonkeywrenchConsultancy 5 days ago

This data is derived from the MySpace real-time stream API. It contains all users in our dataset, around 11 million, with well-formed zip codes.

Computers » Social Networks | Geography

MySpace User Activity Stream: Word count by hour from December 2009-March 2010 *****

MySpace Real-Time Stream | Added by MonkeywrenchConsultancy 5 days ago

This data is derived from the MySpace real-time stream API. The word count is from the free-form text fields MySpace moods, forum topic titles, replies to forum topics, text from sharing a link or item, and status mood updates. For the last three months the words from these fields has been extra …

Computers » Social Networks | Linguistics

MySpace User Activity Stream: Word count by day from December 2009-March 2010 *****

MySpace Real-Time Stream | Added by MonkeywrenchConsultancy 5 days ago

This data is derived from the MySpace real-time stream API. The word count is from the free-form text fields MySpace moods, forum topic titles, replies to forum topics, text from sharing a link or item, and status mood updates. For the last three months the words from these fields has been extra …

Computers » Social Networks | Linguistics

Word List - 100,000+ official crossword words (Excel readable) *****

Moby Project Word Lists | Added by Infochimps almost 2 years ago

113,809 official crosswords A list of words permitted in crossword games such as Scrabble™. Compatible with the first edition of the Official Scrabble Players Dictionary™. Since this list has all forms: -ing, -ed, -s, and so on of words, it makes a good addition when building a custom spell …

Linguistics » Word Lists

Word List - 100,000+ official crossword words (with Definitions, Excel format) *****

Moby Project Word Lists | Added by Infochimps almost 2 years ago

113,809 official crosswords A list of words permitted in crossword games such as Scrabble™. Compatible with the first edition of the Official Scrabble Players Dictionary™. Since this list has all forms: -ing, -ed, -s, and so on of words, it makes a good addition when building a custom spell …

Linguistics » Word Lists

Word List - 250,000+ Hyphenated, Capitalized and Compound English words *****

Moby Project Word Lists | Added by Infochimps almost 2 years ago

Over 256,700 hyphenated or other entries containing more than one word as well as all capitalized words and acronyms. Phrases were considered ‘common’ if they or variations of them occur in standard dictionaries or thesauruses.

Linguistics » Word Lists

MySpace User Activity Stream: Cumulative word count from from Dec 2009 to March 2010 *****

MySpace Real-Time Stream | Added by MonkeywrenchConsultancy 5 days ago

This data is derived from the MySpace real-time stream API. The word count is from the free-form text fields MySpace moods, forum topic titles, replies to forum topics, text from sharing a link or item, and status mood updates. For the last three months the words from these fields has been extra …

Computers » Social Networks | Linguistics

Word List - 350,000+ Simple English Words (with Definitions, Excel format) *****

Moby Project Word Lists | Added by Infochimps almost 2 years ago

Over 354,000 single words, excluding proper names, acronyms, or compound words and phrases. This list does not exclude archaic words or significant variant spellings.

Linguistics » Word Lists

Linguistic Data Consortium (LDC) - Collection of Linguistic Corpora and Datasets *****

Pete Skomoroch's Bookmarks | Added by Infochimps 11 months ago

The Linguistic Data Consortium is an open consortium of universities, companies and government research laboratories. It creates, collects and distributes speech and text databases, lexicons, and other resources for research and development purposes. The University of Pennsylvania is the LDC’s hos …

Linguistics

Freebase Data Dump *****

Added by Infochimps about 1 year ago

A data dump of all the current facts and assertions in the Freebase system.

Freebase is an open database of the worlds information, covering millions of topics in hundreds of categories. Drawing from large open data sets like Wikipedia, MusicBrainz, and the SEC archi …

Encyclopedic » Encyclopedias