Listing 53 datasets tagged with "language"

Word List - 100,000+ official crossword words (Excel readable) *****

Moby Project Word Lists | Added by Infochimps almost 2 years ago

113,809 official crossword words: a list of words permitted in crossword games such as Scrabble™. Compatible with the first edition of the Official Scrabble Players Dictionary™. Because this list includes all inflected forms of words (-ing, -ed, -s, and so on), it makes a good addition when building a custom spell …

Linguistics » Word Lists

Word List - 100,000+ official crossword words (with Definitions, Excel format) *****

Moby Project Word Lists | Added by Infochimps almost 2 years ago

113,809 official crossword words: a list of words permitted in crossword games such as Scrabble™. Compatible with the first edition of the Official Scrabble Players Dictionary™. Because this list includes all inflected forms of words (-ing, -ed, -s, and so on), it makes a good addition when building a custom spell …

Linguistics » Word Lists

Word List - 100,000+ official crossword words (Excel readable) *****

Moby Project Word Lists | Added by Infochimps almost 2 years ago

113,809 official crossword words: a list of words permitted in crossword games such as Scrabble™. Compatible with the first edition of the Official Scrabble Players Dictionary™. Because this list includes all inflected forms of words (-ing, -ed, -s, and so on), it makes a good addition when building a custom spell …

Linguistics » Word Lists

Word List - 250,000+ Hyphenated, Capitalized and Compound English words *****

Moby Project Word Lists | Added by Infochimps almost 2 years ago

Over 256,700 hyphenated or other entries containing more than one word as well as all capitalized words and acronyms. Phrases were considered ‘common’ if they or variations of them occur in standard dictionaries or thesauruses.

Linguistics » Word Lists

Word List - 350,000+ Simple English Words (with Definitions, Excel format) *****

Moby Project Word Lists | Added by Infochimps almost 2 years ago

Over 354,000 single words, excluding proper names, acronyms, or compound words and phrases. This list does not exclude archaic words or significant variant spellings.

Linguistics » Word Lists

Linguistic Data Consortium (LDC) - Collection of Linguistic Corpora and Datasets *****

Pete Skomoroch's Bookmarks | Added by Infochimps 11 months ago

The Linguistic Data Consortium is an open consortium of universities, companies and government research laboratories. It creates, collects and distributes speech and text databases, lexicons, and other resources for research and development purposes. The University of Pennsylvania is the LDC’s hos …

Linguistics

Word List - 350,000+ Simple English Words (Excel readable) *****

Moby Project Word Lists | Added by Infochimps almost 2 years ago

Over 354,000 single words, excluding proper names, acronyms, or compound words and phrases. This list does not exclude archaic words or significant variant spellings.

Linguistics » Word Lists

Word List - 350,000+ Simple English Words (with Definitions, Excel format) *****

Moby Project Word Lists | Added by Infochimps almost 2 years ago

Over 354,000 single words, excluding proper names, acronyms, or compound words and phrases. This list does not exclude archaic words or significant variant spellings.

Linguistics » Word Lists

Word List - 1,000 Most Frequently Used English Words by Frequency (with Definitions, Excel format) ****

Moby Project Word Lists | Added by Infochimps almost 2 years ago

This file consists of the 1,000 most frequently used English words from a wide variety of common texts, listed in decreasing order of frequency.

Linguistics » Word Lists

Word List - 1000 Most Frequent Words from an Internet Corpus ***

Moby Project Word Lists | Added by Infochimps almost 2 years ago

This file consists of the 1,000 most frequently used English words as used on the Internet computer network in 1992.

Linguistics » Text Corpora

Word List - 1,000 Most Frequently Used English Words by Frequency (with Definitions, Excel format) ***

Moby Project Word Lists | Added by Infochimps almost 2 years ago

This file consists of the 1,000 most frequently used English words from a wide variety of common texts, listed in decreasing order of frequency.

Linguistics » Word Lists

Word List - 74,000+ Common English Dictionary Words (with Definitions, Excel format) ***

Moby Project Word Lists | Added by Infochimps almost 2 years ago

74,550 common dictionary words: a list of words that appear in two or more published dictionaries. This gives the developer of a custom spelling checker a good starting pool of relatively common words.

Linguistics » Word Lists

Word List - 10,000+ Common Place Names ***

Moby Project Word Lists | Added by Infochimps almost 2 years ago

10,196 places (places.txt): a large selection of place names in the United States.

Geography » Geographical Names

Word List - 74,000+ Common English Dictionary Words (with Definitions, Excel format) ***

Moby Project Word Lists | Added by Infochimps almost 2 years ago

74,550 common dictionary words: a list of words that appear in two or more published dictionaries. This gives the developer of a custom spelling checker a good starting pool of relatively common words.

Linguistics » Word Lists

Children Who Speak a Language Other Than English at Home: 2000 to 2004 **

Table 223 of the 2008 US Statistical Abstract | Statistical Abstract of the United States | Added by Infochimps

The Statistical Abstract files are distributed by the US Census Bureau as Microsoft Excel files. These files have data mixed with notes and references, multiple tables per sheet, and, worst of all, the table headers are not easily matched to …


Language Spoken at Home--Cities of 100,000 or More: 2005 **

Table 54 of the 2008 US Statistical Abstract | Statistical Abstract of the United States | Added by Infochimps

The Statistical Abstract files are distributed by the US Census Bureau as Microsoft Excel files. These files have data mixed with notes and references, multiple tables per sheet, and, worst of all, the table headers are not easily matched to …


Languages Spoken at Home by Language: 2005 **

Table 52 of the 2008 US Statistical Abstract | Statistical Abstract of the United States | Added by Infochimps

The Statistical Abstract files are distributed by the US Census Bureau as Microsoft Excel files. These files have data mixed with notes and references, multiple tables per sheet, and, worst of all, the table headers are not easily matched to …


Foreign Language Enrollments in Public High Schools by Type of Language: 1970 to 2000 **

Table 262 of the 2008 US Statistical Abstract | Statistical Abstract of the United States | Added by Infochimps

The Statistical Abstract files are distributed by the US Census Bureau as Microsoft Excel files. These files have data mixed with notes and references, multiple tables per sheet, and, worst of all, the table headers are not easily matched to …

Social Sciences » Education

Language Spoken at Home by State: 2005 **

Table 53 of the 2008 US Statistical Abstract | Statistical Abstract of the United States | Added by Infochimps

The Statistical Abstract files are distributed by the US Census Bureau as Microsoft Excel files. These files have data mixed with notes and references, multiple tables per sheet, and, worst of all, the table headers are not easily matched to …


TalkBank **

The Comprehensive Knowledge Archive Network (CKAN) Collection | Added by Infochimps 10 months ago

About TalkBank:

> The goal of TalkBank is to foster fundamental research in the study of human and animal communication. It will construct sample databases within each of the subfields studying communication. It will use these databases to advance the development of standards and tools …