Listing 22 datasets tagged with "dictionary"

Wordnet *****

Free

A large lexical database of English | The Comprehensive Knowledge Archive Network (CKAN) Collection | Added by Infochimps

“WordNet® is a large lexical database of English, developed under the direction of George A. Miller. Nouns, verbs, adjectives and adverbs are grouped into sets of cognitive synonyms (synsets), each expressing a distinct concept. Synsets are interlinked by means of conceptual-semantic and lexical …

Linguistics » Word Lists

Moby Project Word Lists | Added by Infochimps

This file consists of the 1,000 most frequently used English words from a wide variety of common texts listed in decreasing order of frequency

Linguistics » Word Lists

Moby Project Word Lists | Added by Infochimps

This file consists of the 1,000 most frequently used English words as used on the Internet computer network in 1992.

Linguistics » Text Corpora

Moby Project Word Lists | Added by Infochimps

This file consists of the 1,000 most frequently used English words from a wide variety of common texts listed in decreasing order of frequency

Linguistics » Word Lists

The Comprehensive Knowledge Archive Network (CKAN) Collection | Added by Infochimps

  1. About

Apertium Breton—French dictionary

  1. Openness
file says material is under GPL.

The Comprehensive Knowledge Archive Network (CKAN) Collection | Added by Infochimps

  1. About

From the creators:

> The purpose of this project is to create a free, open simple dictionary for students to use. The words in the dictionary will reviewed for quality and appropriateness and ultimately “frozen” for export into a variety of formats, including text, PDF, ebooks, wikis, w …


The Comprehensive Knowledge Archive Network (CKAN) Collection | Added by Infochimps

  1. About

From [about page](http://www.dict.cc/?s=about%3A):

> dict.cc is not only an online dictionary. It’s an attempt to create a platform where users from all over the world can share their knowledge in the field of translations. Every visitor can suggest new translations and correct or conf …


The Comprehensive Knowledge Archive Network (CKAN) Collection | Added by Infochimps

  1. Description

A set of dictionary related material (definitions, thesauri, jargon files etc) bundled together with an API.

A full list of the data included can be found on [this page](http://www.dict.org/bin/Dict?Form=Dict1&Query=00-database-info&Strategy=&Database=)

APIs in form of standar …


The Comprehensive Knowledge Archive Network (CKAN) Collection | Added by Infochimps

  1. About

Overview:

> The JMdict (Japanese-Multilingual Dictionary) project has at its aim the compilation of a multilingual lexical database with Japanese as the pivot language. The project began in 1999 as an offshoot of the EDICT Japanese-English Electronic Dictionary project. It involved a m …


eu-terminology **

Free

The Comprehensive Knowledge Archive Network (CKAN) Collection | Added by Infochimps

Excellent tool for following the same terms of EU-speak from language to language.


Wiktionary **

Free

The Comprehensive Knowledge Archive Network (CKAN) Collection | Added by Infochimps

“Welcome to the English-language Wiktionary, a collaborative project to produce a free, multilingual dictionary with definitions, etymologies, pronunciations, sample quotations, synonyms, antonyms and translations. Wiktionary is the lexical companion to the open-content encyclopedia Wikipedia.”


FreeDict **

Free

The Comprehensive Knowledge Archive Network (CKAN) Collection | Added by Infochimps

  1. About

Summary from [SourceForge page](http://sourceforge.net/projects/freedict/):

> Free translating dictionaries. The data is kept as XML complying to the TEI DTD. This enables to include features such as phonetics, part of speech and etymology information in a project independent format. …


The Comprehensive Knowledge Archive Network (CKAN) Collection | Added by Infochimps

Official translations of technical terminology.

From the Office of the French Language.

No obvious way to download the source.


Eurfa **

Free

The Comprehensive Knowledge Archive Network (CKAN) Collection | Added by Infochimps

  1. About

Eurfa consists of an English-Welsh and a Welsh-English dictionary. There are currently around 13,000 words. Authored by Kevin Donnelly, it is currently being used in the development of [Apertium-cy] (http://www.cymraeg.org.uk), a Welsh-English translator.

For further information, see [ …


The Comprehensive Knowledge Archive Network (CKAN) Collection | Added by Infochimps

  1. About

German English dictionary from Frank Richter at [Chemnitz University of Technology](http://www.tu-chemnitz.de/). It has been maintained since 1995 (see the [readme file](http://ftp.tu-chemnitz.de/pub/Local/urz/ding/de-en/Readme)). There are now over 216,000 entries.

  1. Format

Format …


The Comprehensive Knowledge Archive Network (CKAN) Collection | Added by Infochimps

Scans of the first edition of the Oxford English Dictionary along with some software to search those scans. [The post](http://lists.canonical.org/pipermail/kragen-tol/2006-March/000816.html) details work up to volume 6 (as of March 2006) and it is not clear whether any more digitization has been d …


Pete Skomoroch's Bookmarks | Added by Infochimps


The Comprehensive Knowledge Archive Network (CKAN) Collection | Added by Infochimps

  1. About

From the website:

> XDXF is a project to unite all existing open dictionaries and provide both users and developers with universal XML-based format, convertible to and from other popular dictionary formats.

There are currently 308 dictionary files in various languages.

  1. Format

I …


Wikipedia Infoboxes | Added by Infochimps

This dataset consists of a collection of Infoboxes from Wikipedia on the topic of Australian Dictionary Of Biography.