interestingness:
100

Freebase Data Dump

... || category: Encyclopedic > Encyclopedias || downloadable as: tsv, rdf

A data dump of all the current facts and assertions in the Freebase system. Freebase is an open database of the world's information, covering millions of topics in hundreds of categories. Drawing from large open data sets like Wikipedia, MusicBrainz, and the SEC archives, it contains structured information on many po …more

interestingness:
99

Freebase.com Wikipedia Extraction (WEX)

... || category: Encyclopedic > Encyclopedias || downloadable as: wex

The Freebase Wikipedia Extraction (WEX) is a processed dump of the English-language Wikipedia. The wiki markup for each article is transformed into machine-readable XML, and common relational features such as templates, infoboxes, categories, article sections, and redirects are extracted in tabular form. Freebase WEX is provided as a set of …more

interestingness:
92

Daily 1970-Current Open, Close, Hi, Low and Volume (NYSE exchange)

... || category: Economics > Finance || downloadable as: csv, yaml

Daily Open, Close, Low, High and Volume.

interestingness:
90

Daily 1970-Current Open, Close, Hi, Low and Volume (NASDAQ exchange)

... || category: Economics > Finance || downloadable as: csv, yaml

Daily Open, Close, Low, High and Volume.

interestingness:
89

Daily 1970-Current Open, Close, Hi, Low and Volume (AMEX exchange)

... || category: Economics > Finance || downloadable as: csv, yaml

Daily Open, Close, Low, High and Volume.

interestingness:
87

MusicBrainz

... || category: Arts and Culture > Music || downloadable as:
  1. Description

The MusicBrainz database stores all the data of the MusicBrainz music metadata catalogue. This includes the data about Artists, Releases, Tracks, and the AdvancedRelationships between them, as well as the MusicBrainz users (editors) and the changes they entered into the database (edits).

  1. Format

Postgres database dump (us …more

interestingness:
86

DBPedia Main

... || category: Encyclopedic > Encyclopedias || downloadable as: rdf

DBpedia is a community effort to extract structured information from Wikipedia and to make this information available on the Web. The DBpedia knowledge base currently describes more than 2.6 million things, including at least 213,000 persons, 328,000 places, 57,000 music albums, 36,000 films, 20,000 companies. The knowledge base consists of 2 …more

interestingness:
83

Wordnet

... || category: Linguistics > Word Lists || downloadable as:

“WordNet® is a large lexical database of English, developed under the direction of George A. Miller. Nouns, verbs, adjectives and adverbs are grouped into sets of cognitive synonyms (synsets), each expressing a distinct concept. Synsets are interlinked by means of conceptual-semantic and lexical relations. The resulting network of meaningful …more

interestingness:
82

Word List - 100,000+ official crossword words

... || category: Linguistics > Word Lists || downloadable as: flat

113,809 official crossword words. A list of words permitted in crossword games such as Scrabble™. Compatible with the first edition of the Official Scrabble Players Dictionary™. Because this list includes all inflected forms of each word (-ing, -ed, -s, and so on), it makes a good addition when building a custom spelling dictionary.

interestingness:
82

Word List - 250,000+ Hyphenated, Capitalized and Compound English words

... || category: Linguistics > Word Lists || downloadable as: flat

Over 256,700 hyphenated or other entries containing more than one word as well as all capitalized words and acronyms. Phrases were considered ‘common’ if they or variations of them occur in standard dictionaries or thesauruses.

interestingness:
78

The Whitburn Project: 120 Years of Music Chart History

For the last ten years, obsessive record collectors on Usenet have been working on the Whitburn Project, a huge undertaking to preserve and share high-quality recordings of every popular song since the 1890s. To assist their efforts, they’ve created a spreadsheet of 37,000 songs and 112 columns of raw data, including each song’s duration, …more

interestingness:
76

Word List - 350,000+ Simple English Words

... || category: Linguistics > Word Lists || downloadable as: flat

Over 354,000 single words, excluding proper names, acronyms, and compound words and phrases. Archaic words and significant variant spellings are included.

interestingness:
60

UK National Health Service Choices web services

  1. About

From website:

> Web services support business-to-business syndication of content over the internet. NHS Choices has created a set of web services to allow approved partners to interact with the service, free of charge. The web services return NHS Choices content in a form that can be easily integrated into a website or application. …more
