Freebase Data Dump

A data dump of all the current facts and assertions in the Freebase system.

Freebase is an open database of the worlds information, covering millions of topics in hundreds of categories. Drawing from large open data sets like Wikipedia, MusicBrainz, and the SEC archives, it contains structured information on many popular topics, including movies, music, people and locations – all reconciled and freely available. This information is supplemented by the efforts of a passionate global community of users who are working together to add structured information on everything from philosophy to European railway stations to the chemical properties of common food ingredients. For more answers check the Freebase FAQ.

Freebase provides full data dumps of all the current facts and assertions in their system. Freebase data dumps are complete, general-purpose extracts of the Freebase data in a variety of formats. Freebase releases a fresh data dump every three months.

Two formats are currently available:

  • TSV A tab-separated file for each type in Freebase, suitable for loading into spreadsheets or database software. Each line in these files represents an instance of a Freebase type, the columns represent the available properties for the type. You may download the full set, or browse Freebase domains and types to find specific data sets.
  • The January 2009 full download is approximately 492 Mbytes compressed in the Bzip2 format.
  • The January 2009 browseable set contains 3319 TSV files in 76 domains.
  1. Link Export A full dump of Freebase assertions in a simple utf8 text format. This is a complete “low level” dump of data which is suitable for post processing into RDF or XML datasets. The format of the link export is a series of lines, one assertion per line. The lines are tab separated quadruples, , , , An assertion is a statement of fact about the object. In any assertion, either the or or both and are present.
  • A sample of this output is available.
  • The January 2009 Link Export is approximately 1411 Mbytes compressed in the Bzip2 format (updated January 13, 2009).
Visit source

This data is not yet in the Infochimps repository — please continue your journey offsite to get it from its primary source.


Tags:
Price: Free
Categories:
Collection:
Sources:
License:
Added about 1 year ago by Infochimps
External http://www.freebase.com

Edit Dataset