Listing 7 datasets tagged with "bigdata"

Occurrence counts of tweet tokens: hashtags, URLs, & smileys by hour or month | Twitter Census | Added by Infochimps

This data comes from a scrape of the Twitter social network conducted by the Monkeywrench Consultancy. The full scrape consists of 35 million users, 500 million tweets, and 1 billion relationships between users.

This dataset is a corpus of tokens collected from tweets sent between March 2006 a …

Computers » Social Networks | Social Sciences » Communications | Social Sciences » Sociology | History » Modern History
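
As a rough illustration of what an "occurrence counts by hour" table for hashtags, URLs, and smileys might look like, here is a minimal Python sketch. The regular expressions, the `count_tokens_by_hour` helper, and the sample tweets are all assumptions made for illustration; the listing does not describe how the original scrape tokenized tweets.

```python
import re
from collections import Counter
from datetime import datetime

# Hypothetical token patterns; the scrape's real tokenization rules are not given.
HASHTAG_RE = re.compile(r"#\w+")
URL_RE = re.compile(r"https?://\S+")
SMILEY_RE = re.compile(r"[:;=][-']?[)(DPp]")


def count_tokens_by_hour(tweets):
    """Count token occurrences keyed by (hour, kind, token).

    `tweets` is an iterable of (created_at, text) pairs, where
    created_at is a datetime and text is the tweet body.
    """
    counts = Counter()
    for created_at, text in tweets:
        hour = created_at.strftime("%Y-%m-%d %H")
        # Extract URLs first, then blank them out so that a "#fragment"
        # or ":/" inside a URL is not miscounted as a hashtag or smiley.
        urls = URL_RE.findall(text)
        rest = URL_RE.sub(" ", text)
        for url in urls:
            counts[(hour, "url", url)] += 1
        for tag in HASHTAG_RE.findall(rest):
            counts[(hour, "hashtag", tag.lower())] += 1
        for smiley in SMILEY_RE.findall(rest):
            counts[(hour, "smiley", smiley)] += 1
    return counts


# Made-up sample data, not taken from the dataset.
sample = [
    (datetime(2009, 3, 1, 14, 5), "Loving #BigData :) http://example.com/a"),
    (datetime(2009, 3, 1, 14, 40), "more #bigdata :-) :)"),
    (datetime(2009, 3, 1, 15, 0), "#bigdata again"),
]
counts = count_tokens_by_hour(sample)
for key, n in sorted(counts.items()):
    print(key, n)
```

Bucketing by month instead of by hour would only require changing the `strftime` format to `"%Y-%m"`.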

Freebase Data Dump

Free

Added by Infochimps

A data dump of all the current facts and assertions in the Freebase system.

Freebase is an open database of the world's information, covering millions of topics in hundreds of categories. Drawing from large open data sets like Wikipedia, MusicBrainz, and the SEC archi …

Encyclopedic » Encyclopedias

Added by Infochimps

The Freebase Wikipedia Extraction (WEX) is a processed dump of the English-language Wikipedia. The wiki markup for each article is transformed into machine-readable XML, and common relational features such as templates, infoboxes, categories, article sections, and redirects are extracted in tabul …

Encyclopedic » Encyclopedias

DBpedia Main

Free

Added by Infochimps

DBpedia is a community effort to extract structured information from Wikipedia and to make this information available on the Web. The DBpedia knowledge base currently describes more than 2.6 million things, including at least 213,000 persons, 328,000 places, 57,000 music albums, 36,000 films, 20,0 …

Encyclopedic » Encyclopedias

Federal Climate Complex GSOD (Global Surface Summary of Day) version 7 | Added by Infochimps

The GSOD (Global Daily) Data

The GSOD dataset comes from the National Climatic Data Center (NCDC) and can be downloaded from ftp://ftp.ncdc.noaa.gov/pub/data/gsod/

You can fetch your own copy with

wget -r -l3 --no-clobber --no-parent --no-verbos …

Science » Meteorology

Federal Climate Complex GSOD (Global Surface Summary of Day) version 7 | Added by Infochimps

About

This is an extract from the “Global Daily Weather Data from the National Climate Data Center (NCDC)” dataset, covering only Austin.

Science » Meteorology

The Comprehensive Knowledge Archive Network (CKAN) Collection | Added by Infochimps

About

> One web page for every book ever published. It’s a lofty, but achievable, goal.

> To build it, we need hundreds of millions of book records, a brand new database infrastructure for handling huge amounts of dynamic information, a wiki interface, multi-language support, and people w …


User Counts for User Profile Background Colors | Twitter Census | Added by Infochimps

This data comes from a scrape of the Twitter social network conducted by the Monkeywrench Consultancy. The full scrape consists of 40 million users, 1.6 billion tweets, and more than 1 billion relationships between users.

This dataset is a list of user profile background color counts collected …

Computers » Social Networks | Social Sciences » Communications | Social Sciences » Sociology | History » Modern History

User Counts by the Number of Friends a User is Following | Twitter Census | Added by Infochimps

This data comes from a scrape of the Twitter social network conducted by the Monkeywrench Consultancy. The full scrape consists of 40 million users, 1.6 billion tweets, and more than 1 billion relationships between users.

This dataset is a list of user counts for the number of friends followed co …

Computers » Social Networks | Social Sciences » Communications | Social Sciences » Sociology | History » Modern History

User Counts by the Number of Other Users (Followers) Following a User | Twitter Census | Added by Infochimps

This data comes from a scrape of the Twitter social network conducted by the Monkeywrench Consultancy. The full scrape consists of 40 million users, 1.6 billion tweets, and more than 1 billion relationships between users.

This dataset is a list of user counts for the number of followers collected …

Computers » Social Networks | Social Sciences » Communications | Social Sciences » Sociology | History » Modern History

Number of User Accounts Created by Month | Twitter Census | Added by Infochimps

This data comes from a scrape of the Twitter social network conducted by the Monkeywrench Consultancy. The full scrape consists of 40 million users, 1.6 billion tweets, and more than 1 billion relationships between users.

This dataset is a list of the number of user counts by the month in which t …

Computers » Social Networks | Social Sciences » Communications | Social Sciences » Sociology | History » Modern History

Number of User Accounts Created by Day | Twitter Census | Added by Infochimps

This data comes from a scrape of the Twitter social network conducted by the Monkeywrench Consultancy. The full scrape consists of 40 million users, 1.6 billion tweets, and more than 1 billion relationships between users.

This dataset is a list of the number of user counts by the day on which the …

Computers » Social Networks | Social Sciences » Communications | Social Sciences » Sociology | History » Modern History

Number of User Accounts Created by Hour | Twitter Census | Added by Infochimps

This data comes from a scrape of the Twitter social network conducted by the Monkeywrench Consultancy. The full scrape consists of 40 million users, 1.6 billion tweets, and more than 1 billion relationships between users.

This dataset is a list of the number of user counts by the hour in which th …

Computers » Social Networks | Social Sciences » Communications | Social Sciences » Sociology | History » Modern History

Number of Tweets Created by Month | Twitter Census | Added by Infochimps

This data comes from a scrape of the Twitter social network conducted by the Monkeywrench Consultancy. The full scrape consists of 40 million users, 1.6 billion tweets, and more than 1 billion relationships between users.

This dataset is a list of the number of tweet counts by the month in which …

Computers » Social Networks | Social Sciences » Communications | Social Sciences » Sociology | History » Modern History

Number of Tweets Created by Day | Twitter Census | Added by Infochimps

This data comes from a scrape of the Twitter social network conducted by the Monkeywrench Consultancy. The full scrape consists of 40 million users, 1.6 billion tweets, and more than 1 billion relationships between users.

This dataset is a list of the number of tweet counts by the day on which th …

Computers » Social Networks | Social Sciences » Communications | Social Sciences » Sociology | History » Modern History

User Count by Location Given in User Profile | Twitter Census | Added by Infochimps

This data comes from a scrape of the Twitter social network conducted by the Monkeywrench Consultancy. The full scrape consists of 40 million users, 1.6 billion tweets, and more than 1 billion relationships between users.

This dataset is a list of user locations collected from user profiles betwe …

Computers » Social Networks | Social Sciences » Communications | Social Sciences » Sociology | History » Modern History

Occurrence counts of tweet tokens: hashtags, URLs, & smileys by month | Twitter Census | Added by Infochimps

This data comes from a scrape of the Twitter social network conducted by the Monkeywrench Consultancy. The full scrape consists of 40 million users, 1.6 billion tweets, and more than 1 billion relationships between users.

This dataset is a corpus of tokens collected from tweets sent between March …

Computers » Social Networks | Social Sciences » Communications | Social Sciences » Sociology | History » Modern History

List of emoticons (smileys) extracted from tweets | Twitter Census | Added by mrflip

This data comes from a scrape of the Twitter social network conducted by the Monkeywrench Consultancy. The full scrape consists of 40 million users, 1.6 billion tweets, and more than 1 billion relationships between users.

This dataset is a list of emoticons (smiley faces) collected from tweets …

Computers » Social Networks | Social Sciences » Communications | Social Sciences » Sociology | History » Modern History