Data
Each year, Viz Week posts a variety of challenges to see which groups around the world can most effectively use visualization to solve problems. We look at the 120 MB dataset from the 2011 Mini Challenge 1 (Geospatial and Microblogging: Characterization of an Epidemic Spread), which consists mostly of time-stamped and location-stamped text messages from a one-month period.
Data can be downloaded from http://hcil.cs.umd.edu/localphp/hcil/vast11/
Data are available in CSV format, with the following attributes:
- ID – personal identifier of the individual posting the message
- Created_at – date and time of the post
- Location – latitude and longitude coordinates of the mobile device at the time of post
- Text – the posted message
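As a rough illustration of the layout described above, the following sketch reads a few rows with Python's standard csv module. The header spelling and the sample rows are assumptions made up for the example; they are not taken from the real file.

```python
import csv
import io

# Hypothetical sample in the shape described above; the exact header names
# and value formats of the real file are assumptions.
SAMPLE = """ID,Created_at,Location,text
1,5/18/2011 13:26,42.22717 93.33772,example message one
2,5/18/2011 13:27,42.18881 93.35873,"example, with a comma"
"""

def read_messages(handle):
    """Return every CSV row as a dict keyed by the header names."""
    return list(csv.DictReader(handle))

# For the real dataset, pass an open file instead of the in-memory sample.
messages = read_messages(io.StringIO(SAMPLE))
```

Using csv.DictReader rather than splitting on commas keeps quoted messages that contain commas intact.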
The CSV file was imported into an online MySQL server to make its content easier to analyze and manage. The Location and Created_at (date) fields were further processed to speed up access to the raw information, which added five columns to each record:
- POSIX Date
- Hour
- Day
- Latitude
- Longitude
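One way the five derived columns could be computed from a single record is sketched below. This is not the import script that was actually used: the timestamp format, the space-separated "latitude longitude" layout, the use of UTC, and the reading of Day as day of the month are all assumptions, and derive_columns is a hypothetical helper name.

```python
from datetime import datetime, timezone

def derive_columns(created_at, location, fmt="%m/%d/%Y %H:%M"):
    """Split one record's date and location into the five extra columns.

    Assumptions: created_at matches fmt, timestamps are treated as UTC,
    and location is "latitude longitude" separated by whitespace.
    """
    moment = datetime.strptime(created_at, fmt).replace(tzinfo=timezone.utc)
    lat_text, lon_text = location.split()
    return {
        "posix_date": int(moment.timestamp()),  # seconds since the Unix epoch
        "hour": moment.hour,
        "day": moment.day,                      # day of month (assumed meaning)
        "latitude": float(lat_text),
        "longitude": float(lon_text),
    }

row = derive_columns("5/18/2011 13:26", "42.22717 93.33772")
```

Storing the hour, day, and coordinates as separate numeric columns lets the database filter and group on them directly instead of parsing strings in every query.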
To connect the application with the database we use this library.