About

Cloudberry is a research prototype to support interactive analytics and visualization of large amounts of spatial-temporal data.

Basic Information:

  • Data set: Tweets
  • Number of records: > 20,000,000
  • Collection period: From 2016-03-31 to 2016-04-07
  • Total data size: > 17G bytes
  • The live tweets is appending to db at the speed of ~40 tweets/sec
  • Source code

The backend is running the big data management system Apache AsterixDB to support large compute clusters. For questions and comments, please contact ics-cloudberry@uci.edu