Featured Projects
Twitter Ambrose is a platform for visualization and real-time monitoring of MapReduce data workflows. It presents a global view of all...
Parquet is a columnar storage format that supports nested data. Parquet metadata is encoded using Apache Thrift. We created Parquet to...
Fast, testable, Scala HTTP services built on Finagle and Twitter-Server.
See the...
Storehaus is a library that makes it easy to work with asynchronous key value stores. Storehaus is built on top of Twitter’s...
Scala extensions for the Storm distributed computation system. Tormenta adds a type-safe wrapper over Storm’s Kafka and Kestrel...
Snowflake is a network service for generating unique ID numbers at high scale with some simple guarantees.
fatcache is memcache on SSD. Think of fatcache as a cache for your big data.
Libcrunch is a lightweight mapping framework that maps data objects to a number of nodes, subject to user-specified constraints.
...
hRaven collects run time data and statistics from map reduce jobs running on Hadoop clusters and stores the collected job history in an...








