Traptor – A distributed Twitter feed

traptor is a framework to help manage your twitter data collection. What differentiates traptor from the many other Twitter libraries out there is that it does real-time distributed streaming of data based on rule sets using the Twitter Streaming API.

It uses a combination of Kafka, Redis, and the excellent birdy module. The goal is to have a convenient way to aggregate all of your twitter application data into one data stream and (optionally) a database. It uses birdy to make Twitter API connections, redis to handle the rule management among different traptor instances, and kafka to handle the data streams.

Please see for documentation and the Quick Start guide.


Overview of Traptor and its dependencies.

Quick Start

Get running with a local Traptor stream!


Deploying Traptor to a distributed environment.

Traptor API

The Traptor API.