Traptor – A distributed Twitter feed

https://travis-ci.org/istresearch/traptor.svg?branch=master

traptor is a framework to help manage your twitter data collection. What differentiates traptor from the many other Twitter libraries out there is that it does real-time distributed streaming of data based on rule sets using the Twitter Streaming API.

It uses a combination of Kafka, Redis, and the excellent birdy module. The goal is to have a convenient way to aggregate all of your twitter application data into one data stream and (optionally) a database. It uses birdy to make Twitter API connections, redis to handle the rule management among different traptor instances, and kafka to handle the data streams.

Please see http://traptor.readthedocs.org for documentation and the Quick Start guide.

Overview

Overview of Traptor and its dependencies.

Quick Start

Get running with a local Traptor stream!

Production

Deploying Traptor to a distributed environment.

Traptor API

The Traptor API.