Set up the containers for the various steps of the data pipeline
We need to set up three containers, one for each of the three steps of the pipeline.
Here is a schematic representation of the pipeline:
   STATE 1              STATE 2               STATE 3
+-----------+        +-----------+        +-------------+
| raw data  |------->| sanitized |------->| DB (public) |
+-----------+        +-----------+        +-------------+
  /data/raw         /data/sanitized        /data/reports
To move into STATE 1 you must run the task described in: https://trac.torproject.org/projects/tor/ticket/13566
To move into STATE 2 you must run the task described in: https://trac.torproject.org/projects/tor/ticket/13563
To move into STATE 3 you must run the task described in: https://trac.torproject.org/projects/tor/ticket/13564
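
As a rough illustration only, the schematic above could be written down as data, so that a script can look up which directory and ticket belong to the next state. The names Stage, PIPELINE and next_stage are hypothetical and are not part of the actual pipeline code; only the state labels, directories and ticket URLs come from the notes above.

```python
from typing import NamedTuple, Optional


class Stage(NamedTuple):
    name: str       # state label from the schematic
    directory: str  # directory shown under that state
    ticket: str     # ticket describing the task that moves data into this state


# Hypothetical in-code copy of the schematic (illustrative, not the real pipeline).
PIPELINE = [
    Stage("STATE 1", "/data/raw",
          "https://trac.torproject.org/projects/tor/ticket/13566"),
    Stage("STATE 2", "/data/sanitized",
          "https://trac.torproject.org/projects/tor/ticket/13563"),
    Stage("STATE 3", "/data/reports",
          "https://trac.torproject.org/projects/tor/ticket/13564"),
]


def next_stage(current: Optional[str]) -> Optional[Stage]:
    """Return the stage after `current`.

    Passing None returns the first stage; passing the last stage returns None.
    Raises ValueError if `current` is not a known state label.
    """
    if current is None:
        return PIPELINE[0]
    names = [stage.name for stage in PIPELINE]
    index = names.index(current)
    if index + 1 < len(PIPELINE):
        return PIPELINE[index + 1]
    return None
```

For example, next_stage("STATE 1") returns the STATE 2 entry, whose directory is /data/sanitized and whose ticket is 13563.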