Thanks for sharing this! Recovering data with Snowplow can be tricky, but replaying raw events from S3 and carefully re-processing them usually helps resolve most issues. Keeping backups and monitoring the pipeline closely is always a good idea.
Recover data
Hi ,
We had outage in the last weekend for enrich server.
We have raw events stored in s3 bucket. Is there any way we can replay the events from s3 raw bucket.
Thankyou
0
3
6
Yes, you can definitely replay raw events from your S3 bucket. If you're using Snowplow’s batch pipeline, you can reprocess the data using the EmrEtlRunner or the newer Snowplow BDP pipeline tools. Just make sure the folder structure and permissions are intact. Also, check for any malformed files before replaying to avoid issues during enrichment.
Mattew wade
Edited
Hi Lakshmi Inavolu can you share details on how you have deployed the pipeline please? Is it Community Edition / what settings have you turned on?