Learn about the RDB Loader

Alec Moloney  
This article describes a feature or service available to Snowplow BDP customers.

The RDB Loader loads batches of events into Redshift, Snowflake, and Databricks. It is our recommended loader for Redshift and Databricks, and it runs in both AWS and GCP environments.

Architecture

The RDB Loader is made up of two separate components: a transformer, which converts enriched TSV events into a format that is ready to load, and a loader, which copies the transformed events into the warehouse.
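
The two-component split can be pictured as a two-stage pipeline. The sketch below is a toy model only: `TransformedBatch`, `transform`, and `load` are hypothetical names, an in-memory list stands in for the warehouse, and the real components use their own data formats and storage.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass
class TransformedBatch:
    """A batch of events in a load-ready shape (hypothetical structure)."""
    name: str
    rows: List[List[str]]


def transform(batch_name: str, enriched_tsv_lines: Iterable[str]) -> TransformedBatch:
    """Stage 1: split each non-empty enriched TSV line into its fields."""
    rows = [
        line.rstrip("\n").split("\t")
        for line in enriched_tsv_lines
        if line.strip()
    ]
    return TransformedBatch(name=batch_name, rows=rows)


def load(batch: TransformedBatch, warehouse: List[List[str]]) -> int:
    """Stage 2: copy the transformed rows into the stand-in warehouse."""
    warehouse.extend(batch.rows)
    return len(batch.rows)
```

Keeping the stages separate means a batch can be transformed once and loaded later, which is why each component has its own configuration options.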

Configuration Options

The RDB Loader uses username and password authentication to connect to your data warehouse and load events. The available configuration options are listed below.

Transformer

| Configuration Option | Description |
| --- | --- |
| Since Timestamp/Age | The start timestamp, or look-back period, that the transformer processes events from. |
| Schedule | The schedule the transformer runs on. Pipelines with larger event volumes require more frequent runs. |
| In-batch Deduplication | When enabled, deduplicates events within the same batch. Requires event fingerprinting. |
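
To illustrate why in-batch deduplication depends on event fingerprinting, here is a minimal sketch. It assumes that two events count as duplicates when they share both `event_id` and `event_fingerprint`; the function name `deduplicate_batch` is hypothetical, and the transformer's actual logic may differ.

```python
from typing import Dict, Iterable, List, Set, Tuple

Event = Dict[str, str]


def deduplicate_batch(events: Iterable[Event]) -> List[Event]:
    """Keep the first event seen for each (event_id, event_fingerprint) pair."""
    seen: Set[Tuple[str, str]] = set()
    unique: List[Event] = []
    for event in events:
        # Assumption: the fingerprint identifies the event's content, so the
        # same ID with a different fingerprint is treated as a different event.
        key = (event["event_id"], event["event_fingerprint"])
        if key not in seen:
            seen.add(key)
            unique.append(event)
    return unique
```

Without a fingerprint, events could only be compared by ID, so an event that reuses an ID but carries different content could not be told apart from a true duplicate.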

Loader

| Configuration Option | Description |
| --- | --- |
| Host / Port | The endpoint used to connect to your data warehouse. |
| Username | The user that loads the data. |
| Password | The password for the above user. |
| No Operations Schedule | A crontab representation of the time periods when the loader shouldn't run. We don't recommend using this option. |
| Redshift Max Errors | Used only when loading to Redshift. Sets how many load errors, as recorded in the STL_LOAD_ERRORS table, the loader tolerates before it fails the batch. |
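
As a rough illustration of the Redshift Max Errors option, Redshift's `COPY` command accepts a `MAXERROR` parameter that controls how many load errors are tolerated before the command fails. The sketch below assumes the option maps onto that parameter. The helper `build_copy_statement`, the bucket path, and the role ARN are hypothetical placeholders, and the loader's real statements will look different.

```python
def build_copy_statement(table: str, source: str, iam_role: str, max_errors: int) -> str:
    """Build an illustrative Redshift COPY statement with a MAXERROR limit."""
    # Redshift documents MAXERROR as ranging from 0 (the default) to 100000.
    if not 0 <= max_errors <= 100000:
        raise ValueError("max_errors must be between 0 and 100000")
    # Values are interpolated directly for illustration only; real code
    # should not build SQL from untrusted input this way.
    return (
        f"COPY {table} "
        f"FROM '{source}' "
        f"IAM_ROLE '{iam_role}' "
        f"MAXERROR {max_errors};"
    )


statement = build_copy_statement(
    table="atomic.events",
    source="s3://example-bucket/transformed/run=1/",  # placeholder path
    iam_role="arn:aws:iam::123456789012:role/example-loader",  # placeholder role
    max_errors=10,
)
```

A value of 0 makes the load fail on the first error, which is the strictest setting.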

FAQs

Below are frequently asked questions about the RDB Loader.

How can I rotate the user password?

To rotate the password for your RDB Loader user, contact our Support team and share the new password using the secure Credential sharing form in Console. To avoid disruption to your pipeline, we'll coordinate a time with you to update the credentials.