Datafold, a data reliability company, and Hightouch, the leading data activation provider, today announced an integration to quickly provide data engineers with visibility into how dbt model changes impact activated data.
Hightouch helps companies automate the integration of data from data warehouses to operational systems such as CRMs, ERPs, advertising tooling, or custom applications. For example, Hightouch allows marketers to segment users based on their activity within a trial and then connect that data to an email tool to send custom messages to encourage further exploration of the product.
The integration with Datafold ensures that whenever an upstream data model is modified within a schema or pipeline, the data engineer will be alerted to the impact to any Hightouch sync. This prevents every update from inadvertently breaking the sync or changing the data within a sync in an unexpected way. Companies are building important automations with Hightouch, and Datafold makes this activated data even more reliable.
Savvy companies are using Hightouch to activate models from their data warehouse for mission-critical sales and marketing applications, by using Datafold and Hightouch together, our customers can proactively ensure that those mission-critical applications are powered by accurate data. Data engineers can visualize a complete data lineage graph and alert themselves when changing a model will affect their data activation workflows downstream.” Kashish Gupta, co-CEO at Hightouch.
Data activation is one of the most exciting developments in the modern data stack, and we are eager to make it even more reliable, Combining Data Diff with Hightouch’s data activation capabilities and dbt modeling provides a faster, smoother path to syncing data with operational tools. Data engineers can see impact reports of how their change would affect Hightouch syncs in every pull request before they commit their changes to production.” Gleb Mezhanskiy, founder and CEO at Datafold.
As an organization expands, its data models typically grow more complex, and with every update the number of affected systems increases. Changes can often have unexpected consequences, leading to data quality problems that undermine confidence, erode revenue, and lead to poor customer experiences.
Datafold prevents those problems by enabling data engineers to visualize the impact of each potential sync in the form of a column-level lineage graph. Because Datafold and Hightouch both integrate dbt modeling seamlessly, engineers can quickly and easily see how their dbt model update might impact their Hightouch sync within a pull request. That gives them the ability to identify potential problems both upstream and downstream of their changes before those problems have a chance to manifest in the production database.