How to Validate Data Diffs in the Extract + Load Process with Asset Checks #16918
sungchun12
started this conversation in
Show and tell
Replies: 1 comment 1 reply
-
Hi @sungchun12 ! The Guides are meant for existing Dagster Libraries, e.g. Outside of the integration, if you'd like we can also pair on a blog post together. Feel free to reach out to me on the Dagster Slack and we can decide on the most appropriate path |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hey There!
I've been working with @tacastillo to build out a simple integration how-to on using open source
data-diff
to solve a problem in data replication: proving data is the same (think: postgres <> snowflake replicated via fivetran)This makes most sense in a guide here as it's related to an open source library : https://docs.dagster.io/integrations
![image](https://private-user-images.githubusercontent.com/19176976/271685669-22179fd9-9d15-42e9-b1a1-8e8bce94a04f.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjAxODQwMDAsIm5iZiI6MTcyMDE4MzcwMCwicGF0aCI6Ii8xOTE3Njk3Ni8yNzE2ODU2NjktMjIxNzlmZDktOWQxNS00MmU5LWIxYTEtOGU4YmNlOTRhMDRmLnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNDA3MDUlMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjQwNzA1VDEyNDgyMFomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPWNhZGNlMzE1ZGI4MWU5YWIyMjBhYjNmM2FlMmY2NmUwMjcyYmU5MDIxNjZmZTY4YzkzM2RiZDY3OTU1ZTA1MjgmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0JmFjdG9yX2lkPTAma2V5X2lkPTAmcmVwb19pZD0wIn0.gI5ZqTw1skHxizE-XI3oYRChEXP9QXSfanlaezu9G5I)
Example tested by Tim! https://gist.github.com/tacastillo/bcf457a14fc70701825e410581c3eee9
I want to make it dead simple to trust data before end users yell at data teams for easily preventable errors.
Beta Was this translation helpful? Give feedback.
All reactions