Use Spark to transform and load from your Kafka cluster #6

MegCheppy · 2022-10-03T11:39:36Z

Using PySpark, write code that will transform and load the data from the data lake
By using Kafka as an input source for Spark Structured Streaming and Delta Lake as a storage layer, build a complete streaming data pipeline to consolidate our data - you should read From Kafka to Delta Lake using Apache Spark Structured Streaming (michelin.io)

MegCheppy assigned MegCheppy, Yohanes-GR and tibarekb Oct 6, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use Spark to transform and load from your Kafka cluster #6

Use Spark to transform and load from your Kafka cluster #6

MegCheppy commented Oct 3, 2022

Use Spark to transform and load from your Kafka cluster #6

Use Spark to transform and load from your Kafka cluster #6

Comments

MegCheppy commented Oct 3, 2022