ETL using application streaming and creating a Data Lake
Updated Apr 7, 2023 - Jupyter Notebook
This project outlines the final project requirements for DAV6100 - Information Architectures, focusing on group assignments, scoring criteria, topic selection, core requirements, and project components such as design, development, visualization, and executive presentation.
Uses AWS Glue to perform ETL operations and load the resulting data into AWS Redshift. In the second phase, AWS CloudWatch rules and Lambda are used to run the Glue jobs automatically.
This is a data pipeline built to serve a business team.
This project was designed to establish an architecture on AWS, originating from the migration of an existing database in an on-premise environment.
This project aims to automate the process of infrastructure creation.
This project aims to analyze the popularity of YouTube content across different regions by leveraging datasets sourced from Kaggle. It employs a systematic approach to data preprocessing, cleaning, and analysis using various Amazon Web Services (AWS) offerings, including S3, Lambda, and Glue, to build an automated ETL pipeline.
Terraform configuration that provisions several AWS resources, uploads data to S3, and starts the Glue Crawler and Glue Job.
Glue Data Quality Example - Deploy to your AWS account with Terraform to test
Data Engineer project using Python and some AWS data services
Terraform module to create and manage an AWS Glue job
Terraform module which creates Glue Job resources on AWS.
The Sensitive Data Protection on AWS solution allows enterprise customers to create data catalogs and to discover, protect, and visualize sensitive data across multiple AWS accounts. The solution eliminates the need for manual tagging to track sensitive data such as Personally Identifiable Information (PII) and classified information.
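As a minimal illustration of the underlying idea of automatic PII discovery (not the solution's actual detection logic, which is far more sophisticated), a pure-Python sketch might scan record values against a handful of patterns and report which columns look sensitive:

```python
import re

# Minimal, illustrative patterns only -- real detection uses richer
# classifiers than these two regexes.
PII_PATTERNS = {
    "email": re.compile(r"[\w.+-]+@[\w-]+\.[\w.]+"),
    "us_ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}

def classify_value(value):
    """Return the names of PII categories matched in a string."""
    return [name for name, pattern in PII_PATTERNS.items() if pattern.search(value)]

def scan_records(records):
    """Map each column name to the set of PII categories found in its values."""
    findings = {}
    for record in records:
        for column, value in record.items():
            hits = classify_value(str(value))
            if hits:
                findings.setdefault(column, set()).update(hits)
    return findings
```

The output of such a scan is exactly the kind of metadata a data catalog can store instead of relying on hand-applied tags.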
Extract, transform, and load data for analytic processing using AWS Glue
Build and deploy a serverless data pipeline on AWS with no effort.
Glue scripts for converting AWS Service Logs for use in Athena
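A minimal sketch of the kind of conversion such scripts perform, assuming S3 server access logs as input; a real Glue script would cover the full field list and typically write Parquet for Athena to query.

```python
import re

# Simplified pattern covering the first eight fields of an S3 server
# access log line (bucket owner, bucket, time, remote IP, requester,
# request ID, operation, key); real scripts parse every field.
S3_LOG_PATTERN = re.compile(
    r'^(?P<owner>\S+) (?P<bucket>\S+) \[(?P<time>[^\]]+)\] '
    r'(?P<remote_ip>\S+) (?P<requester>\S+) (?P<request_id>\S+) '
    r'(?P<operation>\S+) (?P<key>\S+)'
)

def parse_s3_log_line(line):
    """Extract structured columns from one access-log line, or None."""
    match = S3_LOG_PATTERN.match(line)
    return match.groupdict() if match else None
```

Rows parsed this way map directly onto a table schema that Athena can query, which is the point of converting the raw logs in the first place.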