Skip to content
View kpkaranpatil600's full-sized avatar
🎯
Budding Data
🎯
Budding Data
  • Boston, MA
  • 10:35 (UTC -04:00)
Block or Report

Block or report kpkaranpatil600

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
kpkaranpatil600/README.md

Hi there, I'm Karan Patil

Profile Views:

VisitorCount

    Welcome to my portfolio. I'm a Business Intelligence/Data Engineering enthusiast who is passionate about digging raw data and turning it into meaningful data. Always been intrigued by visualizing how novel real-world problems can be solved just by critical thinking and innovation, nothing matters more than just acting at the right time.

πŸ”­ I’m currently looking for Full-time opportunities. Please reach out to me if you're hiring, have any questions.

πŸ‘― I’m looking to collaborate on Business Intelligence, Data Analyst, Data Engineer, Data Warehouse and Data Science.

Technical Skills:

postgresql mssql oracle python aws azure gcp git hadoop hive kafka

AmazonDynamoDB MySQL Dbeaver Tableau PowerBI

NumPy Pandas SciPy Plotly Databricks dbt Talend

Docker Postman MongoDB Redis Confluence Jira

πŸ§‘β€πŸ’» My Recent Projects:

Some amazing Data/BI projects πŸš€ coming soon

δ·› Repository Description
1️ Data Warehousing and Business_Intelligence for IMDB (Alteryx, Talend, PostgreSQL)
Designed Enterprise Data Warehouse on PostgreSQL by dimensional modeling to ingest 380 million records using Alteryx.
Leveraged data profiling and wrangling to resolve data quality issues, reduced anomalies by 30% and improved data integrity.
Conducted Error handling, SCDs and Performance Tuning, reducing load time of transformed data from 6 Hours to 45 Minutes
2 E-Commerce Data Architecture and Analysis (Google BigQuery, Airflow, Python, Looker)
Built data pipeline in Python and Airflow DAG to load data from diverse sources into BigQuery, reduced manual work by 38%
Optimized overall query performance and response time for analytics workloads through incremental data pulls, change data capture
Implemented forecasting and demand analysis to visualize e-commerce KPI (sales, inventory visibility, web traffic, conversion rates, user engagement) in Looker, 25% reduction in response time to market trends
3 Financial Asset Management System (SQL, PL/SQL, Oracle Cloud)
Developed an application in Oracle, providing individuals with centralized platform to track all investments in one place
Enabled real-time profit/loss reporting for individual assets, empowering users to make informed financial decisions
4 Strom Data Analysis and Nowcasting of Weather (Python, Apache Airflow, AWS, Docker, FastAPI)
Structured an end-to-end application using Streamlit and FastAPI to forecast NOAA data from the SEVIR dataset for Federal Aviation; deployed on GCP Compute Engine, resulting in a 40% reduction in data processing time
Integrated AWS Lambda, Docker and ECR to perform Sentimental analysis and Narrative Summarization of storm data using NER model from the Hugging Face library, achieved 20% decrease in overall processing costs
5 New York Citi Bike Analysis (BigQuery, Tableau prep, MySQL, Excel)
Developed ETL pipeline to loading 120M+ records from MySQL and public datasets into in BigQuery using Tableau prep
Analyzed bike trip records and generated Power BI reports on bike usage trends by location, gender, age; identified opportunities for 20% increase in bike rentals within 6 months
6 Boston Train Operations Management System (Microsoft SQl Server, ER Studio, Tableau, Excel)
Constructed a normalized Database using SQL Server for Train Management System for searching scheduling and canceling tickets to improve the existing system
Designed Entity Relationship model and physical data model using ER Studio
Built 5+ SQL functions, stored procedures and triggers to store, maintain data to track ticket book history and their status

Pinned Loading

  1. Data-Warehousing-and-Business-Intelligence-for-IMDB Data-Warehousing-and-Business-Intelligence-for-IMDB Public

    Designed Multi Star schema with 10 Fact & 33 Dimension tables & developed Data Integration Pipeline by ETL workflow to load all tables (380M+ rows) from multiple sources such as CSV, MySQL, MSSQL &…

    TSQL 2

  2. SQL_DataCamp_Solutions SQL_DataCamp_Solutions Public

    This repository contains solutions of DataCamp SQL problems I solved

    1

  3. SQL_HackerRank_Solutions SQL_HackerRank_Solutions Public

    This repository contains solutions of HackerRank_SQL problems I solved on MS SQL Server

    SQL 1