Skip to content

This project conducts Exploratory Data Analysis (EDA) on the Stack Overflow Annual Developer Survey data. The goal of the analysis is to gain insights into the demographics, preferences, and trends among developers based on the survey responses.

Notifications You must be signed in to change notification settings

Chethan57/EDA-on-Stackoverflow-survey

Repository files navigation

Exploratory Data Analysis (EDA) on Stack Overflow Survey Data

Overview

This project conducts Exploratory Data Analysis (EDA) on the Stack Overflow Annual Developer Survey data. The goal of the analysis is to gain insights into the demographics, preferences, and trends among developers based on the survey responses.

Data Source

The dataset used in this analysis is the Stack Overflow Annual Developer Survey data, which is publicly available on Stack Overflow's website. The dataset contains survey responses from thousands of developers around the world and includes information on demographics, programming languages, technologies, job roles, and more.

Files

  • Stackoverflow-eda: Jupyter Notebook containing the Python code for the EDA.
  • survey_results_public.csv: CSV file containing the raw survey data used in the analysis.

Libraries Used

  • Pandas
  • NumPy
  • Matplotlib
  • Seaborn

Analysis Steps

  1. Data Loading and Cleaning: Loading the survey data into a DataFrame and performing necessary data cleaning steps such as handling missing values, renaming columns, and filtering relevant columns.
  2. Exploratory Data Analysis: Conducting exploratory data analysis to gain insights into various aspects of developers' demographics, preferences, and trends. This includes visualizations such as histograms, bar plots, and heatmaps to analyze distributions, correlations, and trends in the data.
  3. Key Findings: Summarizing the key findings and insights obtained from the analysis, including trends in programming languages, technologies, job satisfaction, and demographics among developers.

Visualisation

1-Top Countries of Respondents

Image Description

2-Age of Respondents

Image Description

3-Top Education of Respondents

Image Description

4- When did you first start to code

Image Description

4- Years of coding experience

Image Description

Conclusion

  • The survey may not fully represent certain demographics, with fewer responses from individuals in non-English-speaking countries, as well as from women and non-binary genders.
  • Efforts should be made to support and encourage underrepresented groups across various dimensions such as age, country, race, gender, and more to foster a more inclusive community.
  • Formal education in computer science is not a prerequisite for pursuing a career in programming, as evidenced by a significant portion of programmers without a computer science degree.
  • Part-time work and freelance opportunities are prevalent among programmers, offering flexibility and accessibility for individuals, particularly newcomers, to enter the field and gain experience.
  • JavaScript and HTML/CSS emerge as the most utilized programming languages in 2020, closely followed by SQL and Python, highlighting their significance in contemporary software development practices.

In conclusion, while the programming community continues to evolve and diversify, there remains room for improvement in fostering inclusivity and recognizing alternative pathways into the field. Addressing these aspects can create a more vibrant and equitable programming ecosystem for all individuals involved.


About

This project conducts Exploratory Data Analysis (EDA) on the Stack Overflow Annual Developer Survey data. The goal of the analysis is to gain insights into the demographics, preferences, and trends among developers based on the survey responses.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages