Ruby Poddar
The project aims to analyze global happiness scores using various data visualization techniques and statistical tests. The goal is to gain insights into factors influencing happiness across different countries.
- Python: Programming language used for data manipulation, analysis, and visualization.
- Pandas: Library for data manipulation and analysis in Python.
- Matplotlib: Library for creating static, animated, and interactive visualizations in Python.
- Seaborn: Statistical data visualization library based on Matplotlib.
- Plotly: Library for interactive, web-based visualization.
- Dash: Framework for building analytical web applications.
- Loaded the dataset containing global happiness scores.
- Checked for missing values and handled them as necessary.
- Converted data types to ensure consistency and accuracy.
- Conducted summary statistics to understand the distribution of happiness scores.
- Visualized data using histograms, scatter plots, and correlation matrices to explore relationships between variables.
- Created new features based on existing data to derive additional insights.
- Utilized Matplotlib and Seaborn to visualize relationships and distributions in the data.
- Generated scatter plot matrices and heatmaps to analyze correlations between variables.
- Performed statistical tests such as ANOVA to examine significant differences between groups of data.
- Used Dash and Plotly to create an interactive web-based dashboard.
- Designed the dashboard to display key visualizations dynamically, allowing users to explore data interactively.
graph TD
A[Start] --> B[Data Loading and Preparation]
B --> C[Exploratory Data Analysis]
C --> D[Feature Engineering]
D --> E[Data Visualization]
E --> F[Statistical Analysis]
F --> G[Dashboard Creation]
G --> H[End]
Through this project, we gained valuable insights into global happiness trends and factors influencing happiness scores across countries. The combination of data exploration, visualization techniques, and statistical analysis provided a comprehensive understanding of the dataset. The interactive dashboard created using Dash and Plotly enables users to explore and interpret the data effectively.
- Expand the dataset to include more years or additional factors influencing happiness.
- Implement machine learning models for predictive analysis of happiness scores.
- Enhance the dashboard with more interactive features and real-time data updates for ongoing analysis.
This project is licensed under the MIT Public License .
- Clone the repository:
git clone https://github.com/rubypoddar/HappinessScoreVisualizer.git
- Navigate to the project directory:
cd HappinessScoreVisualizer
- Install the required dependencies:
pip install -r requirements.txt
- Run the dashboard:
python app.py
- Python 3.8+
- Pandas 1.3.3
- Matplotlib 3.4.3
- Seaborn 0.11.2
- Plotly 5.1.0
- Dash 1.21.0
If you have any questions or suggestions, feel free to contact me at [email protected].