Visualize time series data using:
- ๐จ Line charts
- ๐ Bar charts
- ๐ Box plots
You will work with Pandas, Matplotlib, and Seaborn to analyze the number of daily page views on the freeCodeCamp.org forum between 2016-05-09 and 2019-12-03.
Gain insights into user behavior by identifying trends and seasonal patterns in forum visits.
The dataset contains daily page view counts from the freeCodeCamp forum. The cleaning process filters out extreme outliers (top 2.5% and bottom 2.5%).
- File:
fcc-forum-pageviews.csv
- Columns:
date
(Index): Date of page views.value
: Daily page view count.
-
Data Import and Cleaning
- Import the data using Pandas.
- Remove top and bottom 2.5% of page views.
-
Visualizations
-
Line Plot
- Visualize daily page views over time.
- Include title, axis labels, and proper formatting.
-
Bar Chart
- Display average daily page views per month grouped by year.
- Add month legends and axis labels.
-
Box Plots
- Show yearly trends and monthly seasonality.
- Create two side-by-side box plots.
-
Title: Daily freeCodeCamp Forum Page Views 5/2016-12/2019
- X-Axis: Years
- Y-Axis: Average Page Views
- Year-wise Trend: Highlights yearly distribution.
- Month-wise Seasonality: Focuses on monthly patterns.
time_series_visualizer.py
: Main script.main.py
: Testing script.test_module.py
: Unit tests.
-
Clone the repository.
git clone https://github.com/Alogyn/data-analysis-with-python-projects cd data-analysis-with-python-projects/boilerplate-page-view-time-series-visualizer
-
Install dependencies.
pip install -r requirements.txt
-
Run the script for development.
python main.py
-
Generate visualizations.
- Ensure the
fcc-forum-pageviews.csv
file is in the working directory.
- Ensure the
-
Save and return the charts.
- Use
test_module.py
to verify functionality. - Run tests with:
python -m unittest test_module.py
- โ Cleaned Time Series Data: Outliers removed for accurate trends.
- ๐ Three Types of Visualizations: Line, Bar, and Box Plots.
- ๐ Python Libraries: Leveraging Pandas, Matplotlib, and Seaborn.
- Languages: Python
- Libraries: Pandas, Matplotlib, Seaborn
- Environment: Gitpod Starter Code
- Add interactivity to the visualizations.
- Expand the analysis to include more metrics.
This project is licensed under the MIT License.