

📊 Page View Time Series Visualizer

🔧 Project Overview

Visualize time series data using:

  • 🎨 Line charts
  • 📊 Bar charts
  • 🔄 Box plots

You will work with Pandas, Matplotlib, and Seaborn to analyze the number of daily page views on the freeCodeCamp.org forum between 2016-05-09 and 2019-12-03.


Objective

Gain insights into user behavior by identifying trends and seasonal patterns in forum visits.


📊 Data Description

The dataset contains daily page view counts from the freeCodeCamp forum. The cleaning process filters out extreme outliers (top 2.5% and bottom 2.5%).

  • File: fcc-forum-pageviews.csv
  • Columns:
    • date (Index): Date of page views.
    • value: Daily page view count.
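
The import and cleaning steps described above could be sketched as follows. This is a minimal sketch, not the project's actual code: the function names `load_page_views` and `clean_page_views` are hypothetical, and keeping values that fall exactly on the 2.5th or 97.5th percentile is an assumption. The sample data is made up so the snippet runs without the real CSV.

```python
import io

import pandas as pd


def load_page_views(source):
    """Read the CSV, parse the `date` column, and use it as the index."""
    return pd.read_csv(source, parse_dates=["date"], index_col="date")


def clean_page_views(df):
    """Drop rows below the 2.5th or above the 97.5th percentile of `value`."""
    lower = df["value"].quantile(0.025)
    upper = df["value"].quantile(0.975)
    # Assumption: values exactly on a boundary are kept.
    return df[(df["value"] >= lower) & (df["value"] <= upper)]


# Made-up stand-in for fcc-forum-pageviews.csv: 100 days with values 1..100.
dates = pd.date_range("2016-05-09", periods=100, freq="D")
sample = pd.DataFrame({"date": dates, "value": range(1, 101)})
buffer = io.StringIO(sample.to_csv(index=False))

df = load_page_views(buffer)
df_clean = clean_page_views(df)
```

With the real data, `load_page_views("fcc-forum-pageviews.csv")` would take the place of the in-memory buffer.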

🗒 Tasks

  1. Data Import and Cleaning

    • Import the data using Pandas.
    • Remove top and bottom 2.5% of page views.
  2. Visualizations

    • Line Plot

      • Visualize daily page views over time.
      • Include title, axis labels, and proper formatting.
    • Bar Chart

      • Display average daily page views per month grouped by year.
      • Add month legends and axis labels.
    • Box Plots

      • Show yearly trends and monthly seasonality.
      • Create two side-by-side box plots.

🎨 Visualizations

1. 🔮 Line Chart

Title: Daily freeCodeCamp Forum Page Views 5/2016-12/2019
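
One way the line chart might be drawn is sketched below. It assumes a DataFrame indexed by date with a `value` column, as in the data description. The function name `draw_line_plot`, the axis labels "Date" and "Page Views", the figure size, and the line color are assumptions for illustration; only the title comes from this README.

```python
import matplotlib

matplotlib.use("Agg")  # headless backend so the sketch runs without a display
import matplotlib.pyplot as plt
import pandas as pd


def draw_line_plot(df):
    """Plot daily page views over time and return the figure."""
    fig, ax = plt.subplots(figsize=(15, 5))
    ax.plot(df.index, df["value"], color="tab:red", linewidth=1)
    ax.set_title("Daily freeCodeCamp Forum Page Views 5/2016-12/2019")
    ax.set_xlabel("Date")        # assumed label
    ax.set_ylabel("Page Views")  # assumed label
    return fig
```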

2. 🌈 Bar Chart

  • X-Axis: Years
  • Y-Axis: Average Page Views
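
The grouping behind the bar chart (average daily page views per month, grouped by year) could look roughly like this. It is a sketch under assumptions: the function name `draw_bar_plot`, the legend title "Months", and the figure size are not taken from this README, and the input is assumed to be a DataFrame indexed by date with a `value` column.

```python
import calendar

import matplotlib

matplotlib.use("Agg")  # headless backend so the sketch runs without a display
import pandas as pd


def draw_bar_plot(df):
    """Plot the mean daily page views per month, one bar group per year."""
    monthly = df.copy()
    monthly["year"] = monthly.index.year
    monthly["month"] = monthly.index.month

    # Rows are years, columns are months, cells are mean daily page views.
    table = monthly.groupby(["year", "month"])["value"].mean().unstack("month")
    table.columns = [calendar.month_name[int(m)] for m in table.columns]

    ax = table.plot(kind="bar", figsize=(10, 8))
    ax.set_xlabel("Years")
    ax.set_ylabel("Average Page Views")
    ax.legend(title="Months")  # assumed legend title
    return ax.get_figure()
```

Unstacking by month keeps the months in calendar order, so the legend entries follow January to December for whichever months are present.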

3. 🔢 Box Plots

  • Year-wise Trend: Highlights yearly distribution.
  • Month-wise Seasonality: Focuses on monthly patterns.
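
The two side-by-side box plots might be produced along these lines with Seaborn. This is a sketch, not the project's code: the function name `draw_box_plot`, the axis labels, and the figure size are assumptions, and the input is assumed to be a DataFrame whose index is named `date` with a `value` column. The panel titles reuse the wording from the list above.

```python
import matplotlib

matplotlib.use("Agg")  # headless backend so the sketch runs without a display
import matplotlib.pyplot as plt
import pandas as pd
import seaborn as sns

MONTH_ORDER = ["Jan", "Feb", "Mar", "Apr", "May", "Jun",
               "Jul", "Aug", "Sep", "Oct", "Nov", "Dec"]


def draw_box_plot(df):
    """Draw year-wise and month-wise box plots next to each other."""
    data = df.reset_index()
    data["year"] = data["date"].dt.year
    data["month"] = data["date"].dt.strftime("%b")  # abbreviated month name

    fig, (ax_year, ax_month) = plt.subplots(1, 2, figsize=(16, 6))

    sns.boxplot(data=data, x="year", y="value", ax=ax_year)
    ax_year.set_title("Year-wise Trend")
    ax_year.set_xlabel("Year")        # assumed label
    ax_year.set_ylabel("Page Views")  # assumed label

    # Fix the category order so months run Jan..Dec instead of data order.
    sns.boxplot(data=data, x="month", y="value", order=MONTH_ORDER, ax=ax_month)
    ax_month.set_title("Month-wise Seasonality")
    ax_month.set_xlabel("Month")       # assumed label
    ax_month.set_ylabel("Page Views")  # assumed label
    return fig
```

Passing `order=MONTH_ORDER` matters because the data starts in May, so without it the month axis would begin at May rather than January.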

🛠️ Development

🗋 Files

  • time_series_visualizer.py: Main script.
  • main.py: Testing script.
  • test_module.py: Unit tests.

🎧 Instructions

  1. Clone the repository.

    git clone https://github.com/Alogyn/data-analysis-with-python-projects
    cd data-analysis-with-python-projects/boilerplate-page-view-time-series-visualizer
  2. Install dependencies.

    pip install -r requirements.txt
  3. Run the script for development.

    python main.py
  4. Generate visualizations.

    • Ensure the fcc-forum-pageviews.csv file is in the working directory.
  5. Save each chart to an image file and return its figure.
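
The save-and-return step could be as small as the helper below. The name `save_and_return` and the example filename `line_plot.png` are hypothetical; the sketch only shows the pattern of writing a Matplotlib figure to disk and handing the same figure back to the caller.

```python
import matplotlib

matplotlib.use("Agg")  # headless backend so the sketch runs without a display
import matplotlib.pyplot as plt


def save_and_return(fig, filename):
    """Write the figure to disk, then hand it back to the caller."""
    fig.savefig(filename)
    return fig
```

A drawing function would typically end with something like `return save_and_return(fig, "line_plot.png")`, so that both the image file and the figure object are available afterwards.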


🔢 Testing

  • Use test_module.py to verify functionality.
  • Run tests with:
    python -m unittest test_module.py

🔄 Key Features

  • ✅ Cleaned Time Series Data: Outliers removed for accurate trends.
  • 📊 Three Types of Visualizations: Line, Bar, and Box Plots.
  • 🏆 Python Libraries: Leveraging Pandas, Matplotlib, and Seaborn.

🔧 Tools Used

  • Languages: Python
  • Libraries: Pandas, Matplotlib, Seaborn
  • Environment: Gitpod Starter Code

🛠️ Future Improvements

  • Add interactivity to the visualizations.
  • Expand the analysis to include more metrics.

🎥 Example Outputs

Line Plot

[Image: Line Plot]

Bar Chart

[Image: Bar Chart]

Box Plots

[Image: Box Plots]


License

This project is licensed under the MIT License.