The goal of ds4owdanalytics is to make dark data from the DS4OWD course publicly available.
You can install the development version of ds4owdanalytics from GitHub with:
# install.packages("devtools")
devtools::install_github("openwashdata/ds4owdanalytics")
## Run the following line in the console if these packages are not installed yet
## install.packages(c("dplyr", "knitr", "readr", "stringr", "gt", "kableExtra"))
library(dplyr)
library(knitr)
library(readr)
library(stringr)
library(gt)
library(kableExtra)
Alternatively, you can download the individual datasets as a CSV or XLSX file from the table below.
- Click Download CSV. A window opens that displays the CSV in your browser.
- Right-click anywhere inside the window and select “Save Page As…”.
- Save the file in a folder of your choice.
dataset | CSV | XLSX |
---|---|---|
course_participation | Download CSV | Download XLSX |
ds4owd_browser | Download CSV | Download XLSX |
ds4owd_country | Download CSV | Download XLSX |
ds4owd_source | Download CSV | Download XLSX |
ds4owd_timeseries | Download CSV | Download XLSX |
locations | Download CSV | Download XLSX |
postsurvey | Download CSV | Download XLSX |
presurvey | Download CSV | Download XLSX |
publishing_metadata | Download CSV | Download XLSX |
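Once a file is saved, it can be read back into R. The following is a minimal sketch, assuming the downloaded CSV has the same columns as the packaged dataset (not confirmed here). The file name `course_participation.csv` and the temporary location are hypothetical, and the three rows written to the file are copied from the `course_participation` preview further down in this README so that the example runs without a download.

```r
library(readr)

# Stand-in for a downloaded file (hypothetical name and location).
# Replace csv_path with the path where you saved the real CSV.
csv_path <- file.path(tempdir(), "course_participation.csv")
writeLines(
  c(
    "duration_minutes,guest,date,session,anonymized_name",
    "195,No,31.10.2023,1,def38455859ad75e499820e9747f215d",
    "163,Yes,31.10.2023,1,1eddec942ea3cca17a070e3fb1cfe61c",
    "159,Yes,31.10.2023,1,9d0f4c5a7802b0aa5e67c0c12e30ccde"
  ),
  csv_path
)

# Declare column types explicitly so that the date stays a character
# string, as documented for the packaged dataset (assumed to match).
participation <- read_csv(
  csv_path,
  col_types = cols(
    duration_minutes = col_double(),
    guest = col_character(),
    date = col_character(),
    session = col_double(),
    anonymized_name = col_character()
  )
)
```

Declaring `col_types` avoids relying on readr's type guessing, which may otherwise interpret the columns differently from the packaged version.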
The package provides access to multiple datasets. These are documented in the reference guide.
library(ds4owdanalytics)
The dataset course_participation contains anonymized data about participation in the online sessions of the DS4OWD course. It has 734 observations and 5 variables.
course_participation |>
head(3) |>
gt::gt() |>
gt::as_raw_html()
duration_minutes | guest | date | session | anonymized_name |
---|---|---|---|---|
195 | No | 31.10.2023 | 1 | def38455859ad75e499820e9747f215d |
163 | Yes | 31.10.2023 | 1 | 1eddec942ea3cca17a070e3fb1cfe61c |
159 | Yes | 31.10.2023 | 1 | 9d0f4c5a7802b0aa5e67c0c12e30ccde |
For an overview of the variable names, see the following table.
variable_name | variable_type | description |
---|---|---|
duration_minutes | numeric | Minutes spent in the session |
guest | character | Whether the user is a guest or not. The course instructor and their notetaker are not guests |
date | character | Date of the session |
session | numeric | Session number |
anonymized_name | character | Anonymized hash of the name as an identifier |
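Because `date` is stored as a character string, it usually needs to be converted before any time-based analysis. The sketch below assumes that every value follows the day.month.year pattern seen in the preview (for example `31.10.2023`), which is not verified for the full dataset. It uses a small data frame, `sample_rows`, built from the three preview rows. With the package loaded, `course_participation` could be used in its place. The summary column names (`participants`, `guests`, `mean_minutes`) are illustrative only.

```r
library(dplyr)

# Three rows copied from the preview above, used as a stand-in
# for the full course_participation dataset.
sample_rows <- data.frame(
  duration_minutes = c(195, 163, 159),
  guest = c("No", "Yes", "Yes"),
  date = c("31.10.2023", "31.10.2023", "31.10.2023"),
  session = c(1, 1, 1),
  anonymized_name = c(
    "def38455859ad75e499820e9747f215d",
    "1eddec942ea3cca17a070e3fb1cfe61c",
    "9d0f4c5a7802b0aa5e67c0c12e30ccde"
  )
)

session_summary <- sample_rows |>
  # Assumed format: day.month.year
  mutate(date = as.Date(date, format = "%d.%m.%Y")) |>
  group_by(session, date) |>
  summarise(
    participants = n_distinct(anonymized_name),  # unique hashed names
    guests = sum(guest == "Yes"),                # rows flagged as guest
    mean_minutes = mean(duration_minutes),       # average time in session
    .groups = "drop"
  )
```

Counting distinct values of `anonymized_name` rather than rows guards against a participant appearing more than once in the same session, should that occur in the full data.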
Example analyses of these datasets are published as a dashboard at https://openwashdata.github.io/dashboard/dashboard.html
The data are available under a CC-BY license.
Please cite this package using:
citation("ds4owdanalytics")
#> To cite package 'ds4owdanalytics' in publications use:
#>
#> Dubey Y (2025). "ds4owdanalytics: DS4OWD Analytics."
#> doi:10.5281/zenodo.14778963
#> <https://doi.org/10.5281/zenodo.14778963>,
#> <https://github.com/openwashdata/ds4owdanalytics>.
#>
#> A BibTeX entry for LaTeX users is
#>
#> @Misc{dubey:2025,
#> title = {ds4owdanalytics: DS4OWD Analytics},
#> author = {Yash Dubey},
#> year = {2025},
#> doi = {10.5281/zenodo.14778963},
#> url = {https://github.com/openwashdata/ds4owdanalytics},
#> abstract = {Data collected from the Data Science for Open WASH Data (DS4OWD) course and the associated website and data publishing endeavour.},
#> version = {0.1.0},
#> }