This repository parses the *.objml
XML files found in Chapter 4 of Model-Based Machine Learning into flat CSV files. The MBML book is an excellent source for designing probabilistic programs. Yet, with some datasets given as XML, each with a custom schema, parsing the datasets and learning from the book can add an unnecessary burden when working with the material.
The parsed objml
XML files are stored in data/
.
If you want to parse the data from the book's Github just run the get_data.py
script as python get_data.py save_data path/to/save
- Wanted to playaround with parsing XML files, dust off the cobwebbs, level up some skills
- Will eventually use this data for myself
- Inspired by Omniverse's YouTube video titled Model Based Machine Learning with Pyro, the code can be found here
I also have some notes on the Wiki tab 👀 ... shhh it's a secret 🤐