This is a collection of scripts using Beautifulsoup crawling the mutopia,musedata and freescore for data and inserting it to a db at google app engine via preomrgae.py.
Uses BeautifulSoup, Html5lib and urlgrabber.
This code is developed as a part of a project named Optical character recognition for structural information from high-quality scanned music which is part of the study credit for my two-years masters(cand.scient) at the University Of Copenhagen, Department of Computer Science
Other parts can be seen in
- PreOmr github - Code that uses Gamera and exploits whatever possible to try to remove text and dynamics.
- Diku ONR github. LaTeX source of the paper "driving" this code. (Work In Progress)
Based on the works of Brian Søborg and Kim Juncher which in turn based their work on the work of Johan Sejr Brinch Nielsen.