Downloading course material from https://ocw.mit.edu/ comes with a lot of bloat; this tool scrapes just the course material.

Rfauss/mit_scraper

mit_scraper

Downloading course material from https://ocw.mit.edu/ comes with a lot of bloat. This GUI uses the Flet framework to find and scrape just the course material.


Currently supported:

PDF files and ZIP archives.
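As an illustration of what "finding" the supported files could look like, here is a minimal sketch that filters the links on a page by extension. It is not taken from this repository's code: the names `MaterialLinkParser`, `find_material_links`, and `SUPPORTED_EXTENSIONS` are hypothetical, and it assumes the course page HTML has already been downloaded as a string.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse

# Assumption: only the two currently supported file types are of interest.
SUPPORTED_EXTENSIONS = (".pdf", ".zip")


class MaterialLinkParser(HTMLParser):
    """Collects absolute URLs of <a href=...> links that end in a supported extension."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if not href:
            return
        # Resolve relative links against the page URL.
        absolute = urljoin(self.base_url, href)
        # Compare against the URL path only, so query strings do not hide the extension.
        if urlparse(absolute).path.lower().endswith(SUPPORTED_EXTENSIONS):
            self.links.append(absolute)


def find_material_links(html, base_url):
    """Return all supported-file links found in the given HTML string."""
    parser = MaterialLinkParser(base_url)
    parser.feed(html)
    return parser.links
```

Supporting an additional format in a sketch like this would only require extending `SUPPORTED_EXTENSIONS`; the actual application may structure this differently.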


Planned Future Support:

MP4, JPEG, and plain text. Other file formats will be evaluated as necessary.



Prerequisites

- Python >= 3.10
- Python virtual environment manager (Anaconda recommended)
- pip
- Git
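The prerequisites above can be checked before installing. This is a minimal sketch using only the standard library, not part of the project: the function name `check_prerequisites` is hypothetical, and it assumes the tools are exposed on `PATH` under the names `pip`, `git`, and `conda`.

```python
import shutil
import sys


def check_prerequisites(min_version=(3, 10), tools=("pip", "git", "conda")):
    """Return a list of human-readable problems; an empty list means all checks passed."""
    problems = []
    # Compare only (major, minor) against the required minimum version.
    if sys.version_info[:2] < tuple(min_version):
        found = ".".join(str(part) for part in sys.version_info[:2])
        needed = ".".join(str(part) for part in min_version)
        problems.append(f"Python {needed}+ required, found {found}")
    # shutil.which returns None when an executable is not found on PATH.
    for tool in tools:
        if shutil.which(tool) is None:
            problems.append(f"'{tool}' not found on PATH")
    return problems


if __name__ == "__main__":
    issues = check_prerequisites()
    if issues:
        print("Missing prerequisites:")
        for issue in issues:
            print(" -", issue)
    else:
        print("All prerequisites found.")
```

If a different environment manager is used instead of Anaconda, `conda` can be dropped from the `tools` argument.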



Installation

Navigate to the folder you want to install into and clone the repository there:


git clone https://github.com/Rfauss/mit_scraper.git
cd mit_scraper

Create a new virtual environment (Anaconda):

conda create --name mit_scraper python=3.10

Activate the virtual environment (Anaconda):

conda activate mit_scraper

Install requirements:

pip install -r requirements.txt

Run the web scraper with:

python app.py
