Python-Web-Crawler-Threading

109-02 NCU CE3002B Operating System

tags: GitHub

Test Environment

  • Python 3.8.4
  • virtualenv 20.4.2
    • beautifulsoup4 4.9.3
    • certifi 2020.12.5
    • chardet 4.0.0
    • idna 2.10
    • Pillow 8.2.0
    • requests 2.25.1
    • soupsieve 2.2.1
    • urllib3 1.26.4
    • wget 3.2

Workflow

argv

-h: show this help message.
-b bookName: the book name as it appears in the URL.
-t threadNum: the maximum number of threads.
-p: (optional) enable merging the downloaded pages and converting them to PDF.
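The options above could be parsed roughly as follows. This is a minimal sketch using the standard-library `getopt` module, not necessarily how the project's script does it. The function name `parse_args`, the dictionary keys, and the default of one thread are assumptions made for illustration.

```python
import getopt


def parse_args(argv):
    """Parse -h, -b bookName, -t threadNum and -p from an argument list.

    Hypothetical helper: the key names and defaults below are assumptions,
    not taken from the project source.
    """
    # "b:" and "t:" take a value; "h" and "p" are plain switches.
    opts, _ = getopt.getopt(argv, "hb:t:p")

    config = {
        "help": False,
        "book_name": None,
        "thread_num": 1,   # assumed default when -t is omitted
        "to_pdf": False,
    }
    for flag, value in opts:
        if flag == "-h":
            config["help"] = True
        elif flag == "-b":
            config["book_name"] = value
        elif flag == "-t":
            config["thread_num"] = int(value)
        elif flag == "-p":
            config["to_pdf"] = True
    return config
```

In a script, this would typically be called as `parse_args(sys.argv[1:])`. An unknown option makes `getopt.getopt` raise `getopt.GetoptError`, which the caller can catch to print the help text.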