WBC archive served via HTTP API and as a web front-end application.
Docker Compose running the following:
- Manticore Search instance (this is a fork of Sphinx)
- Flask-powered app providing HTTP API
macbre/wbc
can fetch and convert DJVU files to XML format that can be indexed by SphinxSE.
Run the following:
docker-compose up -d sphinx
cd app && virtualenv env -p python3.8 && source env/bin/activate && pip install -e . && ./server_debug.sh
The local instance of wbc.macbre.net should be ready at http://0.0.0.0:8080/
Needs to be prefixed with
/api/v1
(e.g./api/v1/search?q=foo
)
List of all publications
Meta data of a given publication
Get all documents in a given issue
Get a given document
Get a given document in txt file format
Search within all publications
Return search suggestions
acme.sh --issue -d wbc.macbre.net --stateless --force
- get XML content from
http://s3.macbre.net/wbc/kronika_gazeta_wielkiego_ksiestwa.xml.gz
(indexed bymacbre/wbc
) - run
make index
to index XML file in sphinx
using config file '/opt/sphinx/conf/sphinx.conf'...
indexing index 'wbc'...
collected 11980 docs, 246.9 MB
sorted 35.1 Mhits, 100.0% done
total 11980 docs, 246858497 bytes
total 318.765 sec, 774419 bytes/sec, 37.58 docs/sec
total 97 reads, 1.865 sec, 2095.4 kb/call avg, 19.2 msec/call avg
total 1650 writes, 0.733 sec, 390.8 kb/call avg, 0.4 msec/call avg