- Project name: Information Retrieval System for European Union Legislation
- Short description: An information retrieval system built on Solr, with a custom frontend, for searching European Union legislation provided by EUR-Lex
- Environment: Browser
- Tools: Python, Solr, HTML/JS/CSS
- Institution: FEUP
- Course: PRI (Information Processing and Retrieval)
- Project grade: 19.2/20.0
- Group members:
The dataset file is 3.44 GB uncompressed and still ~400 MB when compressed, which exceeds the Moodle file size limit. The combined.csv data file is therefore available at https://drive.google.com/drive/folders/1aXdNni0DtbRXS6HDofQAXyDOJcAiB4Ld. Download combined.zip, unzip it, and move combined.csv to solr/data, so that the resulting file path is solr/data/combined.csv.
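The unzip-and-move step above can also be scripted. The following is only a minimal sketch: the function name `install_dataset` is hypothetical, and it assumes combined.zip has already been downloaded manually and contains a member named combined.csv (at the archive root or inside a subfolder).

```python
import shutil
import zipfile
from pathlib import Path


def install_dataset(zip_path, data_dir="solr/data", member_name="combined.csv"):
    """Extract member_name from zip_path and write it to data_dir/member_name.

    Hypothetical helper; the archive layout is an assumption, not something
    the project documents.
    """
    data_dir = Path(data_dir)
    data_dir.mkdir(parents=True, exist_ok=True)
    target = data_dir / member_name

    with zipfile.ZipFile(zip_path) as archive:
        # Accept the file at the archive root or inside a subfolder.
        matches = [n for n in archive.namelist() if Path(n).name == member_name]
        if not matches:
            raise FileNotFoundError(f"{member_name} not found in {zip_path}")
        # Stream the member to disk so the large CSV is never held in memory.
        with archive.open(matches[0]) as src, open(target, "wb") as dst:
            shutil.copyfileobj(src, dst)

    return target
```

Calling `install_dataset("combined.zip")` from the repository root would then produce solr/data/combined.csv.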
To get synonyms, go to the folder solr/synonyms and run make. This command works under Ubuntu 20.04.
Change the current directory to solr and run:
docker-compose up --build
To stop and remove the containers, run:
docker-compose down
Solr is available at port 8983 (localhost:8983), and the frontend is available at port 8000 (localhost:8000).
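To confirm that both services came up, a small reachability check may help. This is a sketch, not part of the project: the names `SERVICES`, `is_up`, and `check_services` are hypothetical, and the only details taken from the text above are the two ports. Any HTTP response, even an error status, is treated as "running".

```python
import urllib.error
import urllib.request

# Ports come from the README; the dictionary itself is illustrative.
SERVICES = {
    "Solr": "http://localhost:8983",
    "frontend": "http://localhost:8000",
}

# Bypass any system proxy settings, since both services run locally.
_opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))


def is_up(url, timeout=3.0):
    """Return True if url answers an HTTP request, False if it is unreachable."""
    try:
        with _opener.open(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server replied with an error status, so it is running.
        return True
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, DNS failure, and similar.
        return False


def check_services(services=SERVICES):
    """Map each service name to whether its URL is reachable."""
    return {name: is_up(url) for name, url in services.items()}
```

Running `print(check_services())` after `docker-compose up --build` should report both entries as `True` once the containers have finished starting.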