Thesis Research Repository

This repository contains experiments and proofs of concept for Edoardo Riggio's Master's thesis "API Scout: An Information Retrieval System for OpenAPI Specifications."

Repository Structure

.
├── .run
├── data
├── notebooks
├── out
│   ├── latex
│   ├── models
│   │   ├── universal-sentence-encoder
│   │   └── doc2vec.model
│   ├── pdfs
│   └── plots
└── proposal

Models

For the vectorization of the specifications, I've experimented with both the doc2vec model by gensim (trained on my data), and with the Universal Sentence Encoder model by Google (only used to vectorize the documents, no training was necessary).

Universal Sentence Encoder

The notebooks elasticsearch.ipynb and migration.ipynb require the "Universal Sentence Encoder" module from Google. This model can be downloaded by running the following command:

mkdir ./out/models/universal-sentence-encoder

curl -L "https://tfhub.dev/google/universal-sentence-encoder/2?tf-hub-format=compressed" | tar -zxvC ./out/models/universal-sentence-encoder

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
.run		.run
data		data
notebooks		notebooks
out		out
proposal		proposal
.gitignore		.gitignore
README.md		README.md
environment.yml		environment.yml
server.template.ini		server.template.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Thesis Research Repository

Repository Structure

Models

Universal Sentence Encoder

About

Languages

APIScout/research

Folders and files

Latest commit

History

Repository files navigation

Thesis Research Repository

Repository Structure

Models

Universal Sentence Encoder

About

Resources

Stars

Watchers

Forks

Languages