Sevi: Speech-to-Visualization through Neural Machine Translation

Sevi is an end-to-end data visualization system that acts as a virtual assistant to allow anyone to create visualizations through either natural language or speech.

Environment Setup

Python 3.6+
PyTorch 1.7
torchtext 0.8
ipyvega

Once Python and PyTorch are set up, install the remaining Python dependencies with pip install -r requirements.txt.
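To confirm that an environment matches the versions listed above before installing anything else, a small check like the following may help. It is a minimal sketch, not part of the repository: the helper names parse_version and check_environment are hypothetical, and comparing only the major and minor version numbers is an assumption about what "PyTorch 1.7" and "torchtext 0.8" require.

```python
import re
import sys
from importlib import import_module


def parse_version(text):
    """Turn a version string such as '1.7.1+cu110' into a tuple like (1, 7, 1)."""
    parts = []
    # Drop any local build suffix ('+cu110'), then read the leading digits of each piece.
    for piece in text.split("+")[0].split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        parts.append(int(match.group()))
    return tuple(parts)


def check_environment():
    """Report whether Python, torch, and torchtext match the versions in this README."""
    report = {"python": sys.version_info >= (3, 6)}
    # Assumption: only the major.minor pair has to match the listed version.
    for name, expected in (("torch", (1, 7)), ("torchtext", (0, 8))):
        try:
            module = import_module(name)
            report[name] = parse_version(module.__version__)[:2] == expected
        except ImportError:
            report[name] = False  # package is not installed
    return report


if __name__ == "__main__":
    for name, ok in check_environment().items():
        print(name, "OK" if ok else "missing or version mismatch")
```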

How to use?

Data preparation

[Optional] Run process_dataset.py in the preprocessing folder only if you change the train/dev/test.csv files under the ./dataset/ folder.

Running Example

Open Sevi.ipynb to try the running example.

NOTE: You need to configure the Google Speech-to-Text API before running the example. Alternatively, email the authors to request API keys.

Training

To re-train the model, run train.py.

Details

ncNet

Translating natural language (NL) queries into visualizations (NL2VIS) simplifies the creation of data visualizations: if it succeeds, anyone can generate visualizations from tabular data using natural language.

We present ncNet, a Transformer-based model for supporting NL2VIS, with several novel visualization-aware optimizations, including using attention-forcing to optimize the learning process, and visualization-aware rendering to produce better visualization results.

Input and Output

Input:

a tabular dataset (csv, json, or sqlite3)
a natural language query used for NL2VIS
an optional chart template
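The three tabular formats listed above can all be read with the Python standard library. The following is a minimal sketch of such a loader, not the repository's own loading code: the function name load_rows, the table_name parameter, the accepted file extensions, and the assumption that a JSON file holds a list of record objects are all illustrative choices.

```python
import csv
import json
import sqlite3
from contextlib import closing
from pathlib import Path


def load_rows(path, table_name=None):
    """Load a csv, json, or sqlite3 file into a list of row dictionaries."""
    suffix = Path(path).suffix.lower()
    if suffix == ".csv":
        # csv.DictReader keeps every value as a string.
        with open(path, newline="", encoding="utf-8") as handle:
            return [dict(row) for row in csv.DictReader(handle)]
    if suffix == ".json":
        # Assumption: the file contains a list of objects, one per row.
        with open(path, encoding="utf-8") as handle:
            return json.load(handle)
    if suffix in {".sqlite", ".sqlite3", ".db"}:
        if table_name is None:
            raise ValueError("table_name is required for SQLite input")
        with closing(sqlite3.connect(path)) as conn:
            conn.row_factory = sqlite3.Row
            # Quote the identifier so unusual table names are handled safely.
            quoted = '"' + table_name.replace('"', '""') + '"'
            rows = conn.execute(f"SELECT * FROM {quoted}").fetchall()
            return [dict(row) for row in rows]
    raise ValueError(f"unsupported input format: {suffix}")
```

Returning plain dictionaries keeps the sketch dependency-free; a real pipeline might prefer a DataFrame so that column types are inferred.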

Output:

Vega-Zero: a sequence-based, model-friendly grammar obtained by simplifying Vega-Lite
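To give a feel for what "sequence-based" means, the sketch below flattens a small nested Vega-Lite-style specification into a flat list of tokens, the kind of output a sequence-to-sequence model can emit one token at a time. It only illustrates the general idea: the function flatten_spec is hypothetical, and the token layout it produces is an assumption, not the actual Vega-Zero grammar, which is defined in the paper.

```python
def flatten_spec(spec):
    """Flatten a nested dictionary into a depth-first list of string tokens."""
    tokens = []
    for key, value in spec.items():
        tokens.append(key)
        if isinstance(value, dict):
            # Nested objects are expanded in place, so no brackets are needed.
            tokens.extend(flatten_spec(value))
        else:
            tokens.append(str(value))
    return tokens


# A small Vega-Lite-style fragment (data source omitted for brevity).
spec = {
    "mark": "bar",
    "encoding": {
        "x": {"field": "city", "type": "nominal"},
        "y": {"field": "sales", "type": "quantitative", "aggregate": "sum"},
    },
}

print(" ".join(flatten_spec(spec)))
# prints: mark bar encoding x field city type nominal y field sales type quantitative aggregate sum
```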

Please refer to our paper at IEEE VIS 2021 for more details.

License

The project is available under the MIT License.

Contact

If you have any questions, feel free to contact Jiawei Tang (23jtang [AT] asd.edu.qa) or Yuyu Luo (luoyy18 [AT] mails.tsinghua.edu.cn).
