Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

ent_ent_index.txt and ent_sent_index.txt files never created #5

Open
liusglz opened this issue Oct 2, 2023 · 0 comments
Open

ent_ent_index.txt and ent_sent_index.txt files never created #5

liusglz opened this issue Oct 2, 2023 · 0 comments

Comments

@liusglz
Copy link

liusglz commented Oct 2, 2023

Hi, I'm trying to get this to work but haven't being able to do so.

I was able to use the preprocessing tool (https://github.com/mickeysjm/HiExpan/tree/master/src/corpusProcessing) and I did got the sentences.json file along with this other files:

AutoPhrase_multi-words.txt
AutoPhrase_single-word.txt
AutoPhrase.txt
entity2freq.txt
entity2id.txt
entity2surface_names.txt
segmentation.txt

Then I go back to CoRel and try:

cd c
bash run_emb_part_tax.sh

But it fails because it can not find the following files:

ent_ent_index.txt
ent_sent_index.txt

I was able to run CoRel with the yelp dataset because I got the ent_ent_index.txt and ent_sent_index.txt files directly from the repo. But how can I create those files from my existing corpus?

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant