Given a paragraph of natural language text, we want to extract the susbstantial 'action-argument' sentences from text, giving the order by semantics meaning, in order to plan the automatic process to accomplish a new unkonwn task. So far we are able to give the possibility of 'action-argument' using LSTM, Pas Tagging techniques.
We made a web crawler to grab receipe data from websites, and keep the steps descriptions for cooking dishes.
- Run
python replacePrint.py
-> temp.txt (can be deleted), cookingTutorialCrawlEdit.csv are created - Use editor open cookingTutorialCrawlEdit.csv, delete unnecessary text before 'Step 1: ...' and save as stage1_CT.csv
- Use tool (MS Office, Libre Office) to open stage1_CT.csv
- Replace action verbs with #, action arguments with @, save as stage2_CT.csv
- Delete all the words except for # and @, save as mark_CT.csv