NLP Information Extraction for the easily bored

NLP/IE workshop for the Tucson Data Science meetup (6/30/2016)

Please fork this repository and follow along.

If you fork this repo and changes are made to this repository after that, you'll want to sync your fork.

If you clone your forked repo locally, here's how to keep your forked clone up-to-date:

git remote add upstream https://github.com/myedibleenso/nlp-for-the-easily-bored
# check for updates in myedibleenso/nlp...bored
git fetch upstream  
# checkout your own local master branch
git checkout master
# pull in latest changes from myedibleenso/nlp...bored to your local master
git merge upstream/master

NOTE: this is a work in progress. Check back later for updates...

slides / notebook
Is it a positive or negative review? If we don't have a score, can we identity the sentiment and assign a score based on the review text?
NOTE: To really get into this example, you'll need a rotten tomatoes developer key
Challenge: Predict critics consensus scores based only on the review text
- Use whatever method you want
  - feature-based classifier, latent feature model, etc.
- What works and why?

Installation

There a couple of things you'll need to run the notebooks in this repository...

Requirements

Java 8
2 or 3GB of RAM available for running the NLP server

Python dependencies via `conda`

conda create -n bored python=3
source activate bored
# assuming you're in the "nlp-for-the-easily-bored" directory
pip install -r requirements.txt

Running the notebooks

The notebooks are all under /notebooks

If you want to run/alter them locally after installing the project dependencies, simply run this command:

jupyter notebook

Resources

See resources.md for links to NLP datasets, free courses, etc.

Questions

Have a question? See the FAQ. It may have already been asked/answered.

Contributing

Thanks for the help! Take a look at contributing.md

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
notebooks		notebooks
resources		resources
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
contributing.md		contributing.md
faq.md		faq.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NLP Information Extraction for the easily bored

Table of Contents

Examples

Installation

Requirements

Python dependencies via `conda`

Running the notebooks

Resources

Questions

Contributing

About

Releases

Packages

Contributors 2

Languages

License

myedibleenso/nlp-for-the-easily-bored

Folders and files

Latest commit

History

Repository files navigation

NLP Information Extraction for the easily bored

Table of Contents

Examples

Installation

Requirements

Python dependencies via conda

Running the notebooks

Resources

Questions

Contributing

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Python dependencies via `conda`

Packages