This will download and process the Google Ngram data.
Updated Nov 29, 2022 (Jupyter Notebook)
Autocompletion based on frequency of word occurrence in English corpus.
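As a rough illustration of frequency-based autocompletion, the sketch below ranks the words that share a typed prefix by how often they occur. It is not taken from the repository: the function name `autocomplete`, the `limit` parameter, and the word counts are all hypothetical, and a real corpus would supply the counts.

```python
def autocomplete(prefix, word_counts, limit=3):
    """Return up to `limit` words starting with `prefix`, most frequent first."""
    matches = [word for word in word_counts if word.startswith(prefix)]
    # Sort by descending count; break ties alphabetically for stable output.
    matches.sort(key=lambda word: (-word_counts[word], word))
    return matches[:limit]


# Made-up counts for illustration only, not real Ngram figures.
word_counts = {"the": 500, "then": 120, "there": 200, "cat": 50}
suggestions = autocomplete("the", word_counts)
print(suggestions)
```

A linear scan is enough for a small vocabulary; a trie or a sorted word list with binary search would likely be a better fit at corpus scale.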
Language Modeling
Investigations into Evolutionary Linguistics using the Google Ngrams corpus. Sub-projects include Birth and Death of English Lexemes in Closed Lexical Classes, and Lexicon Size.
Generate names for anything using dictionary words
In this project, we designed a Map-Reduce system built on the AWS EMR (student) environment using Hadoop. The goal was to process big data (Hebrew literature from the Google N-Grams dataset) and calculate the probability P(W3 | W1, W2) for each word triplet (W1, W2, W3) observed in the data.
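The description does not say which estimator the project uses, so the following is only a single-machine sketch of one common choice, the maximum-likelihood estimate P(W3 | W1, W2) = count(W1, W2, W3) / count(W1, W2). The function name `trigram_probabilities` and the sample counts are hypothetical, and the code runs in plain Python rather than on Hadoop.

```python
from collections import Counter


def trigram_probabilities(trigram_counts):
    """Map each (w1, w2, w3) to count(w1, w2, w3) / total count of (w1, w2, *)."""
    # First pass: total occurrences of each (w1, w2) prefix across all w3.
    prefix_totals = Counter()
    for (w1, w2, _w3), count in trigram_counts.items():
        prefix_totals[(w1, w2)] += count

    # Second pass: divide each trigram count by its prefix total.
    return {
        trigram: count / prefix_totals[trigram[:2]]
        for trigram, count in trigram_counts.items()
    }


# Made-up counts for illustration only.
counts = {
    ("the", "quick", "fox"): 3,
    ("the", "quick", "dog"): 1,
    ("a", "lazy", "dog"): 2,
}
probs = trigram_probabilities(counts)
print(probs[("the", "quick", "fox")])
```

In a Map-Reduce setting the two passes would typically become separate jobs or a join keyed on the (W1, W2) prefix; smoothing for unseen triplets is left out here.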
A terminal game that parses JSON data from Google's Ngram viewer
A script to rank words by their relative popularity based on their occurrence in a large corpus of books over time.