lsqLoveCoding/PageRank-and-TF-IDF

BigData_Algorithm

Data source: https://www.kaggle.com/aashita/nyt-comments/home

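On Hadoop, use the TF-IDF algorithm to build a small search engine: the user enters a word or a sentence, and the engine returns a ranked list of comments.

On Hadoop, use the dataset's webURL field to implement a distributed PageRank algorithm.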
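
The two tasks can be illustrated with a minimal single-machine sketch in plain Python. It is not the repository's Hadoop code. It only mirrors the map and reduce steps in memory so the scoring logic is easy to follow. The function names (`tf_idf_index`, `search`, `pagerank`), the whitespace tokenizer, the damping factor of 0.85, and the shape of the link graph are all assumptions made for illustration. In particular, how edges are derived from the webURL field is not stated in the source, so `links` here is a hypothetical adjacency mapping.

```python
import math
from collections import Counter, defaultdict


def tf_idf_index(docs):
    """Build {term: {doc_id: tf-idf weight}} from {doc_id: text}."""
    # "Map" step: count terms per document (naive whitespace tokenizer, an assumption).
    term_counts = {d: Counter(text.lower().split()) for d, text in docs.items()}
    # "Reduce" step: document frequency, i.e. how many documents contain each term.
    doc_freq = Counter()
    for counts in term_counts.values():
        doc_freq.update(counts.keys())
    n_docs = len(docs)
    index = defaultdict(dict)
    for doc_id, counts in term_counts.items():
        total = sum(counts.values())
        for term, count in counts.items():
            tf = count / total                        # relative term frequency
            idf = math.log(n_docs / doc_freq[term])   # unsmoothed IDF
            index[term][doc_id] = tf * idf
    return index


def search(index, query):
    """Rank doc ids by the sum of tf-idf weights of the query terms."""
    scores = defaultdict(float)
    for term in query.lower().split():
        for doc_id, weight in index.get(term, {}).items():
            scores[doc_id] += weight
    # Highest score first; ties broken by doc id for a stable order.
    return sorted(scores, key=lambda d: (-scores[d], d))


def pagerank(links, damping=0.85, iterations=50):
    """Iterative PageRank over {node: [outgoing targets]} (hypothetical graph)."""
    nodes = set(links) | {t for targets in links.values() for t in targets}
    n = len(nodes)
    ranks = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        contrib = defaultdict(float)
        dangling = 0.0
        # "Map" step: each node sends rank / out-degree to every target.
        for node in nodes:
            targets = links.get(node, [])
            if targets:
                share = ranks[node] / len(targets)
                for target in targets:
                    contrib[target] += share
            else:
                dangling += ranks[node]  # nodes without out-links
        # "Reduce" step: sum contributions; dangling mass is spread evenly.
        ranks = {
            node: (1 - damping) / n + damping * (contrib[node] + dangling / n)
            for node in nodes
        }
    return ranks


if __name__ == "__main__":
    comments = {
        "c1": "great article on climate",
        "c2": "climate policy is hard",
        "c3": "great great recipe",
    }
    print(search(tf_idf_index(comments), "great recipe"))
    print(pagerank({"a": ["b", "c"], "b": ["c"], "c": ["a"]}))
```

In a Hadoop job, each "map" and "reduce" comment above would likely correspond to a separate mapper or reducer, and every PageRank iteration would typically be one MapReduce pass over the graph.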

About

MapReduce implementation of PageRank and TF-IDF
