From 6c2238f709ee8b4a7341016c6d04436011954a14 Mon Sep 17 00:00:00 2001 From: Emile Cornamusaz Date: Fri, 20 Dec 2024 21:02:39 +0100 Subject: [PATCH] =?UTF-8?q?d=C3=A9grad=C3=A9?= MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit --- README.md | 5 ++--- 1 file changed, 2 insertions(+), 3 deletions(-) diff --git a/README.md b/README.md index 7aae75d..5b91aa8 100644 --- a/README.md +++ b/README.md @@ -58,9 +58,8 @@ We tried another approach to detect unusual trends in name counts following a ke ### Name detection -To identify the main characters in our movies, we processed the plot_summaries.txt file, which contains plot summaries for 42,306 movies extracted from English-language Wikipedia. Each entry in the file follows a consistent structure: - -Wikipedia ID \t Plot Summary \n +To identify the main characters in our movies, we processed the plot_summaries.txt file, which contains plot summaries for 42,306 movies extracted from English-language Wikipedia. +Every line in the file represents a movie, with its wikipedia id and plot summary separated by a tabulation. Using this format, we extracted both the Wikipedia ID and the plot summary, linking each movie’s name to its corresponding Wikipedia ID and release year.