The Fake News Classification project consists of two models trained to recognize fake news using the LIAR dataset. This dataset contains 12.8K human-labeled short statements obtained from PolitiFact.com, with each statement rated for truthfulness by a PolitiFact.com editor. You can access the LIAR dataset here.
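For orientation, the statements can be loaded with the Hugging Face `datasets` library. This is a minimal sketch, not the project's loading code: the Hub id `"liar"` and the field names are assumptions worth verifying on the dataset card (the project may read a local copy instead).

```python
from datasets import load_dataset

# Assumed Hub id "liar"; newer `datasets` releases may require
# trust_remote_code=True for this script-based dataset.
liar = load_dataset("liar")

example = liar["train"][0]
print(example["statement"], example["label"])  # assumed field names
```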
Two models have been trained for this project:
- BERT model: a classifier trained by fine-tuning BERT.
- Random Forest model: a classifier trained with the Random Forest algorithm (see the sketch after this list).
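Since the repository's exact feature pipeline isn't shown here, the following is only a minimal sketch of how a Random Forest text classifier is commonly set up with scikit-learn and TF-IDF features; the project's actual preprocessing and hyperparameters may differ.

```python
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import make_pipeline

# Toy placeholder data standing in for LIAR statements and labels.
statements = [
    "Says the economy added two million jobs last year.",
    "Crime has reached a record high in every major city.",
]
labels = [1, 0]

# TF-IDF features feeding a Random Forest, a common baseline setup.
model = make_pipeline(TfidfVectorizer(), RandomForestClassifier(n_estimators=100))
model.fit(statements, labels)
print(model.predict(["Unemployment fell last quarter."]))
```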
The hyperparameters used for training have been optimized, and the training results can be found in the `docs/training_results` folder. After training, the models are compared with each other and with a model from Hugging Face: Jawaher/LIAR-fake-news-roberta-base.
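The comparison model can be loaded with the standard `transformers` pipeline API, as in the minimal sketch below; the label names this particular model returns are an assumption worth checking against its model card.

```python
from transformers import pipeline

# Load the published comparison model from the Hugging Face Hub.
classifier = pipeline(
    "text-classification",
    model="Jawaher/LIAR-fake-news-roberta-base",
)

# The output format (label names, score) depends on the model's config.
print(classifier("The economy added two million jobs last year."))
```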
To get started with the Fake News Classification project, follow these steps:
- Clone the repository:
  `$ git clone https://github.com/iame-uni-bonn/final-project-irem-y`
- Navigate to the project folder in your terminal.
- Create a Conda environment using the provided `environment.yml` file:
  `$ conda env create -f environment.yml`
- Activate the Conda environment:
  `$ conda activate final-project-irem-y`
Disclaimer: Training the models is time-intensive and can take multiple hours; the exact runtime depends on your hardware.
To use the project, follow these steps:
- Navigate to the project folder in your terminal.
- Run the following command:
  `$ python ./src/training_and_evaluate_functions/run_program.py`
- To adjust the training parameters, check the `src/training_and_evaluate_functions/run_program.py` file; a sketch of typical hyperparameters follows this list.
- For general training examples, refer to the `src/training_and_evaluate_functions/training_examples.py` file. There you can customize and train the models to your needs by adjusting the training parameters and commenting out the code you don't need.
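The project's own parameter interface lives in `run_program.py`, so the following is an illustration only: these are the kinds of hyperparameters typically exposed when fine-tuning BERT with the `transformers` Trainer API, and the exact names and values used in this project may differ.

```python
from transformers import TrainingArguments

# Illustrative values only, not the project's actual configuration.
training_args = TrainingArguments(
    output_dir="./results",          # where checkpoints are written
    num_train_epochs=3,              # fewer epochs shorten the long runtime
    per_device_train_batch_size=16,  # lower this if GPU memory is limited
    learning_rate=2e-5,              # a common starting point for BERT
    weight_decay=0.01,
)
```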
Due to the long training time, hyperparameter optimization of the models was difficult and their full potential could not be reached. Even so, compared with the Hugging Face model, both models achieved better results on all metrics (accuracy, precision, recall, and F1 score). A more stable environment could help to train the BERT model more efficiently.
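For completeness, the four comparison metrics can be computed with scikit-learn as sketched below. The macro averaging over LIAR's six truthfulness classes is an assumption, and `y_true`/`y_pred` are placeholders for the test labels and model predictions.

```python
from sklearn.metrics import accuracy_score, precision_recall_fscore_support

y_true = [0, 2, 5, 1, 3]   # placeholder gold labels
y_pred = [0, 2, 4, 1, 3]   # placeholder model predictions

accuracy = accuracy_score(y_true, y_pred)
precision, recall, f1, _ = precision_recall_fscore_support(
    y_true, y_pred, average="macro", zero_division=0
)
print(f"accuracy={accuracy:.3f} precision={precision:.3f} "
      f"recall={recall:.3f} f1={f1:.3f}")
```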