README

Dear Reader and Judge,

This document explains how to run the Protego code and how to access the prototype demo.

What is going on?

The "Protego_backend_alpha.py" script plays a role in the backend of PROTEGOs dashboard for trend analysis and reputation management.

The prototype demo can be seen at: http://www.robots.ox.ac.uk/~favour/protego

Machine Learning Algorithm

Method: GradientBoostingClassifier

Why: It performed the best in our parameter analysis.

Others Tested: AdaBoost, XGBoost

Why Abandoned: The were outperformed by GradientBoostingClassifier on various parameters in our analysis.

Insight: It is easy to get the performance up to 74% and difficult to get the performance alot higher. It is similarly hard to perform at more than 80% on the training set with the methods we attempted.

Our Assumptions: We restricted ourselves to usin the data given. We restricted ourselves to ML approaches that we can train in a short amount of time, e.g. Deep Learning is too GPU-computation heavy.

How to run the code

Open a python 3 environment with the following libraries installed: tqdm, sklearn, numpy, scipy, nltk
Run the python 3 file "Protego_backend_alpha.py" scans through the training data and evaluates on the test data.
This gives a model, which can classify the relationship between the header and the body of texts.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
features		features
fnc-1		fnc-1
splits		splits
utils		utils
.gitignore		.gitignore
Protego_backend_alpha.py		Protego_backend_alpha.py
README.md		README.md
feature_engineering.py		feature_engineering.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

README

What is going on?

Machine Learning Algorithm

How to run the code

About

Releases

Packages

Languages

fmnyikosa/protego

Folders and files

Latest commit

History

Repository files navigation

README

What is going on?

Machine Learning Algorithm

How to run the code

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages