Skip to content

This method is a solution to imbalanced data problem.

Notifications You must be signed in to change notification settings

AmiteshDTU/DST-ML-EkNN

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 

Repository files navigation

DST-ML-EkNN

This is a smart clustering method which relies on metric learning eg. Neighbourhood Component Analysis (NCA) to solve class imbalanced data problem. To know about it, please follow the link: https://link.springer.com/chapter/10.1007/978-981-15-3514-7_26. Cite our paper, if you are using it in your research work.

The citation of our paper is:

[1] Susan, Seba, and Amitesh Kumar. "DST-ML-EkNN: data space transformation with metric learning and elite k-nearest neighbor cluster formation for classification of imbalanced datasets." In Advances in Artificial Intelligence and Data Engineering, pp. 319-328. Springer, Singapore, 2021.

For reading free PDF, you can get it at: https://www.researchgate.net/publication/343657145_DST-ML-EkNN_Data_Space_Transformation_with_Metric_Learning_and_Elite_k-Nearest_Neighbor_Cluster_Formation_for_Classification_of_Imbalanced_Datasets

To know more about imbalanced data, please follow this link: https://onlinelibrary.wiley.com/doi/full/10.1002/eng2.12298. The citation of this paper is:

[2] Susan, Seba, and Amitesh Kumar. "The balancing trick: Optimized sampling of imbalanced datasets—A brief survey of the recent State of the Art." Engineering Reports 3, no. 4 (2021): e12298.

For reading free PDF, you can get it at: https://www.researchgate.net/publication/344781218_The_balancing_trick_Optimized_sampling_of_imbalanced_datasets-A_brief_survey_of_the_recent_State_of_the_Art

Important !!!!!

Caution: only files with *.arff extensions are used in this program.

Method of evaluation: We have used F-Score for evaluation purposse.

You have to provide training and testing datasets seprately.

First, execute NCA_Transformation.py to transform these datasets.

After that, execute DST_ML_EkNN.java over these transformed files.

Use, Java 9 and Python 3.6 with Eclipse IDE for execution.

About

This method is a solution to imbalanced data problem.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published