skn1998/Data-Analysis-Project

Data-Analysis-Project

KMeans Clustering on Cancer Data Set

Data Set Description:

  1. Number of Attributes: 10 plus the class attribute

  2. Attribute Information: (class attribute has been moved to last column)

    1. Sample code number: id number
    2. Clump Thickness: 1 - 10
    3. Uniformity of Cell Size: 1 - 10
    4. Uniformity of Cell Shape: 1 - 10
    5. Marginal Adhesion: 1 - 10
    6. Single Epithelial Cell Size: 1 - 10
    7. Bare Nuclei: 1 - 10
    8. Bland Chromatin: 1 - 10
    9. Normal Nucleoli: 1 - 10
    10. Mitoses: 1 - 10
    11. Class: (2 for benign, 4 for malignant)

  3. Missing attribute values: 16

    There are 16 instances in Groups 1 to 6 that contain a single missing (i.e., unavailable) attribute value, now denoted by "?".

  4. Class distribution:

    Benign: 458 (65.5%)
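
The "?" marker and the 2/4 class codes described above need handling before any clustering step. The following is a minimal sketch of one way to do that with the Python standard library. It assumes the file is comma-separated with columns in the order listed above. The column identifiers, the helper names, and the sample rows are all hypothetical and are not taken from the repository or the real data set. Dropping incomplete rows is only one option (imputation is another), and nothing here is confirmed to be the approach this project uses.

```python
import csv
import io
from collections import Counter

# Column identifiers are assumed; their order follows the attribute list above.
COLUMNS = [
    "sample_code_number", "clump_thickness", "uniformity_cell_size",
    "uniformity_cell_shape", "marginal_adhesion", "single_epithelial_cell_size",
    "bare_nuclei", "bland_chromatin", "normal_nucleoli", "mitoses", "class",
]

# Made-up rows for illustration only (not copied from the real data set).
SAMPLE = """\
1000001,5,1,1,1,2,1,3,1,1,2
1000002,8,10,10,8,7,10,9,7,1,4
1000003,6,6,6,9,6,?,7,8,1,2
1000004,4,1,1,3,2,1,3,1,1,2
"""


def load_rows(text):
    """Parse comma-separated rows; "?" becomes None, everything else an int."""
    rows = []
    for record in csv.reader(io.StringIO(text)):
        values = [None if v.strip() == "?" else int(v) for v in record]
        rows.append(dict(zip(COLUMNS, values)))
    return rows


def drop_incomplete(rows):
    """Keep only the rows that have no missing attribute value."""
    return [r for r in rows if None not in r.values()]


def class_distribution(rows):
    """Count rows per class, mapping the codes 2 and 4 to readable labels."""
    labels = {2: "benign", 4: "malignant"}
    return Counter(labels[r["class"]] for r in rows)


rows = load_rows(SAMPLE)
complete = drop_incomplete(rows)
print(len(rows) - len(complete), "row(s) with a missing value")
print(class_distribution(complete))
```

For the real file, the same functions could be applied to the file's text instead of `SAMPLE`. The sample code number is an identifier rather than a measurement, so it would normally be left out of the feature matrix passed to KMeans.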