Skip to content

The database was created with records of absenteeism at work from July 2007 to July 2010 at a courier company in Brazil. The objective here is to predict for each new individual, whether he is going to be absent for more than 3 hours or no (3 hours is the median for the absenteeism hours).

Notifications You must be signed in to change notification settings

Aziz-s99/absenteeism_analysis_classification

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

Dataset information

The database was created with records of absenteeism at work from July 2007 to July 2010 at a courier company in Brazil.

You can find the database as well as additional information at the link below.

Absenteeism at work - UC Irvine Machine Learning Repository

Attribute Information

  1. Individual identification (ID)

  2. Reason for absence (ICD).

Absences attested by the International Code of Diseases (ICD) stratified into 21 categories (I to XXI) as follows:

I Certain infectious and parasitic diseases  

II Neoplasms  

III Diseases of the blood and blood-forming organs and certain disorders involving the immune mechanism  

IV Endocrine, nutritional and metabolic diseases  

V Mental and behavioural disorders  

VI Diseases of the nervous system  

VII Diseases of the eye and adnexa  

VIII Diseases of the ear and mastoid process  

IX Diseases of the circulatory system  

X Diseases of the respiratory system  

XI Diseases of the digestive system  

XII Diseases of the skin and subcutaneous tissue  

XIII Diseases of the musculoskeletal system and connective tissue  

XIV Diseases of the genitourinary system  

XV Pregnancy, childbirth and the puerperium  

XVI Certain conditions originating in the perinatal period  

XVII Congenital malformations, deformations and chromosomal abnormalities  

XVIII Symptoms, signs and abnormal clinical and laboratory findings, not elsewhere classified  

XIX Injury, poisoning and certain other consequences of external causes  

XX External causes of morbidity and mortality  

XXI Factors influencing health status and contact with health services.

And 7 categories without (CID) patient follow-up (22), medical consultation (23), blood donation (24), laboratory examination (25), unjustified absence (26), physiotherapy (27), dental consultation (28).

  1. Month of absence

  2. Day of the week (Monday (2), Tuesday (3), Wednesday (4), Thursday (5), Friday (6))

  3. Seasons

  4. Transportation expense

  5. Distance from Residence to Work (kilometers)

  6. Service time

  7. Age

  8. Work load Average/day

  9. Hit target

  10. Disciplinary failure (yes=1; no=0)

  11. Education (high school (1), graduate (2), postgraduate (3), master and doctor (4))

  12. Son (number of children)

  13. Social drinker (yes=1; no=0)

  14. Social smoker (yes=1; no=0)

  15. Pet (number of pet)

  16. Weight

  17. Height

  18. Body mass index

  19. Absenteeism time in hours (variable that will be used to build the categorical target)

About

The database was created with records of absenteeism at work from July 2007 to July 2010 at a courier company in Brazil. The objective here is to predict for each new individual, whether he is going to be absent for more than 3 hours or no (3 hours is the median for the absenteeism hours).

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages