Skip to content

mvoggu/bphc-timetable

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

bphc-timetable

Scripts and static assets related to parsing timetable pdf of bphc

Instructions

  • Adjust variables like path/url to pdf, start & end page numbers, area for tabula, columns to parse in pdf2json.py
  • Ensure you have a Java runtime and set the PATH for it
  • pip install -r requirements.txt
  • python3 pdf2json.py

Note

Lookout for the following while parsing the output json:

  • null values in midsem_date , compre_date in courses
  • empty lists for days, hours in sections