Two methods to extract data from tabular structures contained in images. Methods attempt to preserve structure by using tesseract-generated bounding boxes. Method 1 was a preliminary method that only works for tables with a very defined structure and only single-lined headers. Method 2 may work for any table structure, though it is entirely dependent on the perfect accuracy of the OCR.
-
Notifications
You must be signed in to change notification settings - Fork 0
Table Extraction Methods in R
License
1carvercoleman/tabulator
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
About
Table Extraction Methods in R
Resources
License
Stars
Watchers
Forks
Releases
No releases published