Skip to content

Latest commit

 

History

History
21 lines (10 loc) · 755 Bytes

README.md

File metadata and controls

21 lines (10 loc) · 755 Bytes

File Converter (to PDF format)

Convert your documents in pdf format and extract information from them. Supports many extension like docs, docx, rtf etc

How it works?

  1. open the main folder and run the command pip install -r requirements.txt to install required packages

  2. run the command python3 app.py

  3. open the local host server and follow the given instructions there.

Note: the converted files will be in uploads folder of the app directory

you can play around with the code to extract information from the document

Currently this app supports file conversion and extracting emails and mobile numbers from the converted document and put them in a csv file and show the output. Change the regex for extracting other information.