Scrape PDF data for key attributes to load into Apple Numbers Document

cameron-lafreniere/home-search

home-search

The goal is to aggregate, analyze, and display data for home listings.

Currently, the scraper parses PDFs and stores the results as JSON, which can then be fed into a data lake as semi-structured data.
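As a rough illustration of the PDF-to-JSON step, here is a minimal sketch. It is not taken from pdf_scraper.py: the function names, the record layout, and the use of the third-party pypdf package for text extraction are all assumptions made for the example.

```python
import json
from pathlib import Path


def extract_page_text(pdf_path):
    """Return one text string per page (assumes the third-party pypdf package)."""
    # Assumption: pdf_scraper.py may use a different PDF library.
    from pypdf import PdfReader

    reader = PdfReader(pdf_path)
    # Image-only pages may yield no text, so fall back to an empty string.
    return [page.extract_text() or "" for page in reader.pages]


def build_record(source_name, page_texts):
    """Wrap raw page text in a semi-structured record (hypothetical layout)."""
    return {
        "source": source_name,
        "page_count": len(page_texts),
        "pages": [
            {"number": i, "text": text}
            for i, text in enumerate(page_texts, start=1)
        ],
    }


def write_record(record, out_dir):
    """Write the record as <source stem>.json and return the output path."""
    out_path = Path(out_dir) / (Path(record["source"]).stem + ".json")
    out_path.write_text(json.dumps(record, indent=2), encoding="utf-8")
    return out_path
```

Keeping the raw page text in the record, rather than extracting attributes up front, suits a data lake: the JSON stays close to the source and can be re-parsed later as the attribute rules change.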

The next step is to parse the JSON for key attributes, then extract, transform, and load (ETL) them into a database.
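One possible shape for that planned step is sketched below. Everything here is an assumption for illustration: the input is taken to be a JSON-derived dict with a `pages` list of `{"number", "text"}` entries, the attributes (`price`, `beds`, `baths`) and their regex patterns are hypothetical, and SQLite from the Python standard library stands in for whichever database the project ends up using.

```python
import re
import sqlite3

# Hypothetical patterns; real listing PDFs will need their own.
PATTERNS = {
    "price": re.compile(r"\$\s*([\d,]+)"),
    "beds": re.compile(r"(\d+)\s*(?:bed|bd)", re.IGNORECASE),
    "baths": re.compile(r"(\d+(?:\.\d+)?)\s*(?:bath|ba)", re.IGNORECASE),
}


def extract_attributes(record):
    """Extract: take the first match of each pattern from the page text."""
    text = "\n".join(page["text"] for page in record["pages"])
    found = {}
    for name, pattern in PATTERNS.items():
        match = pattern.search(text)
        found[name] = match.group(1) if match else None
    return found


def transform(raw):
    """Transform: convert matched strings to numbers, keeping None for misses."""
    return {
        "price": int(raw["price"].replace(",", "")) if raw["price"] else None,
        "beds": int(raw["beds"]) if raw["beds"] else None,
        "baths": float(raw["baths"]) if raw["baths"] else None,
    }


def load(conn, source, row):
    """Load: insert one listing row into a SQLite table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS listings "
        "(source TEXT, price INTEGER, beds INTEGER, baths REAL)"
    )
    conn.execute(
        "INSERT INTO listings VALUES (?, ?, ?, ?)",
        (source, row["price"], row["beds"], row["baths"]),
    )
    conn.commit()
```

A record read back with `json.load` could then be passed through `extract_attributes`, `transform`, and `load` in turn, with `sqlite3.connect(":memory:")` as a throwaway connection while experimenting.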

Usage

python3 pdf_scraper.py <path/to/file.pdf>
