Skip to content

Latest commit

 

History

History
13 lines (8 loc) · 383 Bytes

README.md

File metadata and controls

13 lines (8 loc) · 383 Bytes

home-search

Planning to allow Aggregating, Analyzing, and Displaying Data for Home Listings.

Currently able to parse pdfs and store in json format to then allow as input to a Data Lake as semi-structured data.

Next steps will be to parse json for key attributes to extract, transform, and load into a database.

Usage

python3 pdf_scraper.py <path/to/file.pdf>