community[minor]: 03 - Refactoring PyPDF parser #29330
…3-pypdf
# Conflicts:
#   docs/docs/integrations/document_loaders/pymupdf.ipynb
#   docs/docs/integrations/document_loaders/pypdfloader.ipynb
#   libs/community/langchain_community/document_loaders/parsers/images.py
#   libs/community/langchain_community/document_loaders/parsers/pdf.py
#   libs/community/langchain_community/document_loaders/pdf.py
#   libs/community/tests/integration_tests/document_loaders/parsers/test_images.py
#   libs/community/tests/integration_tests/document_loaders/parsers/test_pdf_parsers.py
#   libs/community/tests/integration_tests/document_loaders/test_pdf.py
# Conflicts:
#   libs/community/langchain_community/document_loaders/parsers/images.py
#   libs/community/langchain_community/document_loaders/parsers/pdf.py
#   libs/community/langchain_community/document_loaders/pdf.py
#   libs/community/tests/integration_tests/document_loaders/parsers/test_pdf_parsers.py
#   libs/community/tests/integration_tests/document_loaders/test_pdf.py
#   libs/community/tests/unit_tests/document_loaders/parsers/test_pdf_parsers.py
b434d66 to ece53fa
ece53fa to a7fa2b6
@eyurtsev can you review this code? I think you'll get through it the fastest, since it's a continuation of the previous one. I'd also like to thank you for our exchanges, which have given LangChain a solid foundation.
# Conflicts:
#   libs/community/langchain_community/document_loaders/parsers/pdf.py
@eyurtsev can you review this code?
@pprados PR looks good.
The main issue is just clearing up documentation for `plain` vs. `layout`. Once that's updated, we can merge!
This is one part of a larger pull request (PR) that was too large to submit all at once.
This part focuses on updating the PyPDF parser.
For more details, see PR 28970.