Unwanted vertical lines appear #1225
yzt2024
started this conversation in
Ask for help with specific PDFs
Replies: 1 comment 1 reply
-
I would suggest examining the vertical lines / rects ( |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello everyone,
I'm trying to export the table from the following pdf, but there are vertical lines that doesn't exist in the original text when the table is divided. I tried to modify snap_tolerance but did not get the expected division result. My parameters are as follows:
table_setting={
"vertical_strategy": "lines",
"horizontal_strategy": "lines",
"snap_tolerance": 3,
"snap_tolerance": 3,
"join_tolerance": 3,
"edge_min_length": 3,
"min_words_vertical": 20,
"min_words_horizontal": 3,
"text_tolerance": 3,
"intersection_tolerance": 1,
}
new.pdf
if i use a snap_tolerance=4,then the division of the left boundary will be problematic.
So what am I supposed to do? I see from the discussion that boundaries can be determined manually by use the "explicit_vertical_lines", I haven't tried yet. I want to know if this parameter needs to specify all vertical lines, because I actually only need to modify the leftmost boundary, can I modify only one line and the other vertical lines are still automatically generated? What should I do? Thank you very much!
Beta Was this translation helpful? Give feedback.
All reactions