-
Notifications
You must be signed in to change notification settings - Fork 1.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Updated prompt to extract text and format it in Markdown, including a… #200
base: main
Are you sure you want to change the base?
Updated prompt to extract text and format it in Markdown, including a… #200
Conversation
…dditional visual details, instead of only describing the image.
@microsoft-github-policy-service agree |
Hi @dzemeuksis, """Convert this image into a structured markdown representation that preserves its data and relationships. Follow these conversion guidelines based on content type: For Tables:
For Mathematical Formulas: For Charts and Graphs:
For Flowcharts and Diagrams: graph LR
A-->B
B-->C
For Process Flows: For Technical Diagrams:
Additional Guidelines:
Let me know if you see it as reasonable to push it together. Petr |
@PetrAPConsulting , that sounds great! What do you think about combining our proposals into something like this:
|
@dzemeuksis I suppose you proposed prompt is fine but if you would agree I would extend it in some content types and keept example of mermaid. But again, it's up to you. Analyze the image and extract all visible text in the original language. Reproduce the extracted text in a structured Markdown format, preserving any formatting such as headings, bullet points, and highlights. Follow these additional guidelines based on the content type: Tables:
Mathematical Formulas:
Charts and Graphs:
Flowcharts and Diagrams:
graph LR
A-->B
B-->C
For any visual elements that cannot be represented directly in Markdown, describe them in plain text under a section titled "Visual Notes." Maintain numerical precision exactly as shown, preserve all labels and annotations as markdown text, and structure the output for both human and machine readability. Output only the converted Markdown text without any additional commentary or explanations. |
only one comment, to keep price of conversion reasonable pictures should not be bigger than ~1000x1000 px |
Updated prompt to extract text and format it in Markdown, including additional visual details, instead of only describing the image.
The markitdown module is designed to extract text from various documents and save it in Markdown format, as stated in its purpose. This change aligns the default behavior of image processing with the overall goal of the module.
Previously, providing an image resulted in a plain text description, which likely did not meet user expectations. Users are more likely to expect extracted text and formatting when supplying an image, making this change a better fit for the module's intended functionality.