
Ability to prompt for json output ? #37

Open
snimavat opened this issue Sep 13, 2024 · 3 comments · May be fixed by #40

Comments

@snimavat
snimavat commented Sep 13, 2024

The base seems to be there. It would be great if there were flexibility to provide/extend the prompt and expect back a JSON response.
I would like to send a PDF that has structure and have GPT convert it to a specific JSON format; it does this very well.
So instead of coding it myself, I would rather use/extend zerox if it offered flexibility in the output format and the ability to customize the prompt.

@pradhyumna85
Contributor

@snimavat The Python API has an option to override the system prompt; refer to the readme.

That said, zerox still applies some post-processing/cleanup to the output from the vision model.

So in your custom system prompt, I would ask the model to output your required JSON but still encapsulate it in a ```json markdown block (so that it doesn't result in unexpected behaviour). You can then trim the fence out yourself and convert the JSON text to, say, a Python dict using the json.loads function.
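The trimming step described above can be sketched as follows (a minimal example; the exact shape of the model's output depends on your prompt, and `extract_json` is just an illustrative helper name):

```python
import json
import re

def extract_json(model_output: str) -> dict:
    """Pull the payload out of a ```json fenced block and parse it.

    Falls back to parsing the whole string if no fence is present.
    """
    match = re.search(r"```json\s*(.*?)\s*```", model_output, re.DOTALL)
    payload = match.group(1) if match else model_output.strip()
    return json.loads(payload)

# Example of the kind of text a vision model might return:
output = 'Here is the result:\n```json\n{"invoice": "123", "total": 42.5}\n```'
data = extract_json(output)
print(data["invoice"])  # → 123
```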

@tylermaran, this issue can be closed.

@snimavat
Author

snimavat commented Sep 14, 2024

@pradhyumna85 thanks for the response, but I don't think the issue should be closed.

Not keeping zerox tied to markdown only would increase the utility of this helpful project and broaden its usage.

There's no reason why zerox should remain tied to markdown. It could be a little OCR framework that makes it easy to run OCR through an LLM and get the output in the format of the user's choice. It could support markdown/JSON out of the box, but also expose the raw output text so the user can post-process it however they like.

Zerox as a general-purpose LLM-based OCR library would be more attractive than zerox as an OCR-to-markdown library.

@pradhyumna85
Contributor

pradhyumna85 commented Sep 14, 2024

@snimavat Makes sense. Adding a flag to the zerox API to skip markdown-based post-processing (the format_markdown function) could be done in case the user is interested in the raw output, in conjunction with a custom system prompt. I can raise a PR for this.
@tylermaran, @annapo23 any thoughts on how to proceed?

Edit:
@tylermaran, @annapo23, I have raised PR #40 for your review.

pradhyumna85 pushed a commit to pradhyumna85/zerox that referenced this issue Sep 15, 2024
- added post_process_function param to override/skip Zerox's default format_markdown post-processing on the model's text output.
- removed output_dir param and added output_file_path, which is more flexible for arbitrary file extensions.
- added page_separator param (used when writing the consolidated output to output_file_path).
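As a rough illustration of the post_process_function idea from the PR description above: a custom post-processor could replace the default markdown cleanup with a step that strips the ```json fence and keeps only the raw JSON text. Note that the str → str signature and the keyword arguments shown in the usage comment are assumptions based on the PR description, not a confirmed API.

```python
import re

def json_post_process(model_text: str) -> str:
    """Hypothetical post-processor for PR #40's post_process_function param.

    Strips a ```json fence and returns the raw JSON text; the str -> str
    signature is an assumption based on the PR description.
    """
    match = re.search(r"```json\s*(.*?)\s*```", model_text, re.DOTALL)
    return match.group(1) if match else model_text.strip()

# Hypothetical usage (parameter names are assumptions from the PR description):
# result = await zerox(file_path="invoice.pdf",
#                      post_process_function=json_post_process,
#                      output_file_path="invoice.json")
```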