Hey guys,
I'm trying to use the "convert-pdf" preprocessor without success:
curl -X POST -H "Content-Type: application/json" -d '{"img_url":"https://jeroen.github.io/images/ocrscan.pdf","engine":"tesseract", "preprocessors":["convert-pdf"]}' http://myhost:9292/ocr
Any clue?
Thanks a lot!
The logs are:
rabbitmq_1 | =INFO REPORT==== 10-Mar-2021::04:33:08 ===
rabbitmq_1 | accepting AMQP connection <0.3778.0> (172.22.0.5:39744 -> 172.22.0.2:5672)
openocr_1 | 04:33:08.579373 OCR_CLIENT: callbackQueue name: amq.gen-Aj41Hj-a4TCFpUVKkPpSrA
openocr_1 | 04:33:08.580361 OCR_CLIENT: looping over deliveries..
openocr_1 | 04:33:09.556468 OCR_CLIENT: ocrRequest before: ImgUrl: , EngineType: ENGINE_TESSERACT, Preprocessors: [convert-pdf]
openocr_1 | 04:33:09.556504 OCR_CLIENT: publishing with routing key "convert-pdf"
openocr_1 | 04:33:09.556516 OCR_CLIENT: ocrRequest after: ImgUrl: , EngineType: ENGINE_TESSERACT, Preprocessors: []
openocr_1 | 04:35:09.563580 ERROR: Timeout waiting for RPC response -- open-ocr.HandleOcrRequest() at ocr_http_handler.go:80
openocr_1 | 04:35:09.563630 ERROR: Unable to perform OCR decode. Error: Timeout waiting for RPC response -- open-ocr.(*OcrHttpHandler).ServeHTTP() at ocr_http_handler.go:40
rabbitmq_1 |
rabbitmq_1 | =INFO REPORT==== 10-Mar-2021::04:35:09 ===
rabbitmq_1 | closing AMQP connection <0.3778.0> (172.22.0.5:39744 -> 172.22.0.2:5672)
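In case it helps anyone reproduce this from a script, here is a minimal sketch of the same request as the curl command above, written in Python with only the standard library. The endpoint, PDF URL, engine and preprocessor are copied from that command; the function names and the 130-second client timeout are my own assumptions (the timeout is chosen only so the client waits a little longer than the roughly two minutes between the publish and the RPC timeout in the logs).

```python
import json
import urllib.error
import urllib.request

# Values copied from the curl command; "myhost" is a placeholder host.
OCR_ENDPOINT = "http://myhost:9292/ocr"
PDF_URL = "https://jeroen.github.io/images/ocrscan.pdf"


def build_ocr_request(endpoint, img_url, engine="tesseract",
                      preprocessors=("convert-pdf",)):
    """Build the same JSON POST request that the curl command sends.

    Hypothetical helper for illustration; not part of open-ocr.
    """
    payload = {
        "img_url": img_url,
        "engine": engine,
        "preprocessors": list(preprocessors),
    }
    return urllib.request.Request(
        endpoint,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def run_ocr(request, timeout_seconds=130):
    """Send the request and return (HTTP status, response body).

    timeout_seconds is an assumed value, not an open-ocr setting.
    """
    try:
        with urllib.request.urlopen(request, timeout=timeout_seconds) as response:
            return response.status, response.read().decode("utf-8")
    except urllib.error.HTTPError as err:
        # Non-2xx responses still carry a body worth printing.
        return err.code, err.read().decode("utf-8")


if __name__ == "__main__":
    status, body = run_ocr(build_ocr_request(OCR_ENDPOINT, PDF_URL))
    print(status)
    print(body)
```

Dropping "convert-pdf" from `preprocessors` and pointing `img_url` at a plain image gives a quick way to check whether only the preprocessor route times out.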