feature: Arrow table input/output #4119

judahrand · 2023-08-15T08:30:29Z

Feature request

I think that it would be great to add Arrow Tables as an IO type for BentoML endpoints. This would be particularly beneficial for the GRPC server where the Arrow IPC format (not Parquet) could be used directly by dumping the data in the serialized_bytes field of the Protobuf message.

Motivation

Parquet is currently used to move Pandas DataFrames around in BentoML and is a great storage format but it doesn't maintain all of the great properties of the in-memory Arrow format (because it is designed as an on-disk format) like strict register alignment. It maaay reduce on-the-wire data size but will almost certain increase serialization/deserialization time.

I believe that this addition would:

reduce serialization/deserialization latency
allow for the easy use of other tools within the Arrow ecosystem (Polars, Datafusion, DuckDB, etc etc.)

Other

No response

The text was updated successfully, but these errors were encountered:

parano · 2023-10-31T17:44:27Z

Hi @judahrand - we are working on a new iteration of IO Descriptor in BentoML and it will come with Arrow support! cc @frostming

judahrand · 2023-10-31T18:29:16Z

Does the code that's in development exist somewhere? I'd be interested in having a read.

frostming · 2023-11-01T13:10:52Z

Does the code that's in development exist somewhere? I'd be interested in having a read.

Sure, #4240

judahrand · 2024-03-05T15:23:02Z

@parano Did Arrow support ever get added?

judahrand added the enhancement Enhancement proposals label Aug 15, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feature: Arrow table input/output #4119

feature: Arrow table input/output #4119

judahrand commented Aug 15, 2023 •

edited

Loading

parano commented Oct 31, 2023

judahrand commented Oct 31, 2023

frostming commented Nov 1, 2023

judahrand commented Mar 5, 2024

feature: Arrow table input/output #4119

feature: Arrow table input/output #4119

Comments

judahrand commented Aug 15, 2023 • edited Loading

Feature request

Motivation

Other

parano commented Oct 31, 2023

judahrand commented Oct 31, 2023

frostming commented Nov 1, 2023

judahrand commented Mar 5, 2024

judahrand commented Aug 15, 2023 •

edited

Loading