Contribute Media
A thank you to everyone who makes this possible: Read More

Hyperrest: A new Apache Arrow API For High Performance Data Access in Pandas

Description

Pandas is one of the most popular data analytics frameworks for Python, and is widely used in machine learning applications. Pandas provides access to many data formats through a relatively slow ODBC interface. We will review performance benchmarks using Arrow with Pandas, and demonstrate a new API for Arrow called Hyperrest implemented in Dremio, a new open source project for Data Fabric.

Details

Improve this page