Apache DataFusion SQL Query Engine
Updated Jun 28, 2024 - Rust
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
Kotlin studies. Includes Project Arrow, Kotlin Coroutines, Flows and More
MongoDB integrations for Apache Arrow. Export MongoDB documents to numpy array, parquet files, and pandas dataframes in one line of code.
KNIME Python Integration
A public transport declaration calculator. It is used to automatically filter out all relevant travelling segments when travelling with a general public transport card. It receives a PDF file as an input and generates a CSV file with the results as Local Date / Complete value.
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
Apache DataFusion JVM User Defined Functions (UDF), the integration nobody asked for 😀
A toolkit for working with compressed Arrow in-memory, on-disk, and over-the-wire
Apache DataFusion Ballista Distributed Query Engine
A note manager application that implements a search engine for storytelling texts. It feeds a UI, and the idea is to show how we can use Arrow's benefits to create a more robust, resilient, back-pressure-resistant, innovative application with support for high capacity and high availability.
Exon is an OLAP query engine specifically for biology and life science applications.
Spark ClickHouse Connector built on the DataSourceV2 API
LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.
Λrrow - Functional companion to Kotlin's Standard Library