DuckDB is an in-process SQL OLAP database management system. It is fast even on gigabytes of data on a local machine. Fugue integrates deeply with DuckDB: it not only uses DuckDB as the SQL engine, but also implemented all execution engine met
Polars is a Rust-based DataFrame library that supports multi-threaded and out-of-core operations. The performance of Polars is already very good on a local machine, so the focus of the Fugue-Polars integration is scaling out to a cluster. Fugue also has t
Fugue provides an easier interface for using distributed compute effectively and accelerates big data projects. It does this by minimizing the amount of code you need to write, in addition to taking care of tricks
Prefect is an open-source workflow orchestration framework designed for the modern data stack. The prefect-fugue collection allows users to prototype their code locally, and then execute it on a distributed computing cluster (Spark, Dask, Ray) when production rea
This is a short introduction to FugueSQL geared toward new users. FugueSQL is the SQL interface for Fugue. The Fugue project aims to make big data effortless by accelerating iteration speed and providing a simpler interface for users to utilize distributed computin
This page is divided into three parts: setup, SQL usage, and Python usage of the BigQuery integration.
The most common way is to use the Make sure the GOOGLE_APPLICATION_CREDENTIALS environment variable is set to the path of a credential file. You can also read the of BigQuery. If you are using a distributed cluster with Spark, Dask, or Ray, all of the machines in the cluster need to be a
DataComPy is an open-source package by Capital One that started as a way to compare two Pandas DataFrames with more functionality than Pandas.DataFrame.equals(Pandas.DataFrame). It allows users to specify tolerances and prints out statistics.
The Ibis project tries to bridge the gap between local Python and various backends, including distributed systems such as Spark and Dask. The main idea is to create a Pythonic interface to express SQL semantics, so the expression is agnostic to the backends.