Search The Internet

Showing search results from fugue-tutorials.readthedocs.io.
https://fugue-tutorials.readthedocs.io/tutorials/integrations/backends/duckdb.html

Fugue with DuckDB — Fugue Tutorials

DuckDB is an in-process SQL OLAP database management system. The speed is very good on even gigabytes of data on local machines. Fugue has a deep integration with DuckDB. Fugue not only uses DuckDB as the SQL engine, but also implemented all execution engine met

Js ⢚⡟⠀⠠⣙⣿⠀⠀
Terms appear in 24 positions

https://fugue-tutorials.readthedocs.io/tutorials/integrations/backends/polars.html

Fugue with Polars — Fugue Tutorials

Polars is a Rust-based DataFrame library that supports multi-threaded and out-of-core operations. The performance of Polars is already very good on a local machine, so the focus of the Fugue-Polars integration is scaling out to a cluster. Fugue also has t

Js ⠀⡀⢸⠀⠀⠀⠀⠀
Terms appear in 5 positions

https://fugue-tutorials.readthedocs.io/index.html

Welcome to the Fugue Tutorials! — Fugue Tutorials

Have questions? Chat with us on GitHub or Slack. Fugue provides an easier interface to using distributed compute effectively and accelerates big data projects. It does this by minimizing the amount of code you need to write, in addition to taking care of tricks

Js ⠀⠀⡂⠁⠀⠀⠀⠀
Terms appear in 3 positions

https://fugue-tutorials.readthedocs.io/tutorials/integrations/ecosystem/prefect.html

Prefect — Fugue Tutorials

Prefect is an open-source workflow orchestration framework designed for the modern data stack. The prefect-fugue collection allows users to prototype their code locally, and then execute it on a distributed computing cluster (Spark, Dask, Ray) when production rea

Js ⠀⠀⠁⠀⠀⡄⠂⠀
Terms appear in 4 positions

https://fugue-tutorials.readthedocs.io/tutorials/quick_look/ten_minutes_sql.html

FugueSQL in 10 Minutes — Fugue Tutorials

This is a short introduction to FugueSQL geared for new users. FugueSQL is the SQL interface for Fugue. The Fugue project aims to make big data effortless by accelerating iteration speed and providing a simpler interface for users to utilize distributed computin

Js ⠀⡀⠀⠀⠆⠀⠀⠀
Terms appear in 3 positions

https://fugue-tutorials.readthedocs.io/tutorials/integrations/warehouses/trino.html

Fugue with Trino — Fugue Tutorials

This page is divided into three parts: setup, SQL usage, and Python usage of the BigQuery integration.

Js ⠀⠀⠄⢀⠁⠀⠀⠀
Terms appear in 3 positions

https://fugue-tutorials.readthedocs.io/tutorials/integrations/warehouses/bigquery.html

Fugue with BigQuery — Fugue Tutorials

The most common way is to use the Make sure the GOOGLE_APPLICATION_CREDENTIALS is set to a credential file. You can also read the of BigQuery. If you are using a distributed cluster with Spark, Dask, or Ray, all of the machines in the cluster need to be a

Js ⠀⠀⢀⠀⢀⠀⠀⠀
Terms appear in 2 positions

https://fugue-tutorials.readthedocs.io/tutorials/integrations/ecosystem/datacompy.html

DataComPy — Fugue Tutorials

DataComPy is an open-source package by Capital One that started as a way to compare two Pandas DataFrames with some more functionality than just Pandas.DataFrame.equals(Pandas.DataFrame). It allows users to specify tolerances and prints out statistics.

Js ⠀⠂⠀⢐⠀⠀⠀⠀
Terms appear in 3 positions

https://fugue-tutorials.readthedocs.io/tutorials/integrations/backends/ibis.html

Fugue with Ibis — Fugue Tutorials

The Ibis project tries to bridge the gap between local Python and various backends, including distributed systems such as Spark and Dask. The main idea is to create a pythonic interface to express SQL semantics, so the expression is agnostic to the backends.

Js ⠀⠀⢀⠀⠸⠀⠀⠀
Terms appear in 4 positions

Syntax

This is a keyword-based search engine. When entering multiple search terms, the search engine will attempt to match them against documents where the terms occur in close proximity.

Search terms can be excluded with a hyphen.

While the search engine at present does not allow full text search, quotes can be used to specifically search for names or terms in the title. Using quotes will also cause the search engine to be as literal as possible in interpreting the query.

Parentheses can be used to add terms to the query without giving weight to the terms when ranking the search results.

Samples

soup -chicken
Look for keywords that contain soup, but not chicken.
"keyboard"
Look for pages containing the exact word keyboard, not keyboards or the like.
"steve mcqueen"
Look for pages containing the exact words steve mcqueen in that order, with no words in between.
apology (plato)
Look for pages containing apology and plato, but only rank them based on their relevance to apology.
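
The syntax rules and samples above can be sketched as a small tokenizer. This is only an illustration of the four query forms (plain keyword, hyphen exclusion, quoted phrase, parenthesized term), not the search engine's actual parser; the names `parse_query` and `ParsedQuery` are hypothetical and chosen for this example.

```python
import re
from dataclasses import dataclass, field

# One alternative per syntax form described in the text:
#   -term       excluded term
#   "a b c"     exact words in order
#   (term ...)  terms that are required but carry no ranking weight
#   term        ordinary keyword
TOKEN = re.compile(
    r'-(?P<exclude>[^\s"()]+)'
    r'|"(?P<phrase>[^"]*)"'
    r'|\((?P<advisory>[^)]*)\)'
    r'|(?P<plain>[^\s"()]+)'
)


@dataclass
class ParsedQuery:
    """Hypothetical container for the parts of a query string."""
    include: list = field(default_factory=list)
    exclude: list = field(default_factory=list)
    phrases: list = field(default_factory=list)
    advisory: list = field(default_factory=list)


def parse_query(query: str) -> ParsedQuery:
    """Split a query string into the four term categories."""
    parsed = ParsedQuery()
    for match in TOKEN.finditer(query):
        if match.group("exclude") is not None:
            parsed.exclude.append(match.group("exclude"))
        elif match.group("phrase") is not None:
            # Keep the words of a quoted phrase together, in order.
            parsed.phrases.append(match.group("phrase").split())
        elif match.group("advisory") is not None:
            parsed.advisory.extend(match.group("advisory").split())
        else:
            parsed.include.append(match.group("plain"))
    return parsed


print(parse_query("soup -chicken"))
print(parse_query('"steve mcqueen"'))
print(parse_query("apology (plato)"))
```

A hyphen only marks an exclusion at the start of a term in this sketch, so a hyphenated word such as `top-level` is kept as one ordinary keyword.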

Special Keywords

Several special keywords are supported by the search engine.

Keyword                   Meaning
site:example.com          Display site information about example.com
site:example.com keyword  Search example.com for keyword
browse:example.com        Show similar websites to example.com
ip:127.0.0.1              Search documents hosted at 127.0.0.1
links:example.com         Search documents linking to example.com
tld:edu keyword           Search documents with the top level domain edu.
?tld:edu keyword          Prefer but do not require results with the top level domain edu. This syntax is also possible for links:..., ip:... and site:...
q>5                       The amount of javascript and modern features is at least 5 (on a scale 0 to 25)
q<5                       The amount of javascript and modern features is at most 5 (on a scale 0 to 25)
year>2005                 (beta) The document was ostensibly published in or after 2005
year=2005                 (beta) The document was ostensibly published in 2005
year<2005                 (beta) The document was ostensibly published in or before 2005
rank>50                   The ranking of the website is at least 50 in a span of 1 - 255
rank<50                   The ranking of the website is at most 50 in a span of 1 - 255
count>10                  The search term must appear in at least 10 results from the domain
count<10                  The search term must appear in at most 10 results from the domain
format:html5              Filter documents using the HTML5 standard. These are typically modern websites.
format:xhtml              Filter documents using the XHTML standard
format:html123            Filter documents using the HTML standards 1, 2, and 3. These are typically very old websites.
generator:wordpress       Filter documents with the specified generator, in this case wordpress
file:zip                  Filter documents containing a link to a zip file (most file-endings work)
file:audio                Filter documents containing a link to an audio file
file:video                Filter documents containing a link to a video file
file:archive              Filter documents containing a link to a compressed archive
file:document             Filter documents containing a link to a document
-special:media            Filter out documents with audio or video tags
-special:scripts          Filter out documents with javascript
-special:affiliate        Filter out documents with likely Amazon affiliate links
-special:tracking         Filter out documents with analytics or tracking code
-special:cookies          Filter out documents with cookies
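
As a rough illustration of how the special keywords listed here differ in shape, the sketch below classifies a single query token as either a `name:value` filter (optionally prefixed with `?` or `-`) or a numeric comparison such as `year>2005`. The `Filter` type and `parse_filter` function are hypothetical names for this example; how the search engine itself recognizes these keywords is not documented here.

```python
import re
from typing import NamedTuple, Optional


class Filter(NamedTuple):
    """Hypothetical representation of one special keyword."""
    name: str
    op: str
    value: str
    optional: bool = False   # token was written with a leading '?'
    negated: bool = False    # token was written with a leading '-'


# name:value keywords, limited to the names that appear in the table.
PREFIXED = re.compile(
    r"^(?P<flag>[?-]?)"
    r"(?P<name>site|browse|ip|links|tld|format|generator|file|special)"
    r":(?P<value>\S+)$"
)

# Numeric comparisons: q, year, rank and count followed by <, > or =.
COMPARED = re.compile(r"^(?P<name>q|year|rank|count)(?P<op>[<>=])(?P<value>\d+)$")


def parse_filter(token: str) -> Optional[Filter]:
    """Return a Filter for a special keyword, or None for an ordinary term."""
    match = COMPARED.match(token)
    if match:
        return Filter(match["name"], match["op"], match["value"])
    match = PREFIXED.match(token)
    if match:
        return Filter(
            match["name"],
            ":",
            match["value"],
            optional=match["flag"] == "?",
            negated=match["flag"] == "-",
        )
    return None


for token in ["site:example.com", "?tld:edu", "year>2005", "-special:media", "keyword"]:
    print(token, "->", parse_filter(token))
```

Values are kept as strings in this sketch, and range checks (for example that `q` lies between 0 and 25) are left out for brevity.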

Results Legend

The estimated relevance of a search result is indicated by the color saturation of the result, as well as by the order in which the results are presented.

Information about the position of the match is indicated using a dot matrix in the bottom bar of each search result. Each dot represents four sentences, and the dots are presented top-to-bottom, left-to-right.

⣿⠃⠀⠀   — The terms occur heavily toward the beginning of the document.

⠠⠀⡄⠁   — The terms occur sparsely throughout the document.

⠀⠁⠀⠀   — The terms occur only in a single sentence.
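
The dot matrices are drawn with Unicode Braille pattern characters, so they can be decoded mechanically. The sketch below is one possible reading, assuming that dots are ordered column by column (top to bottom within a column, then the next column to the right) and that each dot covers four consecutive sentences as stated above. The function names are hypothetical, and the search engine's own encoding may differ.

```python
BRAILLE_BASE = 0x2800  # U+2800 is the empty Braille pattern

# Unicode assigns one bit per dot: dots 1-8 map to bits 0-7.
# The left column holds dots 1, 2, 3, 7 and the right column dots 4, 5, 6, 8,
# so reading column by column, top to bottom, visits the bits in this order.
COLUMN_MAJOR_BITS = (0, 1, 2, 6, 3, 4, 5, 7)

SENTENCES_PER_DOT = 4  # taken from the legend text


def decode_positions(matrix: str) -> list:
    """Return the indices of the dots that are set, in reading order."""
    positions = []
    for cell_index, char in enumerate(matrix):
        offset = ord(char) - BRAILLE_BASE
        if not 0 <= offset <= 0xFF:
            raise ValueError(f"not a Braille pattern character: {char!r}")
        for slot, bit in enumerate(COLUMN_MAJOR_BITS):
            if offset & (1 << bit):
                positions.append(cell_index * len(COLUMN_MAJOR_BITS) + slot)
    return positions


def sentence_ranges(matrix: str) -> list:
    """Map each set dot to the (first, last) sentence index it covers."""
    return [
        (p * SENTENCES_PER_DOT, p * SENTENCES_PER_DOT + SENTENCES_PER_DOT - 1)
        for p in decode_positions(matrix)
    ]


# The "single sentence" example from the legend, written with escapes:
# blank cell, a cell with only dot 1 set, then two blank cells.
single = "\u2800\u2801\u2800\u2800"
print(decode_positions(single))   # [8]
print(sentence_ranges(single))    # [(32, 35)]
```

Under these assumptions, the single dot in the third legend example is the ninth dot in reading order, which would place the match somewhere in sentences 32 to 35 of the document (counting from zero).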

Potential problems with the document are presented with a warning triangle, e.g. ⚠ 3. Desktop users can mouse over this to get a detailed breakdown.

Policies

This website complies with the GDPR by not collecting any personal information, and with the EU Cookie Directive by not using cookies.

Contact

Reach me at kontakt@marginalia.nu, or @MarginaliaNu on Twitter.

Open Source

The search engine is open source with an AGPL license. The sources can be perused at https://git.marginalia.nu/.

Data Sources

IP geolocation is sourced from the IP2Location LITE data available from https://lite.ip2location.com/ under CC-BY-SA 4.0.