Authors — Coiled documentation

Posts by Alexandre Chabot-Leclerc

20 October 2025 - MLflow on Coiled with Sidecars and Filestores

Posts by Coiled Team

21 December 2023 - Xarray at Large Scale: A Beginner’s Guide

Posts by Daniel van Strien

09 October 2024 - Scaling AI-Based Data Processing with Hugging Face + Dask

Posts by David Chudzicki

06 January 2023 - Handling Unexpected AWS IAM Changes

Posts by Florian Jetter

19 April 2024 - Dask vs. Spark

Posts by Franco Bosetti

12 November 2024 - Airflow, Dask, & Coiled: Adding Big Data Processing to Your Cloud Toolkit

Posts by Greg Hayes

06 January 2023 - Scaling Hyperparameter Optimization With XGBoost, Optuna, and Dask

19 December 2022 - Automated Data Pipelines On Dask With Coiled & Prefect

Posts by Guido Imperiale

23 August 2023 - Fine Performance Metrics and Spans

05 May 2023 - Performance testing at Coiled

Posts by Hendrik Makait

16 October 2024 - Large Scale Geospatial Benchmarks: First Pass

14 May 2024 - DataFrames at Scale Comparison: TPC-H

23 June 2023 - Dask performance benchmarking put to the test: Fixing a pandas bottleneck

16 May 2023 - Observability for Distributed Computing with Dask

15 March 2023 - Shuffling large data at constant memory in Dask

Posts by Jack Solomon

31 January 2024 - Real-world Grocery Demand Forecasting

Posts by James Bourbeau

16 October 2024 - Large Scale Geospatial Benchmarks: First Pass

09 October 2024 - Scaling AI-Based Data Processing with Hugging Face + Dask

09 September 2024 - Large Scale Geospatial Benchmarks

08 April 2024 - Easy Scalable Production ETL

23 January 2024 - Schedule Python Jobs with Prefect and Coiled

01 November 2023 - Processing Terabyte-Scale NASA Cloud Datasets with Coiled

07 September 2023 - Parallel Serverless Functions at Scale

10 August 2023 - Data-proximate Computing with Coiled Functions

14 June 2023 - Coiled notebooks

18 May 2023 - Distributed printing

18 April 2023 - Upstream testing in Dask

Posts by James Bourbeau and Florian Jetter

01 January 2022 - Snowflake and Dask: a Python Connector for Faster Data Transfer

Posts by Lucas Gabriel

09 August 2023 - Dask, Dagster, and Coiled for Production Analysis at OnlineApp

Posts by Luke Canavan

08 July 2026 - Coiled Stability Update

Posts by Matt Rocklin

09 September 2024 - Large Scale Geospatial Benchmarks

Posts by Matthew Powers

09 February 2022 - Reading CSV files into Dask DataFrames with read_csv

01 October 2021 - Converting a Dask DataFrame to a pandas DataFrame

Posts by Matthew Rocklin

02 January 2025 - Coiled 2024 in Review

19 November 2024 - SLURM-Style Job Arrays on the Cloud with Coiled

16 October 2024 - Large Scale Geospatial Benchmarks: First Pass

14 May 2024 - DataFrames at Scale Comparison: TPC-H

06 October 2023 - Ten Cents Per Terabyte

01 August 2023 - Easy Heavyweight Serverless Functions

01 January 2022 - Coiled, one year in

Posts by Miles Granger

15 May 2023 - GIL monitoring in Dask

Posts by Nat Tabris

14 June 2023 - Coiled notebooks

05 May 2023 - How well does Dask run on Graviton?

04 April 2023 - Burstable vs non-burstable AWS instance types for data engineering workloads

06 January 2023 - AWS Cost Explorer Tips and Tricks

Posts by Nat Tabris and Alexandre Chabot-Leclerc

21 August 2025 - Marimo on Coiled

Posts by Nathan Ballou

17 November 2023 - Polars queries in the cloud with Coiled

Posts by Patrick Hoefler

17 March 2025 - Reducing Memory Pressure for Xarray + Dask Workloads

17 December 2024 - Faster Xarray Quantile Computations with Dask

21 November 2024 - Improving GroupBy.map with Dask and Xarray

14 May 2024 - Dask DataFrame is Fast Now

05 October 2023 - TPC-H Benchmarks for Query Optimization with Dask Expressions

19 September 2023 - Coiled observability wins: Chunksize

01 September 2023 - Reduce training time for CPU intensive models with scikit-learn and Coiled Functions

07 August 2023 - Process Hundreds of GB of Data with DuckDB in the Cloud

04 August 2023 - High Level Query Optimization in Dask

24 July 2023 - How to Train a Neural Network on a GPU in the Cloud with coiled functions

23 June 2023 - Dask performance benchmarking put to the test: Fixing a pandas bottleneck

05 June 2023 - Utilizing PyArrow to improve pandas and Dask workflows

Posts by Pavithra Eswaramoorthy

22 November 2021 - Pandas parallel apply and map with Dask DataFrame

Posts by Quentin Lhoest

09 October 2024 - Scaling AI-Based Data Processing with Hugging Face + Dask

Posts by Samantha Hughes

23 February 2023 - Just in time Python environments

17 January 2023 - How many PEPs does it take to install a package?

Posts by Sameer Soi

01 January 2022 - Abalone Bio: Accelerating Antibody Discovery

Posts by Sarah Johnson

05 September 2025 - How to run pandas on GPUs with Coiled

09 October 2024 - Scaling AI-Based Data Processing with Hugging Face + Dask

14 May 2024 - DataFrames at Scale Comparison: TPC-H

19 April 2024 - Dask vs. Spark

05 February 2024 - One Trillion Row Challenge

16 January 2024 - One Billion Row Challenge (1BRC) in Python with Dask

10 October 2023 - Run Jupyter Notebooks on a GPU on the Cloud

05 September 2023 - Processing a 250 TB dataset with Coiled, Dask, and Xarray

05 May 2023 - How well does Dask run on Graviton?

23 February 2023 - Just in time Python environments

Posts by Stephen Schneider

12 November 2024 - Airflow, Dask, & Coiled: Adding Big Data Processing to Your Cloud Toolkit

Posts by null

01 January 2022 - Writing Parquet Files with Dask using to_parquet

01 January 2022 - Why we passed on Kubernetes

01 January 2022 - Use Mambaforge to Conda Install PyData Stack on your Apple M1 Silicon Machine

01 January 2022 - Understanding Managed Dask (Dask as a Service)

01 January 2022 - Tackling unmanaged memory with Dask

01 January 2022 - Speed up a pandas query 10x with these 6 Dask DataFrame tricks

01 January 2022 - Spark to Dask: The Good, Bad, and Ugly of Moving from Spark to Dask

01 January 2022 - Seven Stages of Open Software

01 January 2022 - Setting a Dask DataFrame index

01 January 2022 - Search at Grubhub and User Intent

01 January 2022 - Scikit-learn in the Cloud with Coiled

01 January 2022 - Scale your data science workflows with Python and Dask

01 January 2022 - Save Money with Spot

01 January 2022 - Repartitioning Dask DataFrames

01 January 2022 - Reducing memory usage in Dask workloads by 80%

01 January 2022 - Reduce memory usage with Dask dtypes

01 January 2022 - PyArrow Strings in Dask DataFrames

01 January 2022 - Prioritizing Pragmatic Performance for Dask

01 January 2022 - Perform a Spatial Join in Python

01 January 2022 - Introducing the Dask Active Memory Manager

01 January 2022 - How to Merge Dask DataFrames

01 January 2022 - How to Convert a pandas Dataframe into a Dask Dataframe

01 January 2022 - How Coiled sets memory limit for Dask workers

01 January 2022 - Filtering Dask DataFrames with loc

01 January 2022 - Enterprise Dask Support

01 January 2022 - Easily Run Python Functions in Parallel

01 January 2022 - Dask on GCP

01 January 2022 - Dask on Azure

01 January 2022 - Dask on AWS

01 January 2022 - Dask for Parallel Python

01 January 2022 - Dask and the PyData Stack

01 January 2022 - Dask Read Parquet Files into DataFrames with read_parquet

01 January 2022 - Creating Disk Partitioned Lakes with Dask using partition_on

01 January 2022 - Cost Savings with Dask and Coiled

01 January 2022 - Convert Large JSON to Parquet with Dask

01 January 2022 - Coiled Cloud Architecture

01 January 2022 - Code Formatting Jupyter Notebooks with Black

01 January 2022 - Cloud ETL automation with Github Actions and Coiled

01 January 2022 - Better Shuffling in Dask: a Proof-of-Concept

01 January 2022 - Accelerating Microstructural Analytics with Dask and Coiled