Posts by Coiled Team
21 December 2023 - Xarray at Large Scale: A Beginner’s Guide
Posts by Daniel van Strien
09 October 2024 - Scaling AI-Based Data Processing with Hugging Face + Dask
Posts by David Chudzicki
06 January 2023 - Handling Unexpected AWS IAM Changes
Posts by Florian Jetter
19 April 2024 - Dask vs. Spark
Posts by Franco Bosetti
12 November 2024 - Airflow, Dask, & Coiled: Adding Big Data Processing to Your Cloud Toolkit
Posts by Greg Hayes
06 January 2023 - Scaling Hyperparameter Optimization With XGBoost, Optuna, and Dask
19 December 2022 - Automated Data Pipelines On Dask With Coiled & Prefect
Posts by Guido Imperiale
23 August 2023 - Fine Performance Metrics and Spans
05 May 2023 - Performance testing at Coiled
Posts by Hendrik Makait
16 October 2024 - Large Scale Geospatial Benchmarks: First Pass
14 May 2024 - DataFrames at Scale Comparison: TPC-H
16 May 2023 - Observability for Distributed Computing with Dask
15 March 2023 - Shuffling large data at constant memory in Dask
Posts by Jack Solomon
31 January 2024 - Real-world Grocery Demand Forecasting
Posts by James Bourbeau
16 October 2024 - Large Scale Geospatial Benchmarks: First Pass
09 October 2024 - Scaling AI-Based Data Processing with Hugging Face + Dask
09 September 2024 - Large Scale Geospatial Benchmarks
08 April 2024 - Easy Scalable Production ETL
23 January 2024 - Schedule Python Jobs with Prefect and Coiled
01 November 2023 - Processing Terabyte-Scale NASA Cloud Datasets with Coiled
07 September 2023 - Parallel Serverless Functions at Scale
10 August 2023 - Data-proximate Computing with Coiled Functions
14 June 2023 - Coiled notebooks
18 May 2023 - Distributed printing
18 April 2023 - Upstream testing in Dask
Posts by Lucas Gabriel
09 August 2023 - Dask, Dagster, and Coiled for Production Analysis at OnlineApp
Posts by Matt Rocklin
09 September 2024 - Large Scale Geospatial Benchmarks
Posts by Matthew Powers
09 February 2022 - Reading CSV files into Dask DataFrames with read_csv
01 October 2021 - Converting a Dask DataFrame to a pandas DataFrame
Posts by Matthew Rocklin
02 January 2025 - Coiled 2024 in Review
19 November 2024 - SLURM-Style Job Arrays on the Cloud with Coiled
16 October 2024 - Large Scale Geospatial Benchmarks: First Pass
14 May 2024 - DataFrames at Scale Comparison: TPC-H
06 October 2023 - Ten Cents Per Terabyte
01 August 2023 - Easy Heavyweight Serverless Functions
Posts by Miles Granger
15 May 2023 - GIL monitoring in Dask
Posts by Nat Tabris
14 June 2023 - Coiled notebooks
05 May 2023 - How well does Dask run on Graviton?
06 January 2023 - AWS Cost Explorer Tips and Tricks
Posts by Nathan Ballou
17 November 2023 - Process Hundreds of GB of Data in the Cloud with Polars
Posts by Patrick Hoefler
17 March 2025 - Reducing Memory Pressure for Xarray + Dask Workloads
17 December 2024 - Faster Xarray Quantile Computations with Dask
21 November 2024 - Improving GroupBy.map with Dask and Xarray
14 May 2024 - Dask DataFrame is Fast Now
05 October 2023 - TPC-H Benchmarks for Query Optimization with Dask Expressions
19 September 2023 - Coiled observability wins: Chunksize
01 September 2023 - Reduce training time for CPU intensive models with scikit-learn and Coiled Functions
07 August 2023 - Process Hundreds of GB of Data with DuckDB in the Cloud
04 August 2023 - High Level Query Optimization in Dask
05 June 2023 - Utilizing PyArrow to improve pandas and Dask workflows
Posts by Pavithra Eswaramoorthy
22 November 2021 - Pandas parallel apply and map with Dask DataFrame
Posts by Quentin Lhoest
09 October 2024 - Scaling AI-Based Data Processing with Hugging Face + Dask
Posts by Samantha Hughes
23 February 2023 - Just in time Python environments
17 January 2023 - How many PEPs does it take to install a package?
Posts by Sarah Johnson
09 October 2024 - Scaling AI-Based Data Processing with Hugging Face + Dask
14 May 2024 - DataFrames at Scale Comparison: TPC-H
19 April 2024 - Dask vs. Spark
05 February 2024 - One Trillion Row Challenge
16 January 2024 - One Billion Row Challenge (1BRC) in Python with Dask
10 October 2023 - Run Jupyter Notebooks on a GPU on the Cloud
05 September 2023 - Processing a 250 TB dataset with Coiled, Dask, and Xarray
05 May 2023 - How well does Dask run on Graviton?
23 February 2023 - Just in time Python environments
Posts by Stephen Schneider
12 November 2024 - Airflow, Dask, & Coiled: Adding Big Data Processing to Your Cloud Toolkit