Data-Engineering

December 20, 2024

Mastering Spark: Session vs. DataFrameWriter vs. Table Configs

December 12, 2024

Should You Ditch Spark for DuckDb or Polars?

November 20, 2024

Announcing the Microsoft Fabric Shape Library for Excalidraw

November 4, 2024

Unlock Faster Writes in Delta Lake with Deletion Vectors

October 24, 2024

Breaking the Myth: Spark Isn’t as Scary as You'd Think (And Yes, It Supports SQL!)

October 14, 2024

Mastering Spark: Creating Resiliency with Retry Logic

October 11, 2024

Mastering Spark: Parallelizing API Calls and Other Non-Distributed Tasks

October 10, 2024

Mastering Spark: RDDs vs. DataFrames

September 27, 2024

Yet Another Way to Connect to the SQL Endpoint / Warehouse via Python

September 17, 2024

To V-Order or Not: Making the Case for Selective Use of V-Order in Fabric Spark

September 13, 2024

Mastering Spark: Enhancing Job Visibility

August 22, 2024

From Databricks to Fabric: A Deep Dive into Spark Cluster Differences

August 16, 2024

Optimizing Spark: A Deep Dive into Optimized Write in Microsoft Fabric

July 18, 2024

Elevate Your Code: Developing Python Libraries Using Microsoft Fabric

May 23, 2024

Fabric Announcements at Build '24

May 19, 2024

Generative AI and Spark: Leveraging LLMs for Accelerated Migrations

April 30, 2024

The TCO of Photon in Databricks: Is it a No Brainer?

April 26, 2024

The Fabric Concurrency Showdown: RunMultiple vs. ThreadPools

April 17, 2024

The SQL Decoder Ring for Replatforming to Fabric and Databricks

February 26, 2024

Beyond Information Schema: Metadata Mastery in a Fabric Lakehouse