Blog

This curated set of posts from the Starburst blog have been compiled for the Developer Center community.

alt
Understanding data analytics vs AI

When it comes to analytics, deriving value from data has always been the goal. Specifically,

Apr 1st, 2025

Read the blog post
alt
Celebrating women’s contributions to technology with Starburst

In honor of Women’s History Month, Starburst is proud to recognize women pioneers in technology

Mar 26th, 2025

Read the blog post
alt
The file explosion problem in Apache Iceberg and what to do when it happens to you

Apache Iceberg is one of the leading tools that can be used to manage metadata

Mar 12th, 2025

Read the blog post
alt
Introducing the Trino spooling protocol

Trino’s origin story is all about making interactive queries faster than the original schema-on-read solution,

Mar 12th, 2025

Read the blog post
alt
The 3 foundations of an AI data architecture

In my last blog, I showed how Starburst has become the foundation for an emerging

Feb 27th, 2025

Read the blog post
alt
Starburst + AI

I’ve always believed in technology that solves real business problems. Over the years, as the

Feb 26th, 2025

Read the blog post
alt
What is Apache Spark?

Any conversation about the data engineering space should begin with a discussion of what Apache

Feb 21st, 2025

Read the blog post
alt
Racing for commits on Delta Lake tables using Starburst

Delta Lake is great for data analytics and data engineering. One of the things that

Feb 5th, 2025

Read the blog post
alt
Do you have a data velocity problem?

We all know that the volume of data is increasing. In fact, information collected by

Jan 29th, 2025

Read the blog post
alt
Are you making the most of your Hadoop cluster?

When it first launched, Apache Hadoop was a tectonic shift in the big data landscape.

Jan 21st, 2025

Read the blog post
alt
How to do data centralization the right way

Is data centralization the new black?  For a long time, data centralization was the default

Jan 13th, 2025

Read the blog post
alt
Parquet and ORC’s many shortfalls for machine learning (ML) workloads, and what should be done about them

At the turn of the century (around a quarter of a decade ago), over 99%

Jan 7th, 2025

Read the blog post