// projects / case-study · 03

NYC TLC
trip records.

Billion-row dataset — a performance study in BigQuery partitioning, clustering, and query-cost tuning.

// dataset
bigquery-public-data.new_york_taxi_trips
// type
planned
Status: in development — case study lands when the project ships.
// 01 / scope

What's planned.

Take a billion-row BigQuery public dataset and treat it like a real warehouse problem — partitioning strategy, clustering keys, slot/cost tradeoffs, and how query design decisions show up in the bill.

  • Source: bigquery-public-data.new_york_taxi_trips
  • Focus: cost-per-query analysis across naive vs. tuned implementations
  • Output: a writeup with before/after query plans, cost numbers, and a Looker Studio summary
// 02 / planned stack

Stack.

BigQuery SQL Looker Studio