Prof. Dr. Ingo Claßen
Data Systems

airbyte

alloydb

  • home (link)
  • Run AlloyDB anywhere - in your data center, your laptop, or in any cloud (link)

apache arrow

  • home (link)
  • blog (link)
  • Arrow Columnar Format (link)
  • pact book chapter 1 (link)
  • rust implementation - arrow2 (link)
  • Querying Parquet with Millisecond Latency (link)
  • Our journey at F5 with Apache Arrow (part 1) (link)
  • Supercharge Your Data Pipelines with Advanced Apache Arrow (link)
  • Apache Arrow Flight as a Data Catalog (link)
  • optd - Apache Arrow Datafusion (link)
  • I spent 6 hours learning Apache Arrow: Overview (link)
  • ADBC: The Future of Database Connectivity (link)

apache datafusion

apache doris

aws aurora

  • DSQL Vignette: Aurora DSQL, and A Personal Story (link)
  • A Major Postgres Upgrade with Zero Downtime (link)

aws s3

  • AWS S3 Deep Dive (link)
  • Deep Dive into New Amazon S3 Tables (link)

cedarDB

clickhouse

  • home (link)
  • ClickHouse: A Blazingly Fast DBMS with Full SQL Join Support (link)
  • I spent 3 hours learning the overview of ClickHouse (link)
  • I spent 8 hours understanding ClickHouse Architecture (link)
  • I spent 5 hours learning how ClickHouse built their internal data warehouse (link)
  • Hash tables in ClickHouse and C++ Zero-cost Abstractions (link)

cockroachdb

cosmosdb

documentDB - microsoft

dragonflydb

druid

  • When to Use Apache Druid (and When Not to Use It) (link)

faunadb

hydra

hyper

glaredb

ibis

  • home (link)
  • What is Ibis and how does it help data engineering? (link)
  • Querying 1TB on a laptop with Python dataframes (link)
  • Ibis Basics (link)

influxdb

kafka

  • Processing guarantees in Kafka (link)
  • Real-time data pipeline using Kafka and ClickHouse (link)
  • Query Your Data in Kafka Using SQL (link)

kuzudb

lancedb

  • What the Heck is LanceDB? (link)

leveldb

memgraph

mongodb

mysql

  • MySQL High Availability (link)
  • Sharding Pinterest: How we scaled our MySQL fleet (link)

neon

nile

malloy

niledb

  • Introducing Nile, Serverless Postgres for modern SaaS (link)

paradeDB

  • home (link)
  • doc (link)
  • blog (link)
  • pg_analytics: Transforming Postgres into a Fast OLAP Database (link)
  • ParadeDB - A New Postgres Block Storage Layout for Full Text Search (link)

polardb

  • PolarDB for PostgreSQL (link)

polars

  • user guide (link)
  • git (link)
  • DataFusion in Python (link)
  • Add examples showing how to execute SQL with Polars and Pandas (link)
  • Using Polars Plugins for a 14x Speed Boost with Rust (link)
  • Process Hundreds of GB of Data in the Cloud with Polars (link)

prometheus

puppygraph

redis

  • home (link)
  • doc (link)
  • Setting Up a Production-Ready Redis Cluster (link)
  • Introduction to the in-memory datastore Redis (link)

risingwave

rocksdb

  • wiki (link)
  • blog (link)
  • How RocksDB works (link)
  • RocksJava Basics (link)
  • Simple RocksDB with Java (link)
  • Try using RocksDB in Java (link)
  • Navigating the Minefield of RocksDB Configuration Options (link)
  • Storing Time Series in RocksDB: A Cookbook (link)

rockset

scylladb

  • home (link)
  • doc (link)
  • blog (link)
  • ScyllaDB’s Path to Strong Consistency (link)
  • ScyllaDB’s Rust Developer Workshop: What We Learned (link)
  • Mind-blowing: PostgreSQL Meets ScyllaDB’s Lightning Speed and Monstrous Scalability (link)

snowflake

  • I spent 8 hours diving deep into Snowflake (again) (link)

sqlite

  • SQLite compiled to JavaScript (link)

starrocks

supabase

terminusdb

tigergraph

tidb, tikv

timescaledb

vannaAI

velox

umbradb