Articles by tanelpoder
1

Networking and eBPF Predictions for 2026 and Beyond (isovalent.com)

1

Huatuo: A cloud-native operating system observability project based on eBPF (github.com/ccfos)

2

JOL (Java Object Layout) is the tiny toolbox to analyze object layout in JVMs (github.com/openjdk)

1

An In-Depth Look at Pipe and Splice Implementation in Linux Kernel (oracle.com)

3

The $280k IT Director's Guide to Corporate Survival (twitter.com/it_unprofession)

1

Visualizing embedding vectors as heatmaps for explaining their low level nature (tanelpoder.com)

2

ANN v3: 200ms p99 query latency over 100B vectors (turbopuffer.com)

1

Kafka Dead Letter Queue Triage: Debugging 25,000 Failed Messages (skey.uk)

2

Xgotop: Realtime Go Runtime Visualizer (devpost.com)

1

High-bandwidth flash progress and future (blocksandfiles.com)

2

AliSQL is a MySQL branch originated from Alibaba Group (github.com/alibaba)

1

Shared execution plan cache for Amazon Aurora PostgreSQL (amazon.com)

2

Building an eBPF/XDP L2 Direct Server Return Load Balancer from Scratch (iximiuz.com)

1

Blocking-Lock Brownouts Can Escalate from Row-Level to Complete System Outages (ardentperf.com)

2

MySQL 8.4 disables AHI – Why and What you need to know (nitty-witty.com)

1

Clockwork: Runtime agnostic async executor with powerful configurable scheduling (github.com/nikhilgarg28)

2

Switch Join: PostgreSQL that adapts on the fly (alenarybakina.substack.com)

1

The Algebra of Speed: Mathematical Foundations of Computational Performance (ttsugriy.github.io)

2

Apache Arrow for the Database (dataengineeringcentral.substack.com)

2

Pragmatic bitmap filters in Microsoft SQL Server (vldb.org)

2

Syscallargs: List all Linux system calls with their arguments from tracefs (tanelpoder.com)

1

Pragmatic Bitmap Filters in Microsoft SQL Server [pdf] (vldb.org)

1

Does AI-Assisted Coding Deliver? A Study of Cursor's Impact on Software Projects (arxiv.org)

402

STFU (github.com/pankajtanwarbanna)

1

wal3: A Write-Ahead Log for Chroma, Built on Object Storage (trychroma.com)

4

From Building Houses to Storage Engines (tidesdb.com)

2

Alternatives to MinIO for single-node local S3 (rmoff.net)

32

psc: The ps utility, with an eBPF twist and container context (github.com/loresuso)

1

Dell warns against reusing SSDs as flash shortages bite (blocksandfiles.com)

2

Pg-safeupdate: A PostgreSQL extension requiring criteria for UPDATE and DELETE (github.com/eradman)

1

DuckDB beats Polars for 1TB of data (confessionsofadataguy.com)

1

GizmoSQL Documentation (gizmosql.com)

10

Generate QR Codes with Pure SQL in PostgreSQL (tanelpoder.com)

1

Vendor Locked CPUs, Restricting and Securing Hardware (cloudninjas.com)

3

Waymo Robotaxi Causes Traffic Jam in Miami Beach (miaminewtimes.com)

2

Zymtrace vs. Nsight: Profiling Nvidia GPU Clusters at Scale (zymtrace.com)

1

Hardening eBPF for Runtime Security: Lessons from Datadog Workload Protection (datadoghq.com)

2

Native NVMe Support in Windows Server 2025 (techcommunity.microsoft.com)

1

M-Lab: Measure the Internet, save the data, and make it accessible and useful (measurementlab.net)

2

Quickly Inspect Your Java Application with JStall (mostlynerdless.de)

2

The Data Center as a Computer: Designing Warehouse-Scale Machines 2026 ed. [pdf] (springer.com)

3

The Data Center as a Computer: Designing Warehouse-Scale Machines 2026 ed. [pdf] (springer.com)

1

We Debug Live Kernels Using Drgn – You Can Too (oracle.com)

3

Pyleak: Detect asyncio event loop blocking with stack traces in Python (github.com/deepankarm)

1

MATCH_RECOGNIZE in BigQuery (cloud.google.com)

3

DuckDB as the New jq (pgrs.net)

3

NVIDIA CUDA Tile programming model (nvidia.com)

1

podtrace: eBPF-based diagnostic tool for Kubernetes applications (github.com/gma1k)

1

Who Will Observe the Observability? eBPF Performance at Scale (zmalik.dev)

3

Plans for MySQL Vector Support and a MySQL Binlog Server (percona.com)

145

Linux Kernel Explorer (reverser.dev)

1

Kioxia 245TB SSD (tomshardware.com)

1

Researchers push "Context Engineering 2.0" as the road to lifelong AI memory (the-decoder.com)

8

AI Code Is Going to Kill Your Startup (and You're Going to Let It) (medium.com/kcl17)

3

The Art of Not Being Dumb (yewjin.substack.com)

2

Postgres Internals Hiding in Plain Sight (crunchydata.com)

11

Vortex: An extensible, state of the art columnar file format (github.com/vortex-data)

1

MT4G: A Tool for Auto-Discovery of NVIDIA and AMD GPU Compute, Memory Topologies (arxiv.org)

1

Postgres IPC:SyncRep – Sync Replication Is Not Actually Sync Replication (ardentperf.com)

75

650GB of Data (Delta Lake on S3). Polars vs. DuckDB vs. Daft vs. Spark (dataengineeringcentral.substack.com)

3

AI is all about inference now (infoworld.com)

2

Explorations of RDMA in LLM Systems (qun.ch)

1

Instance Explorer (datadoghq.com)

3

Larger Than RAM Vector Indexes for Relational Databases (planetscale.com)

1

Turning PySpark into a Universal DataFrame API (github.com/eakmanrq)

2

Mount Mayhem at Netflix: Scaling Containers on Modern CPUs (netflixtechblog.medium.com)

1

Google's MCP Toolbox for Databases: A Technical Deep Dive for Engineering Teams (agnost.ai)

1

11X Faster ScyllaDB Backup (scylladb.com)

2

Enabling Trillion-Parameter Models on AWS EFA (perplexity.ai)

1

pg_stat_plans: Track per-plan call counts, execution times and EXPLAIN texts (github.com/pganalyze)

3

Apache Iggy is a high-performance, persistent message streaming platform (apache.org)

1

Postgres_AI Monitoring (gitlab.com/postgres-ai)

8

Using the expand and contract pattern for schema changes (prisma.io)

1

aperf: A CLI tool to gather performance data and visualize using HTML graphs (github.com/aws)

1

ctop: Top-like interface for container metrics (github.com/bcicen)

2

Container Security book by Liz Rice (2nd edition, free download) (isovalent.com)

1

Reverie: An ergonomic and safe syscall interception framework for Linux (github.com/facebookexperimental)

6

History of Lambda Syntax (hydromatic.net)

3

Cloudflare Workers Automatic Tracing (cloudflare.com)

2

Async-Profiler 4.2 Released (github.com/async-profiler)

2

Making JFR Quack: Importing JFR Files into DuckDB (mostlynerdless.de)

117

A sharded DuckDB on 63 nodes runs 1T row aggregation challenge in 5 sec (gizmodata.com)

1

Sampling in Large Language Models (aiunpacked.net)

2

Show HN: xCapture v3 for thread-level dimensional performance analysis with eBPF (tanelpoder.com)

2

Firefly: Scalable, Ultra-Accurate Clock Synchronization for Datacenters (acm.org)

1

When Models Manipulate Manifolds: The Geometry of a Counting Task (transformer-circuits.pub)

1

A Fine-Grained Purpose-Based Access Control System for Large Data Warehouses (arxiv.org)

1

Method tracing and system-wide process sampling (github.com/async-profiler)

2

Filtering data in real time (at CERN) (cern.ch)

1

Using AI and automation to migrate between instruction sets (cloud.google.com)

8

The principles of extreme fault tolerance (planetscale.com)

2

Scaling a Valkey Cluster to 1B Request per Second (valkey.io)

2

Real-time TCP CWND monitoring and analysis toolkit using eBPF (github.com/lordprinz)

2

Streaming Patterns with DuckDB (duckdb.org)

1

Generate QR Codes with Pure SQL in PostgreSQL (tanelpoder.com)

48

Pipelining in psql (PostgreSQL 18) (verite.pro)

2

Gravitino is a high-performance, geo-distributed, and federated metadata lake (github.com/apache)

1

Taking over a Vibe Coded Project (reddit.com)

3

Hydro – a Rust framework for correct and performant distributed systems (hydro.run)

2

The Rabbit Hole of Building a Filesystem Watcher (amandeepsp.github.io)