Articles by tanelpoder
1

MATCH_RECOGNIZE in BigQuery (cloud.google.com)

3

DuckDB as the New jq (pgrs.net)

3

NVIDIA CUDA Tile programming model (nvidia.com)

1

podtrace: eBPF-based diagnostic tool for Kubernetes applications (github.com/gma1k)

1

Who Will Observe the Observability? eBPF Performance at Scale (zmalik.dev)

3

Plans for MySQL Vector Support and a MySQL Binlog Server (percona.com)

145

Linux Kernel Explorer (reverser.dev)

1

Kioxia 245TB SSD (tomshardware.com)

1

Researchers push "Context Engineering 2.0" as the road to lifelong AI memory (the-decoder.com)

8

AI Code Is Going to Kill Your Startup (and You're Going to Let It) (medium.com/kcl17)

3

The Art of Not Being Dumb (yewjin.substack.com)

2

Postgres Internals Hiding in Plain Sight (crunchydata.com)

11

Vortex: An extensible, state of the art columnar file format (github.com/vortex-data)

1

MT4G: A Tool for Auto-Discovery of NVIDIA and AMD GPU Compute, Memory Topologies (arxiv.org)

1

Postgres IPC:SyncRep – Sync Replication Is Not Actually Sync Replication (ardentperf.com)

75

650GB of Data (Delta Lake on S3). Polars vs. DuckDB vs. Daft vs. Spark (dataengineeringcentral.substack.com)

3

AI is all about inference now (infoworld.com)

2

Explorations of RDMA in LLM Systems (qun.ch)

1

Instance Explorer (datadoghq.com)

3

Larger Than RAM Vector Indexes for Relational Databases (planetscale.com)

1

Turning PySpark into a Universal DataFrame API (github.com/eakmanrq)

2

Mount Mayhem at Netflix: Scaling Containers on Modern CPUs (netflixtechblog.medium.com)

1

Google's MCP Toolbox for Databases: A Technical Deep Dive for Engineering Teams (agnost.ai)

1

11X Faster ScyllaDB Backup (scylladb.com)

2

Enabling Trillion-Parameter Models on AWS EFA (perplexity.ai)

1

pg_stat_plans: Track per-plan call counts, execution times and EXPLAIN texts (github.com/pganalyze)

3

Apache Iggy is a high-performance, persistent message streaming platform (apache.org)

1

Postgres_AI Monitoring (gitlab.com/postgres-ai)

8

Using the expand and contract pattern for schema changes (prisma.io)

1

aperf: A CLI tool to gather performance data and visualize using HTML graphs (github.com/aws)

1

ctop: Top-like interface for container metrics (github.com/bcicen)

2

Container Security book by Liz Rice (2nd edition, free download) (isovalent.com)

1

Reverie: An ergonomic and safe syscall interception framework for Linux (github.com/facebookexperimental)

6

History of Lambda Syntax (hydromatic.net)

3

Cloudflare Workers Automatic Tracing (cloudflare.com)

2

Async-Profiler 4.2 Released (github.com/async-profiler)

2

Making JFR Quack: Importing JFR Files into DuckDB (mostlynerdless.de)

117

A sharded DuckDB on 63 nodes runs 1T row aggregation challenge in 5 sec (gizmodata.com)

1

Sampling in Large Language Models (aiunpacked.net)

2

Show HN: xCapture v3 for thread-level dimensional performance analysis with eBPF (tanelpoder.com)

2

Firefly: Scalable, Ultra-Accurate Clock Synchronization for Datacenters (acm.org)

1

When Models Manipulate Manifolds: The Geometry of a Counting Task (transformer-circuits.pub)

1

A Fine-Grained Purpose-Based Access Control System for Large Data Warehouses (arxiv.org)

1

Method tracing and system-wide process sampling (github.com/async-profiler)

2

Filtering data in real time (at CERN) (cern.ch)

1

Using AI and automation to migrate between instruction sets (cloud.google.com)

8

The principles of extreme fault tolerance (planetscale.com)

2

Scaling a Valkey Cluster to 1B Requests per Second (valkey.io)

2

Real-time TCP CWND monitoring and analysis toolkit using eBPF (github.com/lordprinz)

2

Streaming Patterns with DuckDB (duckdb.org)

1

Generate QR Codes with Pure SQL in PostgreSQL (tanelpoder.com)

48

Pipelining in psql (PostgreSQL 18) (verite.pro)

2

Gravitino is a high-performance, geo-distributed, and federated metadata lake (github.com/apache)

1

Taking over a Vibe Coded Project (reddit.com)

3

Hydro – a Rust framework for correct and performant distributed systems (hydro.run)

2

The Rabbit Hole of Building a Filesystem Watcher (amandeepsp.github.io)

1

A Tutorial of CPU Power Management, C-states and P-states (2018) (metebalci.com)

2

mdserve: Fast Markdown Preview for Terminal Workflows (jrfernandez.com)

2

Python Front End to LLVM IR for eBPF Programs in Pure Python (github.com/varun-r-mallya)

18

How to waste CPU like a Professional (mostlynerdless.de)

1

Method Tracing in Async-Profiler (github.com/async-profiler)

2

eBPF-InXpect: Lightweight XDP Profiling (github.com/vladimiropaschali)

2

Systematic Analysis of Kernel Security Performance and Energy Costs (acm.org)

3

Unwinding a Stack by Hand with Frame Pointers and ORC (oracle.com)

3

Chronon: A data platform for serving for AI/ML applications (github.com/airbnb)

1

Reducing Cold Start Latency for LLM Inference with NVIDIA Run:AI Model Streamer (nvidia.com)

1

Network Storage and Scaling Characteristics of a Distributed Filesystem (maknee.github.io)

2

What Every Computer Scientist Should Know About Floating-Point Arithmetic (1991) (oracle.com)

2

The Big LLM Architecture Comparison by Sebastian Raschka [video] (youtube.com)

3

NVIDIA Accelerated IO (XLIO) (nvidia.com)

1

LLM Benchmark and Optimization Explorer (bentoml.com)

2

Rethinking AI Infrastructure: The Rise of PCIe Switches (semiengineering.com)

2

StringTape: Apache Arrow-compatible space-efficient "tape" class in pure Rust (github.com/ashvardanian)

2

QEMU 10.1 experimental support for compiling to WASM (qemu.org)

3

Pg_DuckDB Version 1.0 (motherduck.com)

4

The xCapture and xtop eBPF tools are now in beta, with a demo dataset (tanelpoder.com)

1

XDP2 (EXpress DataPath 2) (github.com/xdp2-dev)

1

Scribe: Meta transports terabytes per second in real time [pdf] (vldb.org)

1

An In-Depth Look at Pipe and Splice Implementation in Linux Kernel (oracle.com)

2

Google AI Mode - google.com/ai (HN strips the posted URL) (google.com)

1

Google AI Mode (google.com)

2

Java Virtual Threads: Understanding JDK 21 Limitations Before the JDK 25 Release (medium.com/minadev)

37

GigaByte CXL memory expansion card with up to 512GB DRAM (gigabyte.com)

4

AI Influence Levels (danielmiessler.com)

1

A guide for developers who want to make contributions to the Linux kernel (gamma.app)

4

Ultra Ethernet's Design Principles and Architectural Innovations (arxiv.org)

2

NVIDIA Dynamo LLM Inference Framework (multimodalai.substack.com)

2

NVIDIA Posts Initial Linux Patches for Extended GPU Memory "EGM" Virtualization (phoronix.com)

1

Ollama 0.11.9 CPU/GPU Performance Optimization (phoronix.com)

15

The Kafka Replication Protocol with KIP-966 (github.com/vanlightly)

4

Sparrow: C++20 Idiomatic APIs for the Apache Arrow Columnar Format (github.com/man-group)

1

Troubleshooting Network Issues with Retina (microsoft.com)

1

A Demonstration of Q2O: Quantum-Augmented Query Optimizer [pdf] (vldb.org)

1

Latency, Tail Latency and Response time in distributed systems (andrewpakhomov.com)

3

The fast path/slow path mirage (medium.com/tom_84912)

3

Semantic search and document parsing tools for the command line (github.com/run-llama)

1

What Is Spurious Fault in Linux kernel? (medium.com/kimth0312)

2

vfio-user (qemu.org)

1

From Black Box to Blueprint (martinfowler.com)

1

Page Table Sharing in Linux (oracle.com)