Articles by matt_d
1

WebAssembly: How Low Can a Bytecode Go? (acm.org)

2

Koala: A benchmark suite for performance-oriented shell-optimization research (github.com/kbensh)

2

Evaluating the Effectiveness of Memory Safety Sanitizers (computer.org)

2

Anarchy in the Database: A Survey and Evaluation of DBMS Extensibility [pdf] (vldb.org)

2

2025 AsiaLLVM Developers' Meeting Talks (youtube.com)

2

A mostly allocation-free optional type (joachim-breitner.de)

1

On the Feasibility of Deduplicating Compiler Bugs with Bisection (arxiv.org)

18

GPEmu: A GPU emulator for rapid, low-cost deep learning prototyping [pdf] (vldb.org)

1

TYPES 2025: The 31st International Conference on Types for Proofs and Programs (strath.ac.uk)

15

Magnetic Tape Storage Technology: usage, history, and future outlook (acm.org)

3

10 Myths of Scalable Parallel Languages, Part 3: New vs. Extended (chapel-lang.org)

3

Converging AI and HPC: Design and Optimization of a CGRA Architecture [pdf] (cogarchworkshop.org)

14

Multi-Stage Programming with Splice Variables (tsung-ju.org)

2

Type Theory and Themes in Philosophical Logic – Greg Restall – TYPES 2025 (consequently.org)

1

BinDSA: Efficient, Precise Binary-Level Pointer Analysis (acm.org)

1

Bridging the Gaps Between GNNs and Data-Flow Analysis: The Closer, the Better (acm.org)

2

AST, Bytecode and the In Between: An Exploration of Interpreter Design Tradeoffs (ecoop.org)

1

Lifetime Dispersion and Generational GC: An Intellectual Abstract (acm.org)

1

Low Overhead Allocation Sampling in a Garbage Collected Virtual Machine (arxiv.org)

3

Telescopes Are Tries: A Dependent Type Shellac on SQLite (philipzucker.com)

2

Linux x86_64 Assembly Programming Part 5: Macros (github.com/0xax)

1

CVDP: LLM Benchmark for Verilog RTL Design and Verification (arxiv.org)

2

Trip report: June 2025 ISO C++ standards meeting (herbsutter.com)

6

2025 Alonzo Church Award: Paul Blain Levy for Call-by-Push-Value (CBPV) (siglog.org)

2

Practical Type Inference with Levels (PLDI 2025 – Distinguished Paper Award) (sigplan.org)

2

The Ethical Compiler: Addressing the Is-Ought Gap in Compilation (PEPM 2025) [video] (youtube.com)

78

Compiling LLMs into a MegaKernel: A path to low-latency inference (zhihaojia.medium.com)

1

Usability Barriers for Liquid Types (acm.org)

1

Exploring the Theory and Practice of Concurrency in the ECS Pattern [pdf] (curious.software)

1

Elaboration with Error Recovery (github.com/brendanzab)

1

Sound Static Data Race Verification for C: Is the Race Lost? (sigplan.org)

2

Monads are not like burritos (byorgey.github.io)

1

Inside torch.compile Guards: How They Work, What They Cost, and Ways to Optimize (youtube.com)

4

phkmalloc (freebsd.dk)

2

Advent of Computing: Episode 159 – The Intel 286: A Legacy Trap (libsyn.com)

2

Introduction to Competitive Programming in Haskell (byorgey.github.io)

2

Relational Abstractions Based on Labeled Union-Find (codex.top)

2

A Guided Tour of Polarity and Focusing (hyperkind.org)

2

"How slow is the tracing interpreter of PyPy's meta-tracing JIT?" (cfbolz.de)

1

JIT Code Generation with AsmJit [video] (youtube.com)

1

Type Theory and Themes in Philosophical Logic (consequently.org)

3

EDAN: Towards Understanding Memory Parallelism and Latency Sensitivity in HPC [pdf] (ethz.ch)

1

Inspecting Compiler Optimizations on Mixed Boolean Arithmetic Obfuscation (ndss-symposium.org)

22

Modern Minimal Perfect Hashing: A Survey (arxiv.org)

2

How to Make the Most Out of SIMD on AArch64? (ieee.org)

3

[RFC] MLIR Dialect for WebAssembly (llvm.org)

1

An SMT Formalization of Mixed-Precision Matrix Multiplication (arxiv.org)

1

From Boolean logic to bitmath and SIMD: transitive closure of tiny graphs (bitmath.blogspot.com)

1

Autocomp: LLM-Driven Code Optimization for Tensor Accelerators (charleshong3.github.io)

2

GPU Memory Consistency: Specs, Testing, and Opportunities for Perf Tooling (sigarch.org)

3

Potential and Limitation of High-Frequency Cores and Caches (2024) (ucdavis.edu)

1

GDBMiner: Mining Precise Input Grammars on Almost Any System (dagstuhl.de)

2

Spegion: Implicit and Non-Lexical Regions with Sized Allocations (arxiv.org)

2

Linearity and Uniqueness (kcsrk.info)

2

[dupe] INTERCAL Rides Again – Restoring a Lost Compiler (libsyn.com)

1

Understanding Memory Management, Part 6: Basic Garbage Collection (educatedguesswork.org)

1

Uniqueness for Behavioural Types (kcsrk.info)

1

TPDE: A Fast Adaptable Compiler Back-End Framework (arxiv.org)

2

The missing guide to Dataflow Analysis in MLIR (lowlevelbits.com)

1

Current Continuation E2: Satnam Singh (Groq) [video] (youtube.com)

3

Games on ARM64: Introduction to FEX EMU, a fast usermode x86-64 emulator [video] (youtube.com)

1

Bidirectional typing with unification for higher-rank polymorphism (github.com/brendanzab)

26

DWARF as a Shared Reverse Engineering Format (lief.re)

1

Finite-Choice Logic Programming (POPL 2025) [video] (youtube.com)

1

Data-Driven Loop Fusion (cheshmi.cc)

1

Brute E-Graphs Modulo Theories 2: Extraction, Proofs, and Context (philipzucker.com)

1

Workload-Aware Cloud Efficiency:A Large-Scale Empirical Study of Cloud Workloads (acm.org)

2

Against Curry-Howard Mysticism (liamoc.net)

1

Maximal Simplification of Polyhedral Reductions (POPL 2025) [video] (youtube.com)

3

Prime Path Coverage in the GNU Compiler Collection (arxiv.org)

3

Oodle 2.9.14 and Intel 13th/14th gen CPUs (fgiesen.wordpress.com)

1

Current Continuation E1: Ranjit Jhala (UCSD) [video] (youtube.com)

2

Algebraic Type Theory, Part 1: Martin-Löf Algebras (arxiv.org)

2

Memory-Centric Computing: Solving Computing's Memory Problem (arxiv.org)

1

Programming Models for Correct and Modular Distributed Systems (eecs.berkeley.edu)

1

FCCM 2025 (constantinides.net)

1

Global Hash Tables Strike Back! An Analysis of Parallel GROUP BY Aggregation (arxiv.org)

1

eqsat: An Equality Saturation Dialect for Non-destructive Rewriting (arxiv.org)

2

Comparing Parallel Functional Array Languages: Programming and Performance (arxiv.org)

1

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI (arxiv.org)

1

Amortized Analysis as a Cost-Aware Abstraction Function (cmu.edu)

3

Chapter 2: Serializability Theory (1987 Concurrency Control Book) (muratbuffalo.blogspot.com)

1

SoK: Challenges and Paths Toward Memory Safety for eBPF [pdf] (nebelwelt.net)

1

Rod Burstall: In Memoriam (arxiv.org)

25

Using obscure graph theory to solve programming languages problems (reasonablypolymorphic.com)

1

What you want to know (but dare to ask) about Conjunctive Query Processing (harp-lab.com)

4

2025 EuroLLVM Developers' Meeting (youtube.com)

2

Reverse-Engineering the Address Translation Caches [pdf] (yarom.org)

1

A whippet waypoint / Nofl: A Precise Immix (wingolog.org)

2

What Works (and Doesn't) Selling Formal Methods (galois.com)

28

6502 Illegal Opcodes in the Siemens PC 100 Assembly Manual (1980) (pagetable.com)

1

Writing a preloadable malloc in Rust, using MMTk (humprog.org)

1

From Haskell to a New Structured Combinator Processor (hw.ac.uk)

2

Reverse Engineering DVFS Mechanisms [pdf] (takhandipu.github.io)

1

Programming Language Design and Implementation (PLDI) 2025: Accepted Papers (sigplan.org)

17

An Interactive Debugger for Rust Trait Errors (brown.edu)

25

Orders of Infinity (terrytao.wordpress.com)

2

Ozaki Scheme II: A GEMM-oriented emulation of FP matrix multiplication using INT (arxiv.org)

6

The Algebra of Patterns (Extended Version) (arxiv.org)

1

Bring­ing ISA se­man­tics to Lean and Lean-MLIR – Léo Stefanesco [video] (youtube.com)