1
1
What Is Control Flow Analysis for Lambda Calculus? – Iowa Type Theory Commute (buzzsprout.com)
2
Benchmarking a Baseline Fully-in-Place Functional Language Compiler [pdf] (trendsfp.github.io)
3
Trends in Functional Programming (TFP) 2026 (trendsfp.github.io)
2
Categorical Foundations for CuTe Layouts (arxiv.org)
1
StackWarp: Exploiting Stack Layout Vulnerabilities in Modern Processors (roots.ec)
7
Cloud RAM (mikekohn.net)
1
Triton Linear Layout: Examples (lei.chat)
1
When XLA Isn't Enough: From Pallas to VLIW with Splash Attention on TPU (patricktoulme.substack.com)
1
Warp Specialization in Triton: Design and Roadmap (pytorch.org)
2
Challenges and Research Directions for Large Language Model Inference Hardware (arxiv.org)
1
Library Liberation-Competitive Performance Through Compiler-Composed Nanokernels (arxiv.org)
1
Non-Traditional Profiling: "you can just put whatever you want in a jitdump" (mgaudet.ca)
1
Triton Extensions: a framework for developing and building compiler extensions (github.com/triton-lang)
1
FlashInfer-Bench: Building the Virtuous Cycle for AI-Driven LLM Systems (arxiv.org)
45
High-Performance DBMSs with io_uring: When and How to use it (arxiv.org)
1
Are DBMS Researchers Making Correct Assumptions about Transaction Workloads? (muratbuffalo.blogspot.com)
2
vLLM: An Efficient Inference Engine for Large Language Models (eecs.berkeley.edu)
3
Microarchitecture: What Happens Beneath – Matt Godbolt [video] (youtube.com)
1
SMTMSMT: Gluing Together CVC5 and Z3 Nelson Oppen Style (philipzucker.com)
2
Tilus: A Tile-Level GPGPU Programming Language for Low-Precision Computation (acm.org)
1
Optimal Software Pipelining and Warp Specialization for Tensor Core GPUs (arxiv.org)
1
Oral History of Jeffrey Ullman [video] (youtube.com)
1
CPU Autoscaling with a Kernel of Truth (acm.org)
4
ACM Transactions on Programming Languages & Systems: New Year, New Paper Tracks (acm.org)
2
An Empirical Study of Bugs in the rustc Compiler (OOPSLA 2025) [video] (youtube.com)
2
A "Ready-to-Use" Template for LLVM Out-of-Tree Passes (github.com/federicobruzzone)
2
Mini-SGLang: Efficient Inference Engine in a Nutshell (lmsys.org)
1
FrontierCS: Evolving Challenges for Evolving Intelligence (arxiv.org)
3
svc-hook: hooking system calls on ARM64 by binary rewriting (acm.org)
1
The Simple Essence of Monomorphization (Oopsla 2025) [video] (youtube.com)
1
Abusing x86 instructions to optimize PS3 emulation [RPCS3] [video] (youtube.com)
4
Decompiling the Synergy: Human–LLM Teaming in Reverse Engineering [pdf] (zionbasque.com)
1
Soteria Rust: the first symbolic execution engine with full Tree Borrows support [video] (youtube.com)
1
Testing and Benchmarking of AI Compilers (broune.com)
1
Interpreters everywhere! – Lindsey Kuper [video] (youtube.com)
1
The Wild West of post-POSIX IO Interfaces [video] (youtube.com)
1
Using the `vpternlogd` instruction for signed saturated arithmetic (wunkolo.github.io)
2
Indexed Reverse Polish Notation, an Alternative to AST (burakemir.ch)
1
ASM Visualizer: a new assembly visualization tool (diveintosystems.org)
1
Oral History of Jensen Huang – Computer History Museum [video] (youtube.com)
1
The Equational Theories Project: Collaborative Mathematical Research at Scale (terrytao.wordpress.com)
1
The Quest Toward That Perfect Compiler – ACM SPLASH / OOPSLA 2025 Keynote [video] (youtube.com)
1
Learning to love mesh-oriented sharding (ezyang.com)
1
Microbenchmarking NVIDIA's Blackwell: An In-Depth Architectural Analysis (arxiv.org)
1
tritonBLAS: Triton-based Analytical Approach for GEMM Kernel Parameter Selection (arxiv.org)
1
RFC: Forming a Working Group on Formal Specification for LLVM (llvm.org)
3
hls4ml: A Flexible, OSS Platform for ML Acceleration on Reconfigurable Hardware (arxiv.org)
1
Nice to Meet You: Synthesizing Practical MLIR Abstract Transformers [pdf] (utah.edu)
1
SAT Etudes 2: Toy DPLL (philipzucker.com)
3
The Hitchhiker's Guide to Coherent Fabrics: 5 Programming Rules (sigarch.org)
1
Optimizing libdwarf .eh_frame enumeration (rovarma.com)
1
GSoC 2025: ClangIR Upstreaming (llvm.org)
2
Normal Forms for MLIR – 2025 US LLVM Developers' Meeting – Alex Zinenko [video] (youtube.com)
1
Place Capability Graphs: A General-Purpose Model of Rust's Ownership & Borrowing [video] (youtube.com)
1
LLM Inference Beyond a Single Node: From Bottlenecks to Mitigations (arxiv.org)
2
What Scala can learn from Rust, Swift, and C++ [video] (youtube.com)
1
Lifetime Safety in Clang – 2025 US LLVM Developers' Meeting [video] (youtube.com)
3
Constant-time support coming to LLVM: Protecting cryptographic code (trailofbits.com)
1
Seymour Cray at 100 – Clive England – TNMoC Talk [video] (youtube.com)
5
Mitigating Application Resource Overload with Targeted Task Cancellation (muratbuffalo.blogspot.com)
1
MetaOCaml: Ten Years Later System Description (sciencedirect.com)
1
Where "Simulation" Came From (decomposition.al)
1
Inside VOLT: Designing an Open-Source GPU Compiler (arxiv.org)
1
An MLIR Pipeline for Offloading Fortran to FPGAs via OpenMP (acm.org)
3
Inside Nvidia GPU: Blackwell's Limitations & Future Rubin's Microarchitecture (github.com/zartbot)
1
Kitsune: Enabling Dataflow Execution on GPUs with Spatial Pipelines (acm.org)
1
DMA Collectives for Efficient ML Communication Offloads (arxiv.org)
4
10 Myths of Scalable Parallel Languages Part 8: Striving Toward Adoptability (chapel-lang.org)
8
Slicing Is All You Need: Towards a Universal One-Sided Distributed MatMul (arxiv.org)
2
Machine Scheduler in LLVM – Part II (myhsu.xyz)
1
The content-addressed storage (CAS) model of incremental build systems (jonmsterling.com)
2
Defeating the Training-Inference Mismatch via FP16 (arxiv.org)
3
Opportunistically Parallel Lambda Calculus (acm.org)
1
Place Capability Graphs: A General-Purpose Model of Rust's Ownership & Borrowing (acm.org)
2
Linear effects, exceptions, resources: Curry-Howard destructors correspondence (arxiv.org)
3
Making the Clang AST Leaner and Faster (cppalliance.org)
3
Draw high dimensional tensors as a matrix of matrices (ezyang.com)
1
Wafer-Scale AI Compute: A System Software Perspective (sigops.org)
2
Towards Automated GPU Kernel Generation (simonguo.tech)
1
Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs (arxiv.org)
2
Triton Developer Conference 2025 Talks [video] (youtube.com)
1
OpenEstimate Evaluating LLMs on Reasoning Under Uncertainty with Real-World Data (arxiv.org)
1
torchcomms: A modern PyTorch communications API (github.com/meta-pytorch)
2
Building an Open ABI and FFI for ML Systems (apache.org)
1
Instruction Set Migration at Warehouse Scale (arxiv.org)
2
Secure Parsing and Serializing with Separation Logic Applied to CBOR, CDDL, COSE [pdf] (microsoft.com)
2
The Calculated Typer – Haskell Symposium (ICFP⧸SPLASH'25) [video] (youtube.com)
1
PickleBall: Secure Deserialization of Pickle-Based Machine Learning Models (github.com/columbia)
1
Clang Bytecode Interpreter Update (redhat.com)
1
Scaling Instruction-Selection Verification Against Authoritative ISA Semantics (doi.org)
2
10 Myths of Scalable Parallel Languages Part 7: Minimalist Language Designs (chapel-lang.org)
1
CPU Autoscaling with a Kernel of Truth (acm.org)
1
SafeRace: WebGPU Memory Safety in the Presence of Data Races (acm.org)
1
A guided tour through Oxidized OCaml (gavinleroy.com)
1
Functional Networking for Millions of Docker Desktops (Experience Report) (acm.org)
1
Does Linux Provide Performance Isolation for NVMe SSDs? Configuring cgroups [pdf] (atlarge-research.com)
1
International Conference on Managed Programming Languages & Runtimes (MPLR) 2025 (acm.org)
1
Collective Matrix Multiplication – JAX Pallas:Mosaic GPU (jax.dev)
1