matt_d - Hacker News

1

The Thing We All Obviously Want (kmicinski.com)

15 hours ago matt_d kmicinski.com

2

ActPlane: Programmable OS-Level Policy Enforcement for Agent Harnesses (arxiv.org)

19 hours ago matt_d arxiv.org

1

Fenwick trees for products mod 2ⁿ (bitmath.blogspot.com)

19 hours ago matt_d blogspot.com

1

A Fake Shell for Pangenomics (cornell.edu)

a day ago matt_d cornell.edu

1

Making Equality Saturation Usable for Developing Vectorized Compilers (acm.org)

a day ago matt_d acm.org

2

Reading AI Model Compilation in MLIR Through the Lens of Formal Theories (arxiv.org)

a day ago matt_d arxiv.org

2

Liveness Proofs in Veil, Part I: The First Step (proofsandintuitions.net)

a day ago matt_d proofsandintuitions.net

1

ParallelKernelBench: Can LLMs write fast multi-GPU kernels? (github.com/togethercomputer)

a day ago matt_d github.com

1

LXM: Better Splittable Pseudorandom Number Generators (and Almost as Fast) [video] (youtube.com)

2 days ago matt_d youtube.com

2

Code as Agent Harness (arxiv.org)

2 days ago matt_d arxiv.org

2

PICO: Performance Insights for Collective Operations (ieee.org)

2 days ago matt_d ieee.org

1

An Overview of Petri Net Theory [video] (youtube.com)

2 days ago matt_d youtube.com

1

2026 EuroLLVM Developers' Meeting Talks (youtube.com)

2 days ago matt_d youtube.com

2

VoltanaLLM: Energy-Efficient LLM Serving (supercomputing-system-ai-lab.github.io)

3 days ago matt_d github.io

2

Why Software Requirements Get Easier in an AI Economy (stng.substack.com)

3 days ago matt_d substack.com

1

Lifting E-Graphs: A Function Isn't a Constant (arxiv.org)

3 days ago matt_d arxiv.org

1

Inference Compute Shapes Frontier LLM Evaluation (arxiv.org)

3 days ago matt_d arxiv.org

1

Concordia: JIT-Compiled Persistent-Kernel Checkpt for Fault-Tolerant Inference (arxiv.org)

3 days ago matt_d arxiv.org

1

SIMT-Step Execution: A Flexible Operational Semantics for GPU Subgroup Behavior (arbersephirotheca.github.io)

3 days ago matt_d github.io

1

NektarIR: A Domain-Specific Compiler for High-Order FE Ops on Heterogeneous HW (arxiv.org)

3 days ago matt_d arxiv.org

2

C++ Lifetime-End Pointer-Zap and OOTA Progress (kernel.org)

4 days ago matt_d kernel.org

1

TIRx: An Open Compiler Stack for Evolving Frontier ML Kernels (apache.org)

4 days ago matt_d apache.org

1

What Is A Programming Language? – Advent of Computing Episode 184 (libsyn.com)

4 days ago matt_d libsyn.com

2

NVFP4 Blockscaled GEMM on NVIDIA RTX Pro Blackwell GPUs (SM12x) (colfax-intl.com)

5 days ago matt_d colfax-intl.com

3

LLVM-Snippy: An Instruction Sequence Generator. Part 1: Overview [video] (youtube.com)

6 days ago matt_d youtube.com

47

LLMs Are Complicated Now (ianbarber.blog)

a week ago matt_d ianbarber.blog

1

petite-vllm Part 2: KV Cache & Paged Attention (kristenmcintosh.dev)

a week ago matt_d kristenmcintosh.dev

1

Analyzing Bytes: Pre-Disassembly Static Binary Analysis (research.google)

a week ago matt_d research.google

1

Terminal-Bench Challenges: long-horizon, token-intensive, single-task benchmarks (tbench.ai)

a week ago matt_d tbench.ai

1

Dana Scott: Lambda Calculus, Forcing and the Foundations of Math: #14 aboutlogic [video] (youtube.com)

a week ago matt_d youtube.com

2

SE Radio 725: Danny Yang and Sam Goldman on the Pyrefly Type Checker (se-radio.net)

a week ago matt_d se-radio.net

9

Integer Quantization: Deep Dive (hello-fri-end.github.io)

a week ago matt_d github.io

3

M* (M-Star): A Modular, Extensible, Serving System for Multimodal Models (stanford.edu)

a week ago matt_d stanford.edu

3

From Minutes to Seconds: LLM-Guided Autotuning for Helion Kernels (pytorch.org)

a week ago matt_d pytorch.org

1

Zigzag Decoding with AVX-512 (zeux.io)

a week ago matt_d zeux.io

1

GenDB – LLM-Powered Generative Query Engine (solidlao.github.io)

a week ago matt_d github.io

20

AI Compute Extensions (ACE) Specification (x86ecosystem.org)

a week ago matt_d x86ecosystem.org

1

Loop Unrolling in the ML Era (hiraditya.github.io)

a week ago matt_d github.io

6

System call instrumentation on Linux/x86‑64 using memory‑indirect calls, part I (humprog.org)

a week ago matt_d humprog.org

1

Fearless Concurrency on the GPU (arxiv.org)

a week ago matt_d arxiv.org

1

Using Task Graph Caching to Accelerate TVM Code Generation (acm.org)

a week ago matt_d acm.org

1

Google's Training Supercomputers from TPU v2 to Ironwood: Five Generations (arxiv.org)

a week ago matt_d arxiv.org

6

The Return of Rigorous Full-System Timing Simulation (sigarch.org)

a week ago matt_d sigarch.org

4

Language integrated LLMs as an OCaml function (recoil.org)

a week ago matt_d recoil.org

2

Using OxCaml to implement type-safe reference counting between OCaml and Python (janestreet.com)

a week ago matt_d janestreet.com

2

Scalable GPU Acceleration of Scalar Functions in Analytical Databases (microsoft.com)

a week ago matt_d microsoft.com

1

Compiling Strassen-Like Matrix Multiplication Algorithms to Fast CUDA Kernels (acm.org)

a week ago matt_d acm.org

2

Programming Language Design and Implementation (PLDI) 2026 Live Streams (sigplan.org)

a week ago matt_d sigplan.org

1

Puzzling Success of Overparameterization: Lottery Tickets or Escape Dimensions? (epfl.ch)

a week ago matt_d epfl.ch

2

One More Type in the Tiny Type Theory (jcreedcmu.github.io)

a week ago matt_d github.io

3

A Galois Field Arithmetic Primer (tomverbeure.github.io)

a week ago matt_d github.io

1

An O(x)Caml book that runs (kcsrk.info)

a week ago matt_d kcsrk.info

4

Type Theory Forall #62 – Dependent Haskell – Vladislav Zavialov [video] (youtube.com)

a week ago matt_d youtube.com

4

Trip report: June 2026 ISO C++ standards meeting (Brno, Czechia) (herbsutter.com)

a week ago matt_d herbsutter.com

1

UnpredictaBench: A Benchmark for Evaluating Distributional Randomness in LLMs (arxiv.org)

2 weeks ago matt_d arxiv.org

1

Linear Algebra Kernels for the Age of Research (gpumode.com)

2 weeks ago matt_d gpumode.com

2

Latent learning: episodic memory complements parametric learning (openreview.net)

2 weeks ago matt_d openreview.net

1

NEURA: A Unified and Retargetable Compilation Framework for CGRAs (acm.org)

2 weeks ago matt_d acm.org

2

System Call Stack Alignment (humprog.org)

2 weeks ago matt_d humprog.org

1

Making FlashAttention-4 faster for inference (modal.com)

2 weeks ago matt_d modal.com

1

Precision Matters in Block Scales (constantinides.net)

2 weeks ago matt_d constantinides.net

2

Agents' Last Exam (arxiv.org)

2 weeks ago matt_d arxiv.org

2

Does the Harness Matter? Lessons from Ale-Claw on Agents' Last Exam (agents-last-exam.org)

2 weeks ago matt_d agents-last-exam.org

1

Demystifying NVSHMEM: System-Level: Symmetric Memory, Device-Initiated Ops (arxiv.org)

2 weeks ago matt_d arxiv.org

1

Enumerating Ill-Typed Programs for Testing Type Analyzers (acm.org)

2 weeks ago matt_d acm.org

1

Agentic Memory Management for GPU Code Generation (ucbskyadrs.github.io)

2 weeks ago matt_d github.io

1

CommBench: Can LLMs Write Correct and Efficient GPU Communication Code? (uccl-project.github.io)

2 weeks ago matt_d github.io

1

Frontier: A Discrete-Event Simulator for Modern LLM Serving (github.com/netx-lab)

2 weeks ago matt_d github.com

2

Piper: A Programmable Distributed Training System (washington.edu)

2 weeks ago matt_d washington.edu

1

Piper: A Programmable Distributed Training System (arxiv.org)

2 weeks ago matt_d arxiv.org

1

Radix Top-K: finding the top-k elements in an array without sorting (veitner.bearblog.dev)

2 weeks ago matt_d bearblog.dev

1

A Case for a Simulation-Driven Exploration of Distributed GenAI Platforms (acm.org)

2 weeks ago matt_d acm.org

1

Defeat the Heap: Zero-Copy Data Movement in AXI4MLIR (arxiv.org)

2 weeks ago matt_d arxiv.org

2

Breaking the Ice: Analyzing Cold Start Latency in vLLM (arxiv.org)

2 weeks ago matt_d arxiv.org

2

An Empirical Comparison of General Context-Free Parsers (arxiv.org)

2 weeks ago matt_d arxiv.org

1

RFC: Programming Languages Course Reboot, 2026 – Shriram Krishnamurthi (docs.google.com)

2 weeks ago matt_d google.com

1

CodegenBench: Can LLMs Write Efficient Code Across Architectures? (arxiv.org)

2 weeks ago matt_d arxiv.org

1

ACM SIGPLAN Programming Language Design and Implementation (PLDI) 2026 (acm.org)

2 weeks ago matt_d acm.org

2

Human Judgment as a Specification (brownplt.org)

2 weeks ago matt_d brownplt.org

3

OOBdump: Relocation Oriented Programming: Arbitrary code execution in objdump -g (calif.io)

2 weeks ago matt_d calif.io

3

Inference: Turning Electricity into Intelligence – Stanford CS336 – Dan Fu [video] (youtube.com)

2 weeks ago matt_d youtube.com

5

FP8 Is All You Need (Part 1): Debunking Hardware FP64 as the HPC Holy Grail (arxiv.org)

2 weeks ago matt_d arxiv.org

4

Modular Arithmetic Challenge (terrytao.wordpress.com)

2 weeks ago matt_d wordpress.com

1

The Return of Rigorous Full-System Timing Simulation (sigarch.org)

2 weeks ago matt_d sigarch.org

2

Co-Creator of Haskell: Functional Prog., Thinking in Types, Useless Languages [video] (youtube.com)

2 weeks ago matt_d youtube.com

1

Types for more than memory safety in OxCaml – Stephen Dolan – VeTSS 2026 [video] (youtube.com)

2 weeks ago matt_d youtube.com

112

The 29th International Obfuscated C Code Contest (IOCCC) 2025 Winners (ioccc.org)

2 weeks ago matt_d ioccc.org

3

Tensor Shapes in Pyrefly – Avik Chaudhuri – PyCon US 2026 Typing Summit [video] (youtube.com)

3 weeks ago matt_d youtube.com

1

BenchEvolver: Frontier Task Synthesis via Solution-Centric Evolution (benchevolver.github.io)

3 weeks ago matt_d github.io

2

JITDomain: Instruction-level JIT code isolation (sciencedirect.com)

3 weeks ago matt_d sciencedirect.com

2

Serving Transformers: Lessons from the Trenches – Stanford CS25 Transformers [video] (youtube.com)

3 weeks ago matt_d youtube.com

1

Constrained Adaptive Rejection Sampling (arxiv.org)

3 weeks ago matt_d arxiv.org

2

Training an Agentic Router for Optimal Cost-Performance on SWE Tasks (appliedcompute.com)

3 weeks ago matt_d appliedcompute.com

1

Diagramming Program Values by Spatial Refinement (brownplt.org)

3 weeks ago matt_d brownplt.org

1

Agent Arena: Causal Evaluation of Agents in the Real World (arena.ai)

3 weeks ago matt_d arena.ai

1

Can LLMs Reason Structurally? Benchmarking via the Lens of Data Structures (arxiv.org)

3 weeks ago matt_d arxiv.org

1

Recent improvements to the type checker – Swift Compiler (swift.org)

3 weeks ago matt_d swift.org

1

Semantic Reification: A New Paradigm for Random Program Generation (sigplan.org)

3 weeks ago matt_d sigplan.org

1

GPU Forecasters: Language Models as Selective Surrogates for Kernel Optimization (arxiv.org)

3 weeks ago matt_d arxiv.org

1

Type-Error Ablation and AI Coding Agents (arxiv.org)

3 weeks ago matt_d arxiv.org