2
3
Light-Weight Cache Replacement for Instruction Heavy Workloads (acm.org)
1
LoopFrog: In-Core Hint-Based Loop Parallelization (danglingpointers.substack.com)
3
CHERIoT RTOS: An OS for Fine-Grained Memory-Safe Compartments (acm.org)
1
Dynamic Load Balancer in Intel Xeon Scalable Processor (danglingpointers.substack.com)
2
The XOR Cache: A Catalyst for Compression (acm.org)
1
Rearchitecting the Thread Model of In-Memory Key-Value Stores with μTPS (danglingpointers.substack.com)
3
CHERIoT: Complete Memory Safety for Embedded Devices (acm.org)
1
Morsel-Driven Parallelism (danglingpointers.substack.com)
1
Rearchitecting the Thread Model of In-Memory Key-Value Stores with μTPS (danglingpointers.substack.com)
1
Tai Chi: A General High-Efficiency Scheduling Framework for SmartNICs (danglingpointers.substack.com)
3
Oasis: Pooling PCIe Devices over CXL to Boost Utilization (acm.org)
2
Backdoors to Typical Case Complexity (danglingpointers.substack.com)
2
DPU-KV: DPU Offloading for In-Memory Key-Value Stores (acm.org)
1
How to Copy Memory? Coordinated Asynchronous Copy as a First-Class OS Service (danglingpointers.substack.com)
2
Synopsys and Nvidia Double Down on Acceleration (morethanmoore.substack.com)
2
High-Performance Query Processing with NVMe Arrays (acm.org)
2
Why Wait or Yield When You Can Preempt? (danglingpointers.substack.com)
4
Fast and Scalable Data Transfer Across Data Systems (acm.org)
2
Falcon: Google's Hardware Transport (danglingpointers.substack.com)
2
Bounding Speculative Execution of Atomic Regions to a Single Retry (acm.org)
40
Optimizing Datalog for the GPU (danglingpointers.substack.com)
2
Efficiently Processing Joins and Grouped Aggregations on GPUs (acm.org)
2
No Cap, This Memory Slaps: Breaking Through the OLTP Memory Wall (danglingpointers.substack.com)
3
Extended User Interrupts (XUI): Fast and Flexible Notification Without Polling (acm.org)
2
Parendi: Thousand-Way Parallel RTL Simulation (acm.org)
2
Skia: Exposing Shadow Branches (acm.org)
1
Principles and Methodologies for Serial Performance Optimization (danglingpointers.substack.com)
2
Accelerate Distributed Joins with Predicate Transfer (acm.org)
2
State-Compute Replication: Parallelizing High-Speed Stateful Packet Processing (danglingpointers.substack.com)
2
Scaling IP Lookup to Large Databases Using the Cram Lens (usenix.org)
1
InvisiFlow: Telemetry That Flows Like Water (danglingpointers.substack.com)
2
Ripple: Asynchronous Programming for Spatial Dataflow Architectures (danglingpointers.substack.com)
1
Why is the AI Act so hard to kill? (siliconcontinent.com)
2
Compiling Python to Run Anywhere (codingconfessions.com)
1
ISO: Request-Private Garbage Collection (danglingpointers.substack.com)
2
Ripple: Asynchronous Programming for Spatial Dataflow Architectures (acm.org)
2
What Wall Street Sees in the Data Center Boom (nytimes.com)
2
Disentangling the Dual Role of NIC Receive Rings (usenix.org)
2
Filtering After Shading with Stochastic Texture Filtering (nvidia.com)
3
Don't Repeat Yourself, Coarse-Grained Circuit Deduplication to Accelerate Sim (danglingpointers.substack.com)
1
Necro-Reaper: Pruning Away Dead Memory Traffic in Warehouse-Scale Computers (danglingpointers.substack.com)
2
Predicate Transfer: Efficient Pre-Filtering on Multi-Join Queries (danglingpointers.substack.com)
3