78
2
32-Bit Integer Multiplication on Tenstorrent (jasondavies.com)
1
Optimal "where" on Tenstorrent (jasondavies.com)
3
Aristotle: IMO-Level Automated Theorem Proving (arxiv.org)
5
MirageLSD: The First Live-Stream Diffusion AI Video Model (decart.ai)
16
Why am I searched every time I go to Australia? (caseyhandmer.wordpress.com)
2
Tenstorrent: An Open Future (tenstorrent.com)
2
Hogs: Homogeneous Gaussian Splatting (kh129.github.io)
67
Bolt3D: Generating 3D Scenes in Seconds (szymanowiczs.github.io)
1
The Area of the Pythagoras Tree (penteract.github.io)
30
Sparse Voxels Rasterization: Real-Time High-Fidelity Radiance Field Rendering (svraster.github.io)
3
Large Language Diffusion Models (ml-gsai.github.io)
251
Mistral Small 3 (mistral.ai)
4
Startup Raises $200M to Bring Back the Woolly Mammoth (bloomberg.com)
31
Reversible computing escapes the lab (ieee.org)
1
Iterated Log Coding (adamscherlis.github.io)
3
Generative World Models for Film, Gaming, and Beyond (odyssey.systems)
2
LinGen: Text-to-Video Generation with Linear Computational Complexity (lineargen.github.io)
13
Don't Look Twice: Faster Video Transformers with Run-Length Tokenization (rccchoudhury.github.io)
1
Data movement bottlenecks to large-scale model training: Scaling past 1e28 FLOP (epochai.org)
96
We Can Terraform the American West (caseyhandmer.wordpress.com)
1
Cohere Multimodal Embed 3 (cohere.com)
4
Sabotage Evaluations for Frontier Models (anthropic.com)
1
Market Prices Are Not Probabilities (quantian.substack.com)
5
Google Executive Overseeing Search and Advertising Leaves Role (wsj.com)
4
Microsoft's new cross-platform virtual machine layer written in Rust (openvmm.dev)
12
Lotus: Diffusion-Based Visual Foundation Model for High-Quality Dense Prediction (lotus3d.github.io)
1
Bugs in LLM Training – Gradient Accumulation Fix (unsloth.ai)
1
Linearizing LLMs with LoLCATs (stanford.edu)
68
Machines of loving grace: How AI could transform the world for the better (darioamodei.com)
1
Diffusion for World Modeling: Visual Details Matter in Atari (diamond-wm.github.io)
25
INTELLECT–1: Launching the First Decentralized Training of a 10B Parameter Model (primeintellect.ai)
2
Gaussian Haircut: Human Hair Reconstruction with Strand-Aligned 3D Gaussians (eth-ait.github.io)
1
Everything Everywhere All at Once: LLMs Can In-Context Learn Multiple Tasks (arxiv.org)
1
Practical Rateless Set Reconciliation (arxiv.org)
3
Intelligence at the Edge of Chaos (arxiv.org)
4
How the US Lost the Solar Power Race to China (bloomberg.com)
3
COMO: Compact Mapping and Odometry (edexheim.github.io)
4
RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning (arxiv.org)
1
PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation (stevenlsw.github.io)
1
Clash of the Foundries: Gate All Around and Backside Power at 2nm (semianalysis.com)
151
Liquid Foundation Models: Our First Series of Generative AI Models (liquid.ai)
15
ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robots (rekep-robot.github.io)
16
Molmo: a family of open multimodal AI models (allenai.org)
3
The Mamba in the Llama: Distilling and Accelerating Hybrid Models (together.ai)
2
Speculative decoding for high-throughput long-context inference (together.ai)
2
The Memory Wall: Past, Present, and Future of DRAM (semianalysis.com)
2
Entrepreneurship changed the way I think (caseyhandmer.wordpress.com)
2
HyperCard in the World, May 2016 [video] (youtube.com)
1
Anti-aging tech fixes demographic collapse (caseyhandmer.wordpress.com)
2
Anthropic: Artifacts are now generally available (anthropic.com)
1
Preliminary Report on DisTrO (Distributed Training Over-the-Internet) [pdf] (github.com/nousresearch)
36
Splatt3R: Zero-Shot Gaussian Splatting from Uncalibrated Image Pairs (active.vision)
1
Terraforming Mars with Nanowires (caseyhandmer.wordpress.com)
2
Prompt Caching with Claude (anthropic.com)
22
MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images (donydchen.github.io)
1
From Explicit Cot to Implicit Cot: Learning to Internalize Cot Step by Step (arxiv.org)
1
Global Structure-from-Motion Revisited (lpanaf.github.io)
1
The Limitations of Compute Thresholds as a Governance Strategy (arxiv.org)
1
OpenDiLoCo, globally distributed low-communication AI model training (primeintellect.ai)
4
3D Gaussian Ray Tracing: Fast Tracing of Particle Scenes (gaussiantracer.github.io)
1
Magic Insert: Style-Aware Drag-and-Drop (magicinsert.github.io)
1
Diffusion Forcing: Next-Token Prediction Meets Full-Sequence Diffusion (boyuan.space)
2
The Mathematical Basics of Diffusion (saxton.ai)
1
Segment Any Text (arxiv.org)
19
DETRs Beat YOLOs on Real-Time Object Detection (zhao-yian.github.io)
1
Latent Intrinsics Emerge from Training to Relight (latent-intrinsics.github.io)
16
Computational Life: How self-replicating programs emerge from simple interaction (arxiv.org)
3
Apple, Microsoft Shrink AI Models to Improve Them (ieee.org)
3
Mip-Splatting: Alias-Free 3D Gaussian Splatting (niujinshuchong.github.io)
1
Connecting the Dots: LLMs Can Infer and Verbalize Latent Structure (arxiv.org)
1
Flash Diffusion: Accelerating Any Conditional Diffusion Model (arxiv.org)
2
Transcendence: Generative Models Can Outperform the Experts That Train Them (arxiv.org)
3
PlatoNeRF: 3D Reconstruction in Plato's Cave via Single-View Two-Bounce Lidar (platonerf.github.io)
3
GGHead: Fast and Generalizable 3D Gaussian Heads (tobias-kirschstein.github.io)
1
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation (fudan-generative-vision.github.io)
2
A Tour of Differentiable Rasterization (srush.github.io)
1
Neural Thermodynamic Integration: Free Energies from Diffusion Models (arxiv.org)
1
Multimodal Masked Modeling (epfl.ch)
1
Understanding Hallucinations in Diffusion Models Through Mode Interpolation (arxiv.org)
2
SuperPrimitive: Scene Reconstruction at a Primitive Level (makezur.github.io)
1
PowerInfer-2: Fast Large Language Model Inference on a Smartphone (powerinfer.ai)
1
Proteus: Real-Time Expressive Generative Humans (apparate.ai)
3
Torax: A Fast and Differentiable Tokamak Transport Simulator in Jax (arxiv.org)
1
Samba: Simple Hybrid State Space Models (arxiv.org)
2
An Image Is Worth 32 Tokens for Reconstruction and Generation (yucornetto.github.io)
1
Image Neural Field Diffusion Models (yinboc.github.io)
1
TextGrad: Automatic "Differentiation" via Text (arxiv.org)
1
Everything Apple Plans to Show at Its AI-Focused WWDC Event (bloomberg.com)
4
Apple Intelligence Is Right on Time (stratechery.com)
1
Emad Mostaque: "Happy to Announce SchellingAI" (twitter.com/emostaque)
3
Seiler's Interpolation for Evaluating Polynomial Curves (cemyuksel.com)
42
Dragonfly: A large vision-language model with multi-resolution zoom (together.ai)
1
Knockout: A simple way to handle missing inputs (arxiv.org)
28
SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales (arxiv.org)
1
Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models (conglu.co.uk)
2
Transformers are SSMs (Mamba-2) (arxiv.org)
2
Compressed-Language Models for Understanding Compressed File Formats: JPEG (arxiv.org)
1
RB-Modulation: Training-Free Personalization of Diffusion Models (rb-modulation.github.io)
2