Articles by jasondavies
78

The Adolescence of Technology (darioamodei.com)

2

32-Bit Integer Multiplication on Tenstorrent (jasondavies.com)

1

Optimal "where" on Tenstorrent (jasondavies.com)

3

Aristotle: IMO-Level Automated Theorem Proving (arxiv.org)

5

MirageLSD: The First Live-Stream Diffusion AI Video Model (decart.ai)

16

Why am I searched every time I go to Australia? (caseyhandmer.wordpress.com)

2

Tenstorrent: An Open Future (tenstorrent.com)

2

Hogs: Homogeneous Gaussian Splatting (kh129.github.io)

67

Bolt3D: Generating 3D Scenes in Seconds (szymanowiczs.github.io)

1

The Area of the Pythagoras Tree (penteract.github.io)

30

Sparse Voxels Rasterization: Real-Time High-Fidelity Radiance Field Rendering (svraster.github.io)

3

Large Language Diffusion Models (ml-gsai.github.io)

251

Mistral Small 3 (mistral.ai)

4

Startup Raises $200M to Bring Back the Woolly Mammoth (bloomberg.com)

31

Reversible computing escapes the lab (ieee.org)

1

Iterated Log Coding (adamscherlis.github.io)

3

Generative World Models for Film, Gaming, and Beyond (odyssey.systems)

2

LinGen: Text-to-Video Generation with Linear Computational Complexity (lineargen.github.io)

13

Don't Look Twice: Faster Video Transformers with Run-Length Tokenization (rccchoudhury.github.io)

1

Data movement bottlenecks to large-scale model training: Scaling past 1e28 FLOP (epochai.org)

96

We Can Terraform the American West (caseyhandmer.wordpress.com)

1

Cohere Multimodal Embed 3 (cohere.com)

4

Sabotage Evaluations for Frontier Models (anthropic.com)

1

Market Prices Are Not Probabilities (quantian.substack.com)

5

Google Executive Overseeing Search and Advertising Leaves Role (wsj.com)

4

Microsoft's new cross-platform virtual machine layer written in Rust (openvmm.dev)

12

Lotus: Diffusion-Based Visual Foundation Model for High-Quality Dense Prediction (lotus3d.github.io)

1

Bugs in LLM Training – Gradient Accumulation Fix (unsloth.ai)

1

Linearizing LLMs with LoLCATs (stanford.edu)

68

Machines of loving grace: How AI could transform the world for the better (darioamodei.com)

1

Diffusion for World Modeling: Visual Details Matter in Atari (diamond-wm.github.io)

25

INTELLECT–1: Launching the First Decentralized Training of a 10B Parameter Model (primeintellect.ai)

2

Gaussian Haircut: Human Hair Reconstruction with Strand-Aligned 3D Gaussians (eth-ait.github.io)

1

Everything Everywhere All at Once: LLMs Can In-Context Learn Multiple Tasks (arxiv.org)

1

Practical Rateless Set Reconciliation (arxiv.org)

3

Intelligence at the Edge of Chaos (arxiv.org)

4

How the US Lost the Solar Power Race to China (bloomberg.com)

3

COMO: Compact Mapping and Odometry (edexheim.github.io)

4

RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning (arxiv.org)

1

PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation (stevenlsw.github.io)

1

Clash of the Foundries: Gate All Around and Backside Power at 2nm (semianalysis.com)

151

Liquid Foundation Models: Our First Series of Generative AI Models (liquid.ai)

15

ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robots (rekep-robot.github.io)

16

Molmo: a family of open multimodal AI models (allenai.org)

3

The Mamba in the Llama: Distilling and Accelerating Hybrid Models (together.ai)

2

Speculative decoding for high-throughput long-context inference (together.ai)

2

The Memory Wall: Past, Present, and Future of DRAM (semianalysis.com)

2

Entrepreneurship changed the way I think (caseyhandmer.wordpress.com)

2

HyperCard in the World, May 2016 [video] (youtube.com)

1

Anti-aging tech fixes demographic collapse (caseyhandmer.wordpress.com)

2

Anthropic: Artifacts are now generally available (anthropic.com)

1

Preliminary Report on DisTrO (Distributed Training Over-the-Internet) [pdf] (github.com/nousresearch)

36

Splatt3R: Zero-Shot Gaussian Splatting from Uncalibrated Image Pairs (active.vision)

1

Terraforming Mars with Nanowires (caseyhandmer.wordpress.com)

2

Prompt Caching with Claude (anthropic.com)

22

MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images (donydchen.github.io)

1

From Explicit Cot to Implicit Cot: Learning to Internalize Cot Step by Step (arxiv.org)

1

Global Structure-from-Motion Revisited (lpanaf.github.io)

1

The Limitations of Compute Thresholds as a Governance Strategy (arxiv.org)

1

OpenDiLoCo, globally distributed low-communication AI model training (primeintellect.ai)

4

3D Gaussian Ray Tracing: Fast Tracing of Particle Scenes (gaussiantracer.github.io)

1

Magic Insert: Style-Aware Drag-and-Drop (magicinsert.github.io)

1

Diffusion Forcing: Next-Token Prediction Meets Full-Sequence Diffusion (boyuan.space)

2

The Mathematical Basics of Diffusion (saxton.ai)

1

Segment Any Text (arxiv.org)

19

DETRs Beat YOLOs on Real-Time Object Detection (zhao-yian.github.io)

1

Latent Intrinsics Emerge from Training to Relight (latent-intrinsics.github.io)

16

Computational Life: How self-replicating programs emerge from simple interaction (arxiv.org)

3

Apple, Microsoft Shrink AI Models to Improve Them (ieee.org)

3

Mip-Splatting: Alias-Free 3D Gaussian Splatting (niujinshuchong.github.io)

1

Connecting the Dots: LLMs Can Infer and Verbalize Latent Structure (arxiv.org)

1

Flash Diffusion: Accelerating Any Conditional Diffusion Model (arxiv.org)

2

Transcendence: Generative Models Can Outperform the Experts That Train Them (arxiv.org)

3

PlatoNeRF: 3D Reconstruction in Plato's Cave via Single-View Two-Bounce Lidar (platonerf.github.io)

3

GGHead: Fast and Generalizable 3D Gaussian Heads (tobias-kirschstein.github.io)

1

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation (fudan-generative-vision.github.io)

2

A Tour of Differentiable Rasterization (srush.github.io)

1

Neural Thermodynamic Integration: Free Energies from Diffusion Models (arxiv.org)

1

Multimodal Masked Modeling (epfl.ch)

1

Understanding Hallucinations in Diffusion Models Through Mode Interpolation (arxiv.org)

2

SuperPrimitive: Scene Reconstruction at a Primitive Level (makezur.github.io)

1

PowerInfer-2: Fast Large Language Model Inference on a Smartphone (powerinfer.ai)

1

Proteus: Real-Time Expressive Generative Humans (apparate.ai)

3

Torax: A Fast and Differentiable Tokamak Transport Simulator in Jax (arxiv.org)

1

Samba: Simple Hybrid State Space Models (arxiv.org)

2

An Image Is Worth 32 Tokens for Reconstruction and Generation (yucornetto.github.io)

1

Image Neural Field Diffusion Models (yinboc.github.io)

1

TextGrad: Automatic "Differentiation" via Text (arxiv.org)

1

Everything Apple Plans to Show at Its AI-Focused WWDC Event (bloomberg.com)

4

Apple Intelligence Is Right on Time (stratechery.com)

1

Emad Mostaque: "Happy to Announce SchellingAI" (twitter.com/emostaque)

3

Seiler's Interpolation for Evaluating Polynomial Curves (cemyuksel.com)

42

Dragonfly: A large vision-language model with multi-resolution zoom (together.ai)

1

Knockout: A simple way to handle missing inputs (arxiv.org)

28

SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales (arxiv.org)

1

Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models (conglu.co.uk)

2

Transformers are SSMs (Mamba-2) (arxiv.org)

2

Compressed-Language Models for Understanding Compressed File Formats: JPEG (arxiv.org)

1

RB-Modulation: Training-Free Personalization of Diffusion Models (rb-modulation.github.io)

2

Thermox: The First Thermodynamic Computing Simulator (normalcomputing.ai)