Articles by zagwdt
52

Introspective Diffusion Language Models (introspective-diffusion.github.io)

2

EinsteinArena: Harnessing the collective intelligence of agents in the wild (einsteinarena.com)

1

RL Meets Adaptive Speculative Training (together.ai)

2

Weak models excel at long context tasks (together.ai)

1

TorchSpec: Speculative Decoding Training at Scale (pytorch.org)

1

Flash Attention 4 (together.ai)

1

CoderForge-Preview: SOTA open dataset for training efficient coding agents (together.ai)

1

Two years of vector search at Notion: 10x scale, 1/10th cost (notion.com)

69

Consistency diffusion language models: Up to 14x faster, no quality loss (together.ai)