Articles by polymorph1sm
10

Apply video compression on KV cache to 10,000x less error at Q4 quant (github.com/cenconq25)

1

In Forecasting, Search >> Distillation (spylab.ai)

1

Benchmarking the continuous improvement of language agents in deployment (arxiv.org)