Benchmark: A100 vs. H100 NVMe Random Read throughput during multi-GPU loading
Show HN: 50+ LLMs on 2 GPUs with 2-second swapping? We built an AI-native runtime (github.com/inferx-net)
Show HN: InferX – a Lambda-like inference function as a service
Show HN: We run 50+ LLMs on 2 GPUs using snapshot-based inference (inferx.net)
We're running 50 LLMs on 2 GPUs – no cold starts, no overprovisioning