I Put a Datacenter GPU in My Gaming PC for £200

2026-05-31

Link: https://blog.tymscar.com/posts/v100localllm/

HN Discussion: 1 points, 0 comments

The URL slug gives the game away: v100localllm. Someone bought an NVIDIA Tesla V100 — a card that retailed for around $10,000 when it shipped in 2017 and powered a meaningful fraction of the world's deep learning research — for £200 on the secondhand market, and crammed it into a consumer gaming rig to run local LLMs.

This is the kind of post that rewards a careful read because the practical obstacles are non-trivial and the writeup almost certainly walks through them:

The payoff is 32GB of HBM2 memory at ~900 GB/s bandwidth — enough to comfortably run a 30B-parameter model in fp16, or much larger ones quantized. For comparison, a new RTX 4090 has 24GB at 1 TB/s and costs ten times as much. The V100's FP16 tensor performance (~125 TFLOPS) still embarrasses anything in the consumer Ampere line for inference workloads that fit its memory.

The broader story here is the secondhand datacenter GPU market, which has quietly become one of the best-kept secrets in the local AI hobbyist scene. As hyperscalers cycle out Volta and early Ampere hardware for Hopper and Blackwell, V100s, P40s, and M40s flood eBay at fractions of their original prices. For tinkerers willing to deal with cooling and power-connector adapters, you can build a 48GB+ VRAM LLM rig for under £500 — a configuration that's literally impossible to buy new at consumer prices.

Why it deserves more upvotes: A practical guide to the secondhand datacenter GPU underground — the cheapest path to serious local-LLM hardware that most developers don't know exists.

All newsletters