Hacker News Deep Cuts: I Put a Datacenter GPU in My Gaming PC for £200

I Put a Datacenter GPU in My Gaming PC for £200

2026-05-31

Link: https://blog.tymscar.com/posts/v100localllm/

The URL slug gives the game away: v100localllm. Someone bought an NVIDIA Tesla V100 — a card that retailed for around $10,000 when it shipped in 2017 and powered a meaningful fraction of the world's deep learning research — for £200 on the secondhand market, and crammed it into a consumer gaming rig to run local LLMs.

This is the kind of post that rewards a careful read because the practical obstacles are non-trivial and the writeup almost certainly walks through them:

Cooling. Datacenter GPUs are passively cooled — they assume a 1U or 2U chassis blasting air across them at jet-engine velocity. Dropping one into an ATX case means rigging a blower shroud or 3D-printing an adapter for a high-static-pressure fan.
Power. The V100 wants an EPS/CPU-style 8-pin connector, not the PCIe 8-pin most gaming PSUs ship with. Pinouts differ; getting this wrong releases the magic smoke.
Form factor. SXM2 vs. PCIe variants matter enormously — SXM2 cards need a breakout board that costs more than the GPU itself, so the author almost certainly went PCIe.
Drivers. Tesla cards work with consumer drivers but lose features like proper display output; you're running headless or via an integrated GPU.

The payoff is 32GB of HBM2 memory at ~900 GB/s bandwidth — enough to comfortably run a 30B-parameter model in fp16, or much larger ones quantized. For comparison, a new RTX 4090 has 24GB at 1 TB/s and costs ten times as much. The V100's FP16 tensor performance (~125 TFLOPS) still embarrasses anything in the consumer Ampere line for inference workloads that fit its memory.

The broader story here is the secondhand datacenter GPU market, which has quietly become one of the best-kept secrets in the local AI hobbyist scene. As hyperscalers cycle out Volta and early Ampere hardware for Hopper and Blackwell, V100s, P40s, and M40s flood eBay at fractions of their original prices. For tinkerers willing to deal with cooling and power-connector adapters, you can build a 48GB+ VRAM LLM rig for under £500 — a configuration that's literally impossible to buy new at consumer prices.

Why it deserves more upvotes: A practical guide to the secondhand datacenter GPU underground — the cheapest path to serious local-LLM hardware that most developers don't know exists.

All newsletters