whichllm
Developed by Andyyyy64
whichllm is a CLI tool that detects your hardware and recommends the HuggingFace LLMs that fit on it and run efficiently. Rather than picking the largest model that fits in VRAM, it ranks models using real benchmark scores, quantization penalties, and estimated inference speed. It also offers GPU simulation, reverse hardware planning, one-command chat execution, and Python snippet generation.
- Auto-detect and simulate hardware configurations
- Evidence-based smart ranking via real benchmarks
- One-command model download and chat execution
- Live synchronization with HuggingFace data
- Reverse hardware planning and upgrade comparisons
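
To illustrate why ranking by benchmark score, quantization penalty, and estimated speed can give a different answer than "largest model that fits", here is a minimal Python sketch. It is not whichllm's actual algorithm: the `Candidate` type, the penalty multipliers, the overhead allowance, the bandwidth-based speed estimate, and all model names and scores are hypothetical values chosen for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    name: str               # hypothetical model label
    params_b: float         # parameter count, in billions
    bits_per_weight: float  # 16 for FP16, 4 for a 4-bit quantization, etc.
    benchmark: float        # benchmark score on a 0-100 scale (made-up values)


def weights_gb(c: Candidate) -> float:
    """Approximate size of the weights in GB (1e9 params * bits / 8 bytes)."""
    return c.params_b * c.bits_per_weight / 8


def quant_penalty(bits: float) -> float:
    """Assumed quality multipliers; real penalties depend on model and method."""
    if bits >= 16:
        return 1.0
    if bits >= 8:
        return 0.98
    if bits >= 4:
        return 0.93
    return 0.80


def est_tokens_per_s(c: Candidate, bandwidth_gbs: float) -> float:
    """Rough upper bound for memory-bound decoding: one pass over the weights per token."""
    return bandwidth_gbs / weights_gb(c)


def rank(cands, vram_gb, bandwidth_gbs, min_tps=10.0, overhead_gb=1.5):
    """Drop models that do not fit, then sort by penalized quality scaled by speed."""
    fitting = [c for c in cands if weights_gb(c) + overhead_gb <= vram_gb]

    def score(c: Candidate) -> float:
        quality = c.benchmark * quant_penalty(c.bits_per_weight)
        # Models slower than min_tps are scaled down proportionally.
        speed = min(1.0, est_tokens_per_s(c, bandwidth_gbs) / min_tps)
        return quality * speed

    return sorted(fitting, key=score, reverse=True)


candidates = [
    Candidate("model-a-70b-q2", 70, 2, 78),
    Candidate("model-b-32b-q4", 32, 4, 72),
    Candidate("model-c-8b-fp16", 8, 16, 60),
    Candidate("model-d-70b-q4", 70, 4, 80),  # 35 GB of weights, too large for 24 GB
]

for c in rank(candidates, vram_gb=24, bandwidth_gbs=900):
    print(c.name)
```

With these made-up numbers on a 24 GB GPU, the 32B 4-bit model ranks ahead of the 70B 2-bit model even though the 70B model is the largest one that fits, because the heavier quantization penalty outweighs its higher raw benchmark score.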