Token compression, dev loops, and hardware routing for LLM coding agents on constrained hardware.