A fast, local neural text to speech system