Specifies which engines (standard, neural or long-form) are supported by a given voice.
standard
neural
long-form