Platform Support

qwen-tts runs on macOS, Linux, and Windows. The CLI automatically detects your hardware and selects the best inference backend during config init.

Support Matrix

| Platform | Backend | Performance | Notes |
| --- | --- | --- | --- |
| macOS Apple Silicon (M1/M2/M3/M4) | mlx | Best | Native MLX acceleration. Recommended platform. Uses mlx_audio for inference with optimized MLX model weights. |
| macOS Intel | cpu | Slow | No GPU acceleration available. Falls back to PyTorch CPU inference. |
| Linux + NVIDIA GPU | cuda | Fast | Requires NVIDIA drivers and CUDA toolkit. Uses PyTorch with CUDA for inference. |
| Linux CPU-only | cpu | Slow | PyTorch CPU inference. Functional but not recommended for regular use. |
| Windows + NVIDIA GPU | cuda | Fast | Requires NVIDIA drivers and CUDA toolkit. Uses PyTorch with CUDA for inference. |
| Windows CPU-only | cpu | Slow | PyTorch CPU inference. Functional but not recommended for regular use. |

Backend Detection

When you run qwen-tts config init, the following logic determines your backend (a code sketch follows the list):

  1. If the OS is macOS and the architecture is aarch64 (Apple Silicon) -> mlx
  2. Otherwise, if nvidia-smi is found and returns success -> cuda
  3. Otherwise -> cpu
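
The same order, sketched in Python. The function name and exact calls are illustrative, not the CLI's actual internals:

```python
import platform
import shutil
import subprocess

def detect_backend() -> str:
    """Illustrative sketch of the backend-detection order described above."""
    # 1. Apple Silicon Macs get the MLX backend. Python reports the
    #    architecture as "arm64" on macOS, "aarch64" elsewhere.
    if platform.system() == "Darwin" and platform.machine() in ("arm64", "aarch64"):
        return "mlx"
    # 2. A working nvidia-smi indicates a usable NVIDIA GPU.
    if shutil.which("nvidia-smi"):
        result = subprocess.run(["nvidia-smi"], capture_output=True)
        if result.returncode == 0:
            return "cuda"
    # 3. Everything else falls back to CPU inference.
    return "cpu"
```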

You can override the auto-detected backend manually:

qwen-tts config set backend cuda

Python Dependencies by Backend

Each backend requires different Python packages in the virtual environment:

MLX (Apple Silicon)

pip install mlx-audio huggingface-hub
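
To confirm the MLX stack imports cleanly after installation, a quick check (this assumes the mlx-audio package exposes the mlx_audio module, its conventional import name):

```python
import mlx.core as mx
import mlx_audio  # noqa: F401  # assumed module name for the mlx-audio package

# On Apple Silicon this should report a GPU device.
print(mx.default_device())
```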

CUDA (NVIDIA GPU)

pip install torch transformers huggingface-hub
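
After installing, you can verify that PyTorch actually sees the GPU (standard PyTorch calls, not part of qwen-tts):

```python
import torch

# False here usually means a driver or CUDA toolkit problem.
print(torch.cuda.is_available())
if torch.cuda.is_available():
    print(torch.cuda.get_device_name(0))
```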

CPU

pip install torch transformers huggingface-hub --extra-index-url https://download.pytorch.org/whl/cpu

Audio Playback

Generated audio is played automatically when auto_play is enabled. The playback command depends on the platform:

| Platform | Command |
| --- | --- |
| macOS | afplay (built-in) |
| Windows | PowerShell SoundPlayer |
| Linux | aplay, paplay, or ffplay (tried in order) |

If no audio player is found, a warning is printed and the generated file is still saved to disk.
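
As a hedged sketch of the Linux fallback behavior described above (the function name and exact flags are illustrative, not the CLI's internals):

```python
import shutil
import subprocess
import warnings

def play_audio(path: str) -> None:
    """Try the Linux players in the documented order; warn if none exist."""
    for player in ("aplay", "paplay", "ffplay"):
        if shutil.which(player):
            cmd = [player, path]
            if player == "ffplay":
                # ffplay needs flags to exit after playback and stay quiet.
                cmd = ["ffplay", "-nodisp", "-autoexit", "-loglevel", "quiet", path]
            subprocess.run(cmd, check=False)
            return
    warnings.warn(f"No audio player found; file kept at {path}")
```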

Model Variants by Backend

| Backend | Pro Variant | Lite Variant |
| --- | --- | --- |
| MLX | mlx-community/Qwen3-TTS-bf16 | mlx-community/Qwen3-TTS-4bit |
| CUDA | Qwen/Qwen3-TTS | Qwen/Qwen3-TTS |
| CPU | Qwen/Qwen3-TTS | Qwen/Qwen3-TTS |

On non-MLX backends, both pro and lite use the same upstream PyTorch checkpoint from Qwen.
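
The table above amounts to a small lookup. A sketch, with the dictionary and function names invented for illustration:

```python
# Illustrative mapping of (backend, variant) -> Hugging Face repo ID,
# taken directly from the table above.
MODEL_REPOS = {
    ("mlx", "pro"): "mlx-community/Qwen3-TTS-bf16",
    ("mlx", "lite"): "mlx-community/Qwen3-TTS-4bit",
}

def model_repo(backend: str, variant: str) -> str:
    # Non-MLX backends share the same upstream PyTorch checkpoint.
    return MODEL_REPOS.get((backend, variant), "Qwen/Qwen3-TTS")
```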