Alibaba's Qwen3-TTS "Full Suite" Open-Sourced and Launched

Stock News01-23

On January 22, according to an announcement on the Qwen official WeChat account, the Qwen3-TTS "full suite" was open-sourced and launched. Qwen3-TTS is a series of powerful speech generation models developed by Qwen, offering comprehensive support for voice cloning, voice creation, ultra-high-quality anthropomorphic speech generation, and speech control based on natural language descriptions, providing developers and users with the most extensive speech generation capabilities. Leveraging the innovative Qwen3-TTS-Tokenizer-12Hz multi-codebook speech encoder, Qwen3-TTS achieves efficient compression and strong representational capabilities for speech signals, not only fully preserving paralinguistic information and acoustic environment characteristics but also enabling high-speed, high-fidelity speech reconstruction through a lightweight non-DiT architecture. Qwen3-TTS utilizes Dual-Track modeling, achieving exceptional bi-directional streaming generation speed where the first audio packet requires waiting for just a single character. The entire multi-codebook series of Qwen3-TTS models has been open-sourced, including two sizes: 1.7B and 0.6B. The 1.7B model delivers peak performance with powerful control capabilities, while the 0.6B model balances performance and efficiency. The models cover 10 major languages (Chinese, English, Japanese, Korean, German, French, Russian, Portuguese, Spanish, Italian) and various dialectal voice tones, meeting global application needs. Simultaneously, the models possess robust contextual understanding capabilities, allowing them to adaptively adjust tone, rhythm, and emotional expression based on instructions and text semantics, with a significant improvement in robustness against input text noise. The models are now available as open-source on GitHub and can also be experienced via the Qwen API.

Disclaimer: Investing carries risk. This is not financial advice. The above content should not be regarded as an offer, recommendation, or solicitation on acquiring or disposing of any financial products, any associated discussions, comments, or posts by author or other users should not be considered as such either. It is solely for general information purpose only, which does not consider your own investment objectives, financial situations or needs. TTM assumes no responsibility or warranty for the accuracy and completeness of the information, investors should do their own research and may seek professional advice before investing.

Alibaba's Qwen3-TTS "Full Suite" Open-Sourced and Launched

Comments