Kling 2.6 Model Introduces "Audio-Visual Sync" Capability, Leading Global Chinese Voice Generation

Stock News12-04

On December 3, Kling launched its video generation 2.6 model, featuring a groundbreaking "audio-visual sync" capability. This innovation transforms the traditional AI video generation workflow, which previously required silent visuals followed by manual dubbing. The new model can produce complete videos in a single generation, incorporating natural speech, action sound effects, and ambient audio, significantly enhancing creative efficiency.

The upgraded model introduces two major functions: text-to-audio-visual and image-to-audio-visual generation. Currently, it supports Chinese and English voice generation, with video lengths of up to 10 seconds. By deeply aligning semantic understanding of real-world sounds and dynamic visuals, the Kling 2.6 model excels in audio-visual synchronization, audio quality, and semantic comprehension. Notably, it maintains a globally leading position in Chinese voice generation quality.

Disclaimer: Investing carries risk. This is not financial advice. The above content should not be regarded as an offer, recommendation, or solicitation on acquiring or disposing of any financial products, any associated discussions, comments, or posts by author or other users should not be considered as such either. It is solely for general information purpose only, which does not consider your own investment objectives, financial situations or needs. TTM assumes no responsibility or warranty for the accuracy and completeness of the information, investors should do their own research and may seek professional advice before investing.

Comments

We need your insight to fill this gap
Leave a comment