Text to Speech

An AI-powered assistant that converts text into speech using male and female voices representing the Northern, Central, and Southern regions of Vietnam, ideal for news articles, audio books, and more, featuring natural rhythm, intonation, and lifelike voice quality.

Values Delivered

Enhance Advantages and Competitiveness

Create new opportunities, build competitive advantages, and elevate the brand positioning of your business

Save Time and Costs

Reduce transcription time by over 80% for audio files lasting up to 2 hours

Optimize Products/Services

Boost work efficiency and reduce personnel and operational costs by 30-60%

Streamline Management and Operations

Enable businesses to improve and optimize functions and workflows while enhancing customer experience

Strong Support for the Visually Impaired

Serve as a reliable assistant, aiding visually impaired individuals in learning and accessing information

Outstanding Features

Regional and Gender-Specific Voices

Support individuals, agencies, and businesses in converting text into male and female voices representing the Northern, Central, and Southern regions of Vietnam.

Custom Voices

Create AI voices from your own audio samples. Your AI voice will represent your brand and can be used for text-to-speech conversion.

Contextual Voices

Convert text into speech tailored for formats like book reading and news narration. Users can customize pauses, speed, and pitch to suit their specific needs.

Natural Voices

VNPT SmartVoice delivers precise pauses and expressive intonation. The system predicts pronunciations of foreign words based on international conventions and standardizes input text such as numbers, dates, addresses, document codes, and more. Additionally, users can create custom phonetic dictionaries to define how specific words are pronounced

Multi-Platform Integration

Easily integrate with various systems and devices across platforms such as mobile, websites, tablets, IoT devices, call centers, and more via API/SDK. Additionally, customers can use the service directly on the product’s website or deploy it on their own infrastructure

Fast and Accurate Responses

Achieve a high average Mean Opinion Score (MOS) from listeners. AI models are optimized for GPU performance and utilize server-to-server gRPC connections to handle high loads and accelerate processing speed