Multilingual TTS for Kids' Learning

What is KORENANI's multilingual text-to-speech?

KORENANI's multilingual text-to-speech lets children hear the label and short description for a recognized photo. It supports nine languages: Japanese, English, Spanish, French, German, Italian, Portuguese, Chinese, and Korean.

One flower photo surrounded by nine different sound-wave bubbles — The same discovery can be replayed through nine language settings, using sound as another way to explore the word.

Hear the names of things, in nine languages

KORENANI reads the labels and short descriptions connected with recognized photos. A child who does not yet read can still hear the same discovery, while a parent compares the sound with the photo and visible text.

The implementation combines device text-to-speech with saved audio used as a fallback. Available voices and playback behavior can vary by language, device, downloaded voice resources, and connection state, so this guide does not describe the audio as a guaranteed native pronunciation.

Supported languages

KORENANI ships with vocabulary and audio for nine languages:

Japanese
English
Spanish
French
German
Italian
Portuguese
Chinese
Korean

Language	Example family use
Japanese	Confirm the everyday name in a home language
English	Hear the same photo in English and compare the sound
Spanish	Add common words to a family language routine
French	Replay the label and notice how the sound differs
German	Compare names for vehicles, tools, and household objects
Italian	Use food and daily objects as simple listening practice
Portuguese	Focus on the language your family wants to practice
Chinese	Hear the same object in a different language system
Korean	Build familiarity with nearby regional vocabulary

Two kinds of audio

Recognized items can include two kinds of audio:

Label audio — the short name associated with the recognized item.
Description audio — a short sentence that gives additional context about the item.

Some audio is saved with the recognition result, while device speech can provide a fallback. Playback speed and availability should not be promised for every device or network condition.

Turning photos into language moments

The real magic happens when audio meets the moment a child is curious:

Snap the flower you spotted on a walk and hear the name in two different languages back-to-back
Take a picture of dinner and have your child repeat each ingredient in their target language
Show the same photo again later and choose whether to listen in the same or another language

These activities create opportunities to hear a label connected with a family photo. They do not guarantee vocabulary growth, pronunciation quality, fluency, or memory outcomes; follow the child's interest and keep the activity conversational.

Multilingual Text-to-Speech for Kids' Learning