What is Text to Speech?

Text to Speech reads any text aloud using your device's built-in speech synthesis. Pick a voice, adjust speed and pitch, and listen hands-free. All processing stays on your device.

Voices come straight from your operating system's speech engine, so the list changes depending on whether you're on Windows, macOS, Android, iOS, or a Linux distribution. Speed runs from 0.1x to 10x and pitch from 0 to 2. Chrome and Edge also let you record the playback to a downloadable WebM or MP4 file via the tab-share dialog.

How to use

  1. Type or paste the text you want to hear into the text area — any length from a sentence to full articles.
  2. Select a voice from the dropdown — available voices depend on your device and operating system.
  3. Adjust speed and pitch sliders to your preference, then press Play to listen. Use Pause and Stop controls as needed.

When to use

  • Listening to an article or PDF you don't have time to read while cooking or commuting.
  • Hearing a draft email or report read back so you catch awkward sentences before sending.
  • Practising pronunciation of a foreign-language passage by slowing playback to 0.75x.

Result

A language learner pastes a paragraph in French, selects a French voice, slows the speed to 0.75x, and listens carefully to practice pronunciation and comprehension.

FAQ

Why don't I see any voices in the dropdown?
Voice loading is asynchronous on some systems, especially Chrome. Give the page a second after opening, or refresh once. On Linux you may need to install a TTS engine like espeak or festival because most distros don't ship one by default.
Can I download the audio as an MP3 file?
Use the Record & Download button. Recording captures the tab audio as WebM (on Chromium) or MP4 (on Safari). You'll need to allow the tab-share prompt and tick 'Share tab audio'. To convert to MP3, run the result through any audio converter.
Why does the same voice sound different on my friend's computer?
Each operating system ships its own speech engine. Apple's Siri voices on macOS sound very different from Microsoft Edge's neural voices on Windows. The same selection name may even point to a slightly different model across versions.
Is there a character limit?
There's no strict cap on text length, but most engines chunk speech into roughly 32 KB blocks internally. Very long inputs (multiple articles in one go) sometimes cut off; if that happens, split the text into a few smaller pastes.
Does the text I paste get sent anywhere?
No. Speech synthesis happens locally through your device's TTS engine, and the recording feature stores audio in local memory until you download it. Nothing about the input or audio is uploaded to a server.

Related Tools