lostecho/edge-tts

Fork 0

Files

YuanHui 9d1429eeb2 add web ui

2025-12-02 12:22:06 +08:00

3.6 KiB

Raw Blame History

Quick Start Guide

🚀 Get Started in 3 Steps

1. Install Dependencies

cd web
pip install -r requirements.txt

2. Start the Server

./start.sh

Or manually:

python3 server.py

3. Open Your Browser

Visit: http://localhost:8000

✨ Features at a Glance

Text to Speech

Enter any text (up to 5000 characters)
Select from 100+ voices in multiple languages
Adjust speed, volume, and pitch
Generate natural-sounding speech

Voice Selection

Filter by language and gender
Preview voice names and locales
Save your favorite settings

Audio Controls

Play audio directly in browser
Download as MP3 files
View generation history
Quick reload from history

PWA Features

Install as standalone app
Offline support with service worker
Works on desktop and mobile
Responsive design

📱 Install as App

On Desktop (Chrome/Edge)

Click the install icon in the address bar
Or look for "Install App" button in the UI
App will be added to your applications

On Mobile (Android)

Open in Chrome
Tap the menu (⋮)
Select "Add to Home screen"
App icon will appear on home screen

On iOS (Safari)

Tap the share button
Select "Add to Home Screen"
Name the app and add to home screen

🎯 Quick Usage Tips

Generate Speech

Enter or paste text
Select a voice (default: English)
Adjust speed/volume/pitch if needed
Click "Generate Speech"
Audio player appears with playback controls

Download Audio

Click "Download MP3" button
File saves with timestamp and text snippet

Use History

Recent generations saved automatically
Click "Load" to restore settings
Click "Delete" to remove from history

Filter Voices

Use language dropdown for specific locales
Use gender filter for Male/Female voices
Voice list updates automatically

🔧 Configuration

Change Port

python3 server.py --port 8080

Enable Hot Reload (Development)

python3 server.py --reload

Bind to Specific Host

python3 server.py --host 127.0.0.1

⚡ API Usage

Test with cURL

Get voices:

curl http://localhost:8000/api/voices

Generate speech:

curl -X POST http://localhost:8000/api/synthesize \
  -H "Content-Type: application/json" \
  -d '{
    "text": "Hello, world!",
    "voice": "en-US-EmmaMultilingualNeural",
    "rate": "+0%",
    "volume": "+0%",
    "pitch": "+0Hz"
  }' \
  --output speech.mp3

🎨 Customization

Update Theme Color

Edit styles.css:

:root {
    --primary-color: #2563eb; /* Your color here */
}

Update manifest.json:

{
  "theme_color": "#2563eb"
}

Replace Icons

Create PNG icons:

icon-192.png - 192x192 pixels
icon-512.png - 512x512 pixels

Use any image editing tool or online icon generator.

🐛 Troubleshooting

Port Already in Use

python3 server.py --port 8080

Dependencies Not Found

pip3 install -r requirements.txt

Voices Not Loading

Check internet connection
Check server logs for errors
Try refreshing the page

Service Worker Issues

Clear browser cache
Hard refresh (Ctrl+Shift+R or Cmd+Shift+R)
Check browser console for errors

📚 More Information

See README.md for detailed documentation including:

Full API reference
Deployment guide
Docker setup
Production considerations
Contributing guidelines

🎉 You're All Set!

Enjoy using Edge TTS Web UI!

For issues or questions, visit: https://github.com/rany2/edge-tts

3.6 KiB Raw Blame History