Home Papers RSS LibraryAnnouncements Changelog PPT

Papers

解析模型

Sign in to view your remaining parses.

Email me when analysis completesPick favorite folders after submittingKeep analysis private from users who haven't submitted this paper (still saved as your default analysis)

Tag Filter

Text-to-Speech Synthesis

Tacotron: Towards End-to-End Speech Synthesis

Published:3/30/2017

End-to-End Speech Synthesis ModelTacotron ModelSequence-to-Sequence LearningText-to-Speech SynthesisGenerative Models in NLP

Tacotron is an endtoend texttospeech model that synthesizes speech directly from characters, simplifying complex traditional TTS systems. Trained from scratch, it scores 3.82 in mean opinion, outperforming existing systems in naturalness and offering faster generation speeds.

WaveNet: A Generative Model for Raw Audio

Published:9/13/2016

Audio Generation ModelWaveNet ArchitectureText-to-Speech SynthesisAutoregressive ModelingMusic Generation

WaveNet is introduced as a deep neural network for raw audio generation, featuring probabilistic and autoregressive properties. It excels in texttospeech tasks, surpassing existing systems in naturalness, and shows high realism in music generation while also achieving promising

1 - 2 / 2

Go to

© 2025 AiPaper · Friend Links · Sitemap