speaker.audio
Open in
urlscan Pro
172.67.223.215
Public Scan
URL:
https://speaker.audio/
Submission: On November 18 via api from BE — Scanned from DE
Submission: On November 18 via api from BE — Scanned from DE
Form analysis
0 forms found in the DOMText Content
* Home * TTS * About * POPULARITY LEADERBOARD Open source text-to-speech popularity leaderboard OUTETTS GGUF LLaMa Voice cloning OuteTTS, a novel TTS model, uses pure language modeling on LLaMa architecture (Oute3-350M-DEV base). It shows quality speech synthesis via crafted prompts & audio tokens, without external adapters or complex setups. View F5-TTS ConvNeXt V2 F5 E2 Sway Sampling A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching View XTTS-V2 Voice cloning Cross-language 24khz 17 languages ⓍTTS is a Voice generation model that lets you clone voices into different languages by using just a quick 6-second audio clip. There is no need for an excessive amount of training data that spans countless hours. View MASKGCT Voice cloning zero-shot Zero-Shot Text-to-Speech with Masked Generative Codec Transformer View FISH-SPEECH-1.4 Voice cloning zero-shot Multilingual Fast Fish Speech V1.4 is a leading text-to-speech (TTS) model trained on 700k hours of audio data in multiple languages View BARK highly realistic Multilingual Bark is a transformer-based text-to-audio model created by Suno. Bark can generate highly realistic, multilingual speech as well as other audio - including music, background noise and simple sound effects. View PARLER-TTS fully open-source lightweight Parler-TTS is a lightweight text-to-speech (TTS) model that can generate high-quality, natural sounding speech in the style of a given speaker (gender, pitch, speaking style, etc). View MELOTTS high-quality multi-lingual Parler-TTS is a lightweight text-to-speech (TTS) model that can generate high-quality, natural sounding speech in the style of a given speaker (gender, pitch, speaking style, etc). View Copyright ©2024 - Speaker.audio