arxiv.org
Open in
urlscan Pro
128.84.21.199
Public Scan
URL:
https://arxiv.org/abs/2006.13979v2
Submission: On May 18 via manual from BR — Scanned from DE
Submission: On May 18 via manual from BR — Scanned from DE
Form analysis
2 forms found in the DOMGET https://arxiv.org/search
<form class="level-item mini-search" method="GET" action="https://arxiv.org/search">
<div class="field has-addons">
<div class="control">
<input class="input is-small" type="text" name="query" placeholder="Search..." aria-label="Search term or terms">
<p class="help"><a href="https://arxiv.org/help">Help</a> | <a href="https://arxiv.org/search/advanced">Advanced Search</a></p>
</div>
<div class="control">
<div class="select is-small">
<select name="searchtype" aria-label="Field to search">
<option value="all" selected="selected">All fields</option>
<option value="title">Title</option>
<option value="author">Author</option>
<option value="abstract">Abstract</option>
<option value="comments">Comments</option>
<option value="journal_ref">Journal reference</option>
<option value="acm_class">ACM classification</option>
<option value="msc_class">MSC classification</option>
<option value="report_num">Report number</option>
<option value="paper_id">arXiv identifier</option>
<option value="doi">DOI</option>
<option value="orcid">ORCID</option>
<option value="author_id">arXiv author ID</option>
<option value="help">Help pages</option>
<option value="full_text">Full text</option>
</select>
</div>
</div>
<input type="hidden" name="source" value="header">
<button class="button is-small is-cul-darker">Search</button>
</div>
</form>
GET https://arxiv.org/search
<form class="mobile-search-form" method="GET" action="https://arxiv.org/search">
<div class="field has-addons">
<input class="input" type="text" name="query" placeholder="Search..." aria-label="Search term or terms">
<input type="hidden" name="source" value="header">
<input type="hidden" name="searchtype" value="all">
<button class="button">GO</button>
</div>
</form>
Text Content
GIVING WEEK! Show your support for Open Science by donating to arXiv during Giving Week, April 25th-29th. DONATE Skip to main content We gratefully acknowledge support from the Simons Foundation and member institutions. > cs > arXiv:2006.13979v2 Help | Advanced Search All fields Title Author Abstract Comments Journal reference ACM classification MSC classification Report number arXiv identifier DOI ORCID arXiv author ID Help pages Full text Search open search GO open navigation menu QUICK LINKS * Login * Help Pages * About COMPUTER SCIENCE > COMPUTATION AND LANGUAGE arXiv:2006.13979v2 (cs) [Submitted on 24 Jun 2020 (v1), last revised 15 Dec 2020 (this version, v2)] TITLE:UNSUPERVISED CROSS-LINGUAL REPRESENTATION LEARNING FOR SPEECH RECOGNITION Authors:Alexis Conneau, Alexei Baevski, Ronan Collobert, Abdelrahman Mohamed, Michael Auli Download PDF > Abstract: This paper presents XLSR which learns cross-lingual speech > representations by pretraining a single model from the raw waveform of speech > in multiple languages. We build on wav2vec 2.0 which is trained by solving a > contrastive task over masked latent speech representations and jointly learns > a quantization of the latents shared across languages. The resulting model is > fine-tuned on labeled data and experiments show that cross-lingual pretraining > significantly outperforms monolingual pretraining. On the CommonVoice > benchmark, XLSR shows a relative phoneme error rate reduction of 72% compared > to the best known results. On BABEL, our approach improves word error rate by > 16% relative compared to a comparable system. Our approach enables a single > multilingual speech recognition model which is competitive to strong > individual models. Analysis shows that the latent discrete speech > representations are shared across languages with increased sharing for related > languages. We hope to catalyze research in low-resource speech understanding > by releasing XLSR-53, a large model pretrained in 53 languages. Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS) Cite as: arXiv:2006.13979 [cs.CL] (or arXiv:2006.13979v2 [cs.CL] for this version) https://doi.org/10.48550/arXiv.2006.13979 Focus to learn more arXiv-issued DOI via DataCite SUBMISSION HISTORY From: Alexis Conneau [view email] [v1] Wed, 24 Jun 2020 18:25:05 UTC (282 KB) [v2] Tue, 15 Dec 2020 23:19:19 UTC (660 KB) Full-text links: DOWNLOAD: * PDF * Other formats (license) Current browse context: cs.CL < prev | next > new | recent | 2006 Change to browse by: cs cs.LG cs.SD eess eess.AS REFERENCES & CITATIONS * NASA ADS * Google Scholar * Semantic Scholar DBLP - CS BIBLIOGRAPHY listing | bibtex Alexis Conneau Alexei Baevski Ronan Collobert Abdelrahman Mohamed Michael Auli a export bibtex citation Loading... BIBTEX FORMATTED CITATION × loading... Data provided by: BOOKMARK Bibliographic Tools BIBLIOGRAPHIC AND CITATION TOOLS Bibliographic Explorer Toggle Bibliographic Explorer (What is the Explorer?) Litmaps Toggle Litmaps (What is Litmaps?) scite.ai Toggle scite Smart Citations (What are Smart Citations?) Code & Data CODE AND DATA ASSOCIATED WITH THIS ARTICLE arXiv Links to Code Toggle arXiv Links to Code & Data (What is Links to Code & Data?) OFFICIAL CODE No official code found; you can submit it here COMMUNITY CODE 4 code implementations (in PyTorch) DATASETS USED Common Voice 118 papers also use this dataset Demos DEMOS Replicate Toggle Replicate (What is Replicate?) No demos found for this article. You can add one here. Related Papers RECOMMENDERS AND SEARCH TOOLS Connected Papers Toggle Connected Papers (What is Connected Papers?) Core recommender toggle CORE Recommender (What is CORE?) About arXivLabs ARXIVLABS: EXPERIMENTAL PROJECTS WITH COMMUNITY COLLABORATORS arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website. Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them. Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs and how to get involved. Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?) * About * Help * contact arXivClick here to contact arXiv Contact * subscribe to arXiv mailingsClick here to subscribe Subscribe * Copyright * Privacy Policy * Web Accessibility Assistance * arXiv Operational Status Get status notifications via email or slack