arxiv.org Open in urlscan Pro
128.84.21.199 Public Scan

Back to summary

URL:
https://arxiv.org/abs/2006.13979v2
Submission: On May 18 via manual (May 18th 2022, 3:30:31 am UTC) from BR — Scanned from DE

Form analysis
2 forms found in the DOM

GET https://arxiv.org/search

<form class="level-item mini-search" method="GET" action="https://arxiv.org/search">
  <div class="field has-addons">
    <div class="control">
      <input class="input is-small" type="text" name="query" placeholder="Search..." aria-label="Search term or terms">
      <p class="help"><a href="https://arxiv.org/help">Help</a> | <a href="https://arxiv.org/search/advanced">Advanced Search</a></p>
    </div>
    <div class="control">
      <div class="select is-small">
        <select name="searchtype" aria-label="Field to search">
          <option value="all" selected="selected">All fields</option>
          <option value="title">Title</option>
          <option value="author">Author</option>
          <option value="abstract">Abstract</option>
          <option value="comments">Comments</option>
          <option value="journal_ref">Journal reference</option>
          <option value="acm_class">ACM classification</option>
          <option value="msc_class">MSC classification</option>
          <option value="report_num">Report number</option>
          <option value="paper_id">arXiv identifier</option>
          <option value="doi">DOI</option>
          <option value="orcid">ORCID</option>
          <option value="author_id">arXiv author ID</option>
          <option value="help">Help pages</option>
          <option value="full_text">Full text</option>
        </select>
      </div>
    </div>
    <input type="hidden" name="source" value="header">
    <button class="button is-small is-cul-darker">Search</button>
  </div>
</form>

GET https://arxiv.org/search

<form class="mobile-search-form" method="GET" action="https://arxiv.org/search">
  <div class="field has-addons">
    <input class="input" type="text" name="query" placeholder="Search..." aria-label="Search term or terms">
    <input type="hidden" name="source" value="header">
    <input type="hidden" name="searchtype" value="all">
    <button class="button">GO</button>
  </div>
</form>

Text Content

GIVING WEEK!

Show your support for Open Science by donating to arXiv during Giving Week,
April 25th-29th.

DONATE
Skip to main content
We gratefully acknowledge support from
the Simons Foundation and member institutions.
> cs > arXiv:2006.13979v2

Help | Advanced Search

All fields Title Author Abstract Comments Journal reference ACM classification
MSC classification Report number arXiv identifier DOI ORCID arXiv author ID Help
pages Full text
Search
open search
GO
open navigation menu


QUICK LINKS

 * Login
 * Help Pages
 * About


COMPUTER SCIENCE > COMPUTATION AND LANGUAGE

arXiv:2006.13979v2 (cs)
[Submitted on 24 Jun 2020 (v1), last revised 15 Dec 2020 (this version, v2)]


TITLE:UNSUPERVISED CROSS-LINGUAL REPRESENTATION LEARNING FOR SPEECH RECOGNITION

Authors:Alexis Conneau, Alexei Baevski, Ronan Collobert, Abdelrahman Mohamed,
Michael Auli
Download PDF

> Abstract: This paper presents XLSR which learns cross-lingual speech
> representations by pretraining a single model from the raw waveform of speech
> in multiple languages. We build on wav2vec 2.0 which is trained by solving a
> contrastive task over masked latent speech representations and jointly learns
> a quantization of the latents shared across languages. The resulting model is
> fine-tuned on labeled data and experiments show that cross-lingual pretraining
> significantly outperforms monolingual pretraining. On the CommonVoice
> benchmark, XLSR shows a relative phoneme error rate reduction of 72% compared
> to the best known results. On BABEL, our approach improves word error rate by
> 16% relative compared to a comparable system. Our approach enables a single
> multilingual speech recognition model which is competitive to strong
> individual models. Analysis shows that the latent discrete speech
> representations are shared across languages with increased sharing for related
> languages. We hope to catalyze research in low-resource speech understanding
> by releasing XLSR-53, a large model pretrained in 53 languages.

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Sound
(cs.SD); Audio and Speech Processing (eess.AS) Cite as: arXiv:2006.13979 [cs.CL]
  (or arXiv:2006.13979v2 [cs.CL] for this version)  
https://doi.org/10.48550/arXiv.2006.13979
Focus to learn more
arXiv-issued DOI via DataCite


SUBMISSION HISTORY

From: Alexis Conneau [view email]
[v1] Wed, 24 Jun 2020 18:25:05 UTC (282 KB)
[v2] Tue, 15 Dec 2020 23:19:19 UTC (660 KB)

Full-text links:


DOWNLOAD:

 * PDF
 * Other formats

(license)
Current browse context:
cs.CL
< prev   |   next >

new | recent | 2006
Change to browse by:
cs
cs.LG
cs.SD
eess
eess.AS



REFERENCES & CITATIONS

 * NASA ADS
 * Google Scholar
 * Semantic Scholar




DBLP - CS BIBLIOGRAPHY

listing | bibtex
Alexis Conneau
Alexei Baevski
Ronan Collobert
Abdelrahman Mohamed
Michael Auli
a export bibtex citation Loading...


BIBTEX FORMATTED CITATION

×
loading...
Data provided by:


BOOKMARK

Bibliographic Tools


BIBLIOGRAPHIC AND CITATION TOOLS

Bibliographic Explorer Toggle
Bibliographic Explorer (What is the Explorer?)
Litmaps Toggle
Litmaps (What is Litmaps?)
scite.ai Toggle
scite Smart Citations (What are Smart Citations?)



Code & Data


CODE AND DATA ASSOCIATED WITH THIS ARTICLE

arXiv Links to Code Toggle
arXiv Links to Code & Data (What is Links to Code & Data?)


OFFICIAL CODE

No official code found; you can submit it here


COMMUNITY CODE

4 code implementations (in PyTorch)


DATASETS USED

Common Voice
118 papers also use this dataset
Demos


DEMOS

Replicate Toggle
Replicate (What is Replicate?)

No demos found for this article. You can add one here.

Related Papers


RECOMMENDERS AND SEARCH TOOLS

Connected Papers Toggle
Connected Papers (What is Connected Papers?)
Core recommender toggle
CORE Recommender (What is CORE?)



About arXivLabs


ARXIVLABS: EXPERIMENTAL PROJECTS WITH COMMUNITY COLLABORATORS

arXivLabs is a framework that allows collaborators to develop and share new
arXiv features directly on our website.

Both individuals and organizations that work with arXivLabs have embraced and
accepted our values of openness, community, excellence, and user data privacy.
arXiv is committed to these values and only works with partners that adhere to
them.

Have an idea for a project that will add value for arXiv's community? Learn more
about arXivLabs and how to get involved.



Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)
 * About
 * Help

 * contact arXivClick here to contact arXiv Contact
 * subscribe to arXiv mailingsClick here to subscribe Subscribe

 * Copyright
 * Privacy Policy

 * Web Accessibility Assistance

 * arXiv Operational Status
   Get status notifications via email or slack

arxiv.org Open in urlscan Pro 128.84.21.199 Public Scan

Form analysis 2 forms found in the DOM

GET https://arxiv.org/search

GET https://arxiv.org/search

Text Content

arxiv.org Open in urlscan Pro
128.84.21.199 Public Scan

Form analysis
2 forms found in the DOM