opus.nlpl.eu Open in urlscan Pro
193.166.25.9  Public Scan

Submitted URL: http://opus.nlpl.eu/
Effective URL: https://opus.nlpl.eu/
Submission: On February 24 via api from US — Scanned from FI

Form analysis 0 forms found in the DOM

Text Content

ContributePublicationsCorpora

ContributePublicationsCorpora
17
NEWS
CLOSE NEWS

MDN_Web_Docs

2023-09-25

NLLB

2023-09-07

OPUS on GitHub



Liv4ever and ELITR-ECA

2021-12-08

CCMatrix

2021-06-28

Updated: ParaCrawl and MultiParaCrawl

2021-06-11

New: MT560 dataset

2021-04-02

CCAligned and MultiCCAligned

2021-02-10

GoURMET and MIZAN

2020-11-27

EuroPat and tico-19

2020-10-31

OPUS-100 corpus

2020-06-30

ELRC public

2020-05-22

MultiParaCrawl

2019-10-16

Infopankki v1

2019-10-14

New corpus: memat (Xhosa/English)

2018-10-06

New corpora: ParaCrawl, XhosaNavy

2018-02-15

New version: OpenSubtitles2018

2017-11-06


FIND YOUR CORPORA

Source language
Afar (source)
Target language
Abkhazian (target)
Search


AN OVERVIEW OF THE OPUS COLLECTION

1,210 corpora

45,945,946,108 total sentence pairs

744 languages available

This map displays 10 corpora , which make up a total 93.40% of the entire OPUS
collection

See next 10

CorpusSentences% of
OPUSNLLB13B28.31CCMatrix11B23.64OpenSubtitles8.5B18.53MultiCCAligned2.2B4.87840ParaCrawl1.5B3.26229DGT1.1B2.37845XLEnt883M1.92148MultiParaCrawl789M1.71653LinguaTools-WikiTitles487M1.06082CCAligned439M0.95442


OUR CONTRIBUTORS




TOOLS & INFO

Opus APIOpus TrainerOpus CleanerOpus WordalignOpus FilterOpus Translator

Opus QueryOpus Tools (Python Package)Opus Tools (Perl Package)MT-DataEflomal
Word AlignerContribute to OPUS
Opus Legacy