Eintrag weiter verarbeiten
X-SRL dataset and mBERT word aligner
Gespeichert in:
Personen und Körperschaften: | |
---|---|
Titel: | X-SRL dataset and mBERT word aligner/ Angel Daza |
Format: | OnlineResource Computerdaten Datenbank |
Sprache: | Englisch |
veröffentlicht: |
Heidelberg
Universität
2021-02-17
|
Schlagwörter: | |
Quelle: | Verbunddaten SWB Lizenzfreie Online-Ressourcen |
Forschungsdaten zu: |
Daza, Angel, 1989 - : X-SRL |
Zusammenfassung: | This code contains a method to automatically align words from parallel sentences by using multilingual BERT pre-trained embeddings. This can be used to transfer source annotations (for example labeled English sentences) into the target side (for example a German translation of the sentence) by transferring the label into the best-aligned target word. This newly labeled data can be used to train different multilingual SOTA models to improve performance, especially for the lower-resource languages. |
---|---|
Beschreibung: |
Kind of data: Program source code Gesehen am 18.02.2021 |
Umfang: | 1 Online-Ressource (2 Files) |
DOI: | 10.11588/data/HVXXIJ |