# Format: <UD training corpus> <ISO 639-1 code (for OSCAR pretraining)> <Language name> <Recommended (chosen by size)>
UD_Afrikaans-AfriBooms af Afrikaans
UD_Ancient_Greek-Perseus ? Ancient_Greek
UD_Ancient_Greek-PROIEL ? Ancient_Greek *
UD_Arabic-PADT ar Arabic
UD_Armenian-ArmTDP hy Armenian
UD_Western_Armenian-ArmTDP hy Western_Armenian
UD_Basque-BDT eu Basque
UD_Belarusian-HSE be Belarusian
UD_Bulgarian-BTB bg Bulgarian
UD_Catalan-AnCora ca Catalan
UD_Chinese-GSD zh Chinese *
UD_Chinese-GSDSimp zh Chinese
UD_Classical_Chinese-Kyoto ? Classical_Chinese
UD_Croatian-SET hr Croatian
UD_Czech-CAC cs Czech
UD_Czech-CLTT cs Czech
UD_Czech-FicTree cs Czech
UD_Czech-PDT cs Czech *
UD_Danish-DDT da Danish
UD_Dutch-Alpino nl Dutch *
UD_Dutch-LassySmall nl Dutch
UD_English-Atis en English
UD_English-EWT en English *
UD_English-GUM en English
UD_English-LinES en English
UD_English-ParTUT en English
UD_Estonian-EDT et Estonian *
UD_Estonian-EWT et Estonian
UD_Finnish-FTB fi Finnish
UD_Finnish-TDT fi Finnish *
UD_French-GSD fr French *
UD_French-ParTUT fr French
UD_French-Rhapsodie fr French
UD_French-Sequoia fr French
UD_Galician-CTG gl Galician
UD_German-GSD de German
UD_German-HDT de German *
UD_Greek-GDT el Greek
UD_Hebrew-HTB he Hebrew
UD_Hindi-HDTB hi Hindi
UD_Hungarian-Szeged hu Hungarian
UD_Icelandic-IcePaHC is Icelandic *
UD_Icelandic-Modern is Icelandic
UD_Indonesian-GSD id Indonesian
UD_Irish-IDT ga Irish
UD_Italian-ISDT it Italian *
UD_Italian-ParTUT it Italian
UD_Italian-PoSTWITA it Italian
UD_Italian-TWITTIRO it Italian
UD_Italian-VIT it Italian
UD_Japanese-GSD ja Japanese *
UD_Japanese-GSDLUW ja Japanese
UD_Korean-Kaist ko Korean
UD_Latin-ITTB la Latin *
UD_Latin-LLCT la Latin
UD_Latin-PROIEL la Latin
UD_Latin-UDante la Latin
UD_Latvian-LVTB lv Latvian
UD_Lithuanian-ALKSNIS lt Lithuanian
UD_Maltese-MUDT mt Maltese
UD_Norwegian-Bokmaal no Norwegian_Bokmål
UD_Norwegian-Nynorsk nn Norwegian_Nynorsk *
UD_Norwegian-NynorskLIA nn Norwegian_Nynorsk
UD_Old_French-SRCMF ? Old_French
UD_Persian-PerDT fa Persian *
UD_Persian-Seraji fa Persian
UD_Polish-PDB pl Polish *
UD_Polish-LFG pl Polish
UD_Portuguese-Bosque pt Portuguese
UD_Portuguese-GSD pt Portuguese *
UD_Romanian-Nonstandard ro Romanian *
UD_Romanian-RRT ro Romanian
UD_Romanian-SiMoNERo ro Romanian
UD_Russian-GSD ru Russian
UD_Russian-SynTagRus ru Russian *
UD_Russian-Taiga ru Russian
UD_Scottish_Gaelic-ARCOSG gd Scottish_Gaelic
UD_Serbian-SET sr Serbian
UD_Slovak-SNK sk Slovak
UD_Slovenian-SSJ sl Slovenian
UD_Spanish-AnCora es Spanish *
UD_Spanish-GSD es Spanish
UD_Swedish-LinES sv Swedish
UD_Swedish-Talbanken sv Swedish *
UD_Tamil-TTB ta Tamil
UD_Telugu-MTG te Telegu
UD_Turkish-Atis tr Turkish
UD_Turkish-BOUN tr Turkish
UD_Turkish-FrameNet tr Turkish
UD_Turkish-IMST tr Turkish
UD_Turkish-Kenet tr Turkish
UD_Turkish-Penn tr Turkish *
UD_Turkish-Tourism tr Turkish
UD_Ukrainian-IU uk Ukrainian
UD_Urdu-UDTB ur Urdu
UD_Uyghur-UDT ug Uyghur
UD_Vietnamese-VTB vi Vietnamese
UD_Welsh-CCG cy Welsh
#NKJP_Polish-byName pl Polish
#NKJP_Polish-byType pl Polish
#UD_Polish-PDBnoMW pl Polish