Skip to content

Releases: explosion/spacy-models

zh_core_web_trf-3.8.0

30 Sep 09:59
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: 78008b8948a24f6a53436d39a3bc5fcab6c391bc5b4e500f16eefc4b34d8f5a1
Checksum .whl: 5c177a1df422ec114b2e83da8b8906cd44890ef27300e9014349dc6efc908c0a

Details: https://spacy.io/models/zh#zh_core_web_trf

Chinese transformer pipeline (Transformer(name='bert-base-chinese', piece_encoder='bert-wordpiece', stride=152, type='bert', width=768, window=208, vocab_size=21128)). Components: transformer, tagger, parser, ner, attribute_ruler.

Feature Description
Name zh_core_web_trf
Version 3.8.0
spaCy >=3.8.0,<3.9.0
Default Pipeline transformer, tagger, parser, attribute_ruler, ner
Components transformer, tagger, parser, attribute_ruler, ner
Vectors 0 keys, 0 unique vectors (0 dimensions)
Sources OntoNotes 5 (Ralph Weischedel, Martha Palmer, Mitchell Marcus, Eduard Hovy, Sameer Pradhan, Lance Ramshaw, Nianwen Xue, Ann Taylor, Jeff Kaufman, Michelle Franchini, Mohammed El-Bachouti, Robert Belvin, Ann Houston)
CoreNLP Universal Dependencies Converter (Stanford NLP Group)
bert-base-chinese (Hugging Face)
License MIT
Author Explosion
Model size 396 MB

Label Scheme

View label scheme (99 labels for 3 components)
Component Labels
tagger AD, AS, BA, CC, CD, CS, DEC, DEG, DER, DEV, DT, ETC, FW, IJ, INF, JJ, LB, LC, M, MSP, NN, NR, NT, OD, ON, P, PN, PU, SB, SP, URL, VA, VC, VE, VV, X
parser ROOT, acl, advcl:loc, advmod, advmod:dvp, advmod:loc, advmod:rcomp, amod, amod:ordmod, appos, aux:asp, aux:ba, aux:modal, aux:prtmod, auxpass, case, cc, ccomp, compound:nn, compound:vc, conj, cop, dep, det, discourse, dobj, etc, mark, mark:clf, name, neg, nmod, nmod:assmod, nmod:poss, nmod:prep, nmod:range, nmod:tmod, nmod:topic, nsubj, nsubj:xsubj, nsubjpass, nummod, parataxis:prnmod, punct, xcomp
ner CARDINAL, DATE, EVENT, FAC, GPE, LANGUAGE, LAW, LOC, MONEY, NORP, ORDINAL, ORG, PERCENT, PERSON, PRODUCT, QUANTITY, TIME, WORK_OF_ART

Accuracy

Type Score
TOKEN_ACC 95.85
TOKEN_P 94.58
TOKEN_R 91.36
TOKEN_F 92.94
TAG_ACC 91.89
SENTS_P 73.32
SENTS_R 68.77
SENTS_F 70.97
DEP_UAS 76.18
DEP_LAS 72.02
ENTS_P 74.49
ENTS_R 74.03
ENTS_F 74.26

Installation

pip install spacy
python -m spacy download zh_core_web_trf

zh_core_web_sm-3.8.0

30 Sep 09:59
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: b099841a3f8c0e591ffff295c4aa30b243c3d7cc21446ff5ca2fac52792c34ea
Checksum .whl: 7de3bd267176b9b2a8defb6997c1cd296da16c57b5e712f72ea44a51755421c8

Details: https://spacy.io/models/zh#zh_core_web_sm

Chinese pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler.

Feature Description
Name zh_core_web_sm
Version 3.8.0
spaCy >=3.8.0,<3.9.0
Default Pipeline tok2vec, tagger, parser, attribute_ruler, ner
Components tok2vec, tagger, parser, senter, attribute_ruler, ner
Vectors 0 keys, 0 unique vectors (0 dimensions)
Sources OntoNotes 5 (Ralph Weischedel, Martha Palmer, Mitchell Marcus, Eduard Hovy, Sameer Pradhan, Lance Ramshaw, Nianwen Xue, Ann Taylor, Jeff Kaufman, Michelle Franchini, Mohammed El-Bachouti, Robert Belvin, Ann Houston)
CoreNLP Universal Dependencies Converter (Stanford NLP Group)
License MIT
Author Explosion
Model size 46 MB

Label Scheme

View label scheme (100 labels for 3 components)
Component Labels
tagger AD, AS, BA, CC, CD, CS, DEC, DEG, DER, DEV, DT, ETC, FW, IJ, INF, JJ, LB, LC, M, MSP, NN, NR, NT, OD, ON, P, PN, PU, SB, SP, URL, VA, VC, VE, VV, X, _SP
parser ROOT, acl, advcl:loc, advmod, advmod:dvp, advmod:loc, advmod:rcomp, amod, amod:ordmod, appos, aux:asp, aux:ba, aux:modal, aux:prtmod, auxpass, case, cc, ccomp, compound:nn, compound:vc, conj, cop, dep, det, discourse, dobj, etc, mark, mark:clf, name, neg, nmod, nmod:assmod, nmod:poss, nmod:prep, nmod:range, nmod:tmod, nmod:topic, nsubj, nsubj:xsubj, nsubjpass, nummod, parataxis:prnmod, punct, xcomp
ner CARDINAL, DATE, EVENT, FAC, GPE, LANGUAGE, LAW, LOC, MONEY, NORP, ORDINAL, ORG, PERCENT, PERSON, PRODUCT, QUANTITY, TIME, WORK_OF_ART

Accuracy

Type Score
TOKEN_ACC 95.85
TOKEN_P 94.58
TOKEN_R 91.36
TOKEN_F 92.94
TAG_ACC 89.39
SENTS_P 78.18
SENTS_R 72.90
SENTS_F 75.45
DEP_UAS 69.52
DEP_LAS 64.05
ENTS_P 72.27
ENTS_R 64.97
ENTS_F 68.42

Installation

pip install spacy
python -m spacy download zh_core_web_sm

zh_core_web_md-3.8.0

30 Sep 09:59
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: 42cd04e0f406cbddf6aab5b9665c89a23de45456526c43d4875b10fd193573f8
Checksum .whl: bcf300540fbd15b1b0aac358f327f2a31de8992b346162d5600f843514da0e98

Details: https://spacy.io/models/zh#zh_core_web_md

Chinese pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler.

Feature Description
Name zh_core_web_md
Version 3.8.0
spaCy >=3.8.0,<3.9.0
Default Pipeline tok2vec, tagger, parser, attribute_ruler, ner
Components tok2vec, tagger, parser, senter, attribute_ruler, ner
Vectors 500000 keys, 20000 unique vectors (300 dimensions)
Sources OntoNotes 5 (Ralph Weischedel, Martha Palmer, Mitchell Marcus, Eduard Hovy, Sameer Pradhan, Lance Ramshaw, Nianwen Xue, Ann Taylor, Jeff Kaufman, Michelle Franchini, Mohammed El-Bachouti, Robert Belvin, Ann Houston)
CoreNLP Universal Dependencies Converter (Stanford NLP Group)
Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia) (Explosion)
License MIT
Author Explosion
Model size 74 MB

Label Scheme

View label scheme (100 labels for 3 components)
Component Labels
tagger AD, AS, BA, CC, CD, CS, DEC, DEG, DER, DEV, DT, ETC, FW, IJ, INF, JJ, LB, LC, M, MSP, NN, NR, NT, OD, ON, P, PN, PU, SB, SP, URL, VA, VC, VE, VV, X, _SP
parser ROOT, acl, advcl:loc, advmod, advmod:dvp, advmod:loc, advmod:rcomp, amod, amod:ordmod, appos, aux:asp, aux:ba, aux:modal, aux:prtmod, auxpass, case, cc, ccomp, compound:nn, compound:vc, conj, cop, dep, det, discourse, dobj, etc, mark, mark:clf, name, neg, nmod, nmod:assmod, nmod:poss, nmod:prep, nmod:range, nmod:tmod, nmod:topic, nsubj, nsubj:xsubj, nsubjpass, nummod, parataxis:prnmod, punct, xcomp
ner CARDINAL, DATE, EVENT, FAC, GPE, LANGUAGE, LAW, LOC, MONEY, NORP, ORDINAL, ORG, PERCENT, PERSON, PRODUCT, QUANTITY, TIME, WORK_OF_ART

Accuracy

Type Score
TOKEN_ACC 95.85
TOKEN_P 94.58
TOKEN_R 91.36
TOKEN_F 92.94
TAG_ACC 90.05
SENTS_P 78.59
SENTS_R 72.90
SENTS_F 75.64
DEP_UAS 70.54
DEP_LAS 65.30
ENTS_P 71.94
ENTS_R 68.11
ENTS_F 69.97

Installation

pip install spacy
python -m spacy download zh_core_web_md

zh_core_web_lg-3.8.0

30 Sep 09:59
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: f5ba75104f5a07854503461a6bbaf7bb34232e9a8b746953a3a46e1478380c34
Checksum .whl: eb39f5deb382752d3faef4052fb61570b7f45441076082b2a1fdbe50a8848e7c

Details: https://spacy.io/models/zh#zh_core_web_lg

Chinese pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler.

Feature Description
Name zh_core_web_lg
Version 3.8.0
spaCy >=3.8.0,<3.9.0
Default Pipeline tok2vec, tagger, parser, attribute_ruler, ner
Components tok2vec, tagger, parser, senter, attribute_ruler, ner
Vectors 500000 keys, 500000 unique vectors (300 dimensions)
Sources OntoNotes 5 (Ralph Weischedel, Martha Palmer, Mitchell Marcus, Eduard Hovy, Sameer Pradhan, Lance Ramshaw, Nianwen Xue, Ann Taylor, Jeff Kaufman, Michelle Franchini, Mohammed El-Bachouti, Robert Belvin, Ann Houston)
CoreNLP Universal Dependencies Converter (Stanford NLP Group)
Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia) (Explosion)
License MIT
Author Explosion
Model size 575 MB

Label Scheme

View label scheme (100 labels for 3 components)
Component Labels
tagger AD, AS, BA, CC, CD, CS, DEC, DEG, DER, DEV, DT, ETC, FW, IJ, INF, JJ, LB, LC, M, MSP, NN, NR, NT, OD, ON, P, PN, PU, SB, SP, URL, VA, VC, VE, VV, X, _SP
parser ROOT, acl, advcl:loc, advmod, advmod:dvp, advmod:loc, advmod:rcomp, amod, amod:ordmod, appos, aux:asp, aux:ba, aux:modal, aux:prtmod, auxpass, case, cc, ccomp, compound:nn, compound:vc, conj, cop, dep, det, discourse, dobj, etc, mark, mark:clf, name, neg, nmod, nmod:assmod, nmod:poss, nmod:prep, nmod:range, nmod:tmod, nmod:topic, nsubj, nsubj:xsubj, nsubjpass, nummod, parataxis:prnmod, punct, xcomp
ner CARDINAL, DATE, EVENT, FAC, GPE, LANGUAGE, LAW, LOC, MONEY, NORP, ORDINAL, ORG, PERCENT, PERSON, PRODUCT, QUANTITY, TIME, WORK_OF_ART

Accuracy

Type Score
TOKEN_ACC 95.85
TOKEN_P 94.58
TOKEN_R 91.36
TOKEN_F 92.94
TAG_ACC 90.35
SENTS_P 77.98
SENTS_R 72.91
SENTS_F 75.36
DEP_UAS 70.88
DEP_LAS 65.79
ENTS_P 73.34
ENTS_R 69.54
ENTS_F 71.39

Installation

pip install spacy
python -m spacy download zh_core_web_lg

xx_sent_ud_sm-3.8.0

30 Sep 09:59
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: d895d6ce9e1a70139fa9096e2d2809d8d20a306e4e3d5ef0beb3d66d87a09edf
Checksum .whl: 1b5fb9321723b395f76458d2a13b7a3673c537eb58c0a9df3778f4e0950180b4

Details: https://spacy.io/models/xx#xx_sent_ud_sm

Multi-language pipeline optimized for CPU. Components: senter.

Feature Description
Name xx_sent_ud_sm
Version 3.8.0
spaCy >=3.8.0,<3.9.0
Default Pipeline senter
Components senter
Vectors 0 keys, 0 unique vectors (0 dimensions)
Sources Universal Dependencies v2.8 (UD_Afrikaans-AfriBooms, UD_Croatian-SET, UD_Czech-CAC, UD_Czech-CLTT, UD_Danish-DDT, UD_Dutch-Alpino, UD_Dutch-LassySmall, UD_English-EWT, UD_Finnish-FTB, UD_Finnish-TDT, UD_French-GSD, UD_French-Spoken, UD_German-GSD, UD_Indonesian-GSD, UD_Irish-IDT, UD_Italian-TWITTIRO, UD_Korean-GSD, UD_Korean-Kaist, UD_Latvian-LVTB, UD_Lithuanian-ALKSNIS, UD_Lithuanian-HSE, UD_Marathi-UFAL, UD_Norwegian-Bokmaal, UD_Norwegian-Nynorsk, UD_Norwegian-NynorskLIA, UD_Persian-Seraji, UD_Portuguese-Bosque, UD_Portuguese-GSD, UD_Romanian-Nonstandard, UD_Romanian-RRT, UD_Russian-GSD, UD_Russian-Taiga, UD_Serbian-SET, UD_Slovak-SNK, UD_Spanish-GSD, UD_Swedish-Talbanken, UD_Telugu-MTG, UD_Vietnamese-VTB) (Zeman, Daniel; Nivre, Joakim; Abrams, Mitchell; et al.)
License CC BY-SA 3.0
Author Explosion
Model size 4 MB

Label Scheme

Accuracy

Type Score
TOKEN_ACC 98.59
TOKEN_P 95.31
TOKEN_R 95.72
TOKEN_F 95.52
SENTS_P 90.67
SENTS_R 81.49
SENTS_F 85.83

Installation

pip install spacy
python -m spacy download xx_sent_ud_sm

xx_ent_wiki_sm-3.8.0

30 Sep 09:59
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: a0f0f1be6faa96300abf9d711f7d5a326698333ba8f21f9f9e0e12bf92f6be6b
Checksum .whl: 6f3c4b853852ea9e9d2dc76cc950dddb10a7e4c42d813308caefe6c5e8be2f0a

Details: https://spacy.io/models/xx#xx_ent_wiki_sm

Multi-language pipeline optimized for CPU. Components: ner.

Feature Description
Name xx_ent_wiki_sm
Version 3.8.0
spaCy >=3.8.0,<3.9.0
Default Pipeline ner
Components ner
Vectors 0 keys, 0 unique vectors (0 dimensions)
Sources WikiNER (Joel Nothman, Nicky Ringland, Will Radford, Tara Murphy, James R Curran)
License MIT
Author Explosion
Model size 10 MB

Label Scheme

View label scheme (4 labels for 1 components)
Component Labels
ner LOC, MISC, ORG, PER

Accuracy

Type Score
ENTS_P 83.57
ENTS_R 82.71
ENTS_F 83.14

Installation

pip install spacy
python -m spacy download xx_ent_wiki_sm

uk_core_news_trf-3.8.0

30 Sep 10:00
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: bc3e383e017f7e16a9adc33bbc55fd9b2769a1c225391909909824452ff2a010
Checksum .whl: 68896bd7b1dd3e1665b691edd4688a9fe386fa677571f75b05f3ca27e69fa1e9

Details: https://spacy.io/models/uk#uk_core_news_trf

Ukrainian transformer pipeline (Transformer(name='ukr-models/xlm-roberta-base-uk', piece_encoder='xlm-roberta-sentencepiece', stride=168, type='xlm-roberta', width=768, window=216, vocab_size=31274)). Components: transformer, morphologizer, parser, ner, attribute_ruler, lemmatizer.

Feature Description
Name uk_core_news_trf
Version 3.8.0
spaCy >=3.8.0,<3.9.0
Default Pipeline transformer, morphologizer, parser, attribute_ruler, lemmatizer, ner
Components transformer, morphologizer, parser, attribute_ruler, lemmatizer, ner
Vectors 0 keys, 0 unique vectors (0 dimensions)
Sources Ukr-Synth (e5d9eaf3) (Volodymyr Kurnosov)
ukr-models/xlm-roberta-base-uk (Volodymyr Kurnosov and Yinhan Liu and Myle Ott and Naman Goyal and Jingfei Du and Mandar Joshi and Danqi Chen and Omer Levy and Mike Lewis and Luke Zettlemoyer and Veselin Stoyanov)
License MIT
Author Explosion
Model size 391 MB

Label Scheme

View label scheme (1210 labels for 3 components)
Component Labels
morphologizer POS=CCONJ, Degree=Cmp|POS=ADV, Aspect=Imp|Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Animacy=Inan|Case=Nom|Gender=Fem|Number=Plur|POS=NOUN, Animacy=Inan|Case=Gen|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Inan|Case=Ins|Gender=Fem|Number=Sing|POS=NOUN, POS=PUNCT, Case=Gen|Number=Plur|POS=DET|PronType=Dem, Animacy=Inan|Case=Gen|Gender=Fem|Number=Plur|POS=NOUN, POS=ADV|PronType=Rel, POS=PART, Aspect=Imp|Mood=Ind|Number=Plur|POS=VERB|Tense=Past|VerbForm=Fin, Aspect=Imp|POS=VERB|VerbForm=Inf, Animacy=Inan|Case=Nom|Gender=Masc|Number=Plur|POS=NOUN, Animacy=Anim|Case=Nom|Gender=Masc|Number=Plur|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Masc|Number=Plur|POS=NOUN, Case=Loc|POS=ADP, Case=Loc|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Inan|Case=Loc|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Anim|Case=Nom|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Anim|Case=Nom|Gender=Masc|NameType=Giv|Number=Sing|POS=PROPN, Animacy=Anim|Case=Nom|Gender=Masc|NameType=Sur|Number=Sing|POS=PROPN, POS=ADV, Aspect=Imp|Gender=Masc|Mood=Ind|Number=Sing|POS=VERB|Tense=Past|VerbForm=Fin, Animacy=Inan|Case=Loc|Gender=Masc|Number=Plur|POS=NOUN, Case=Gen|POS=ADP, Animacy=Inan|Case=Gen|Gender=Neut|Number=Sing|POS=PRON|PronType=Dem, Case=Loc|Gender=Masc|NumType=Ord|Number=Sing|POS=ADJ|Uninflect=Yes, Abbr=Yes|Animacy=Inan|Case=Loc|Gender=Masc|Number=Sing|POS=NOUN|Uninflect=Yes, Case=Nom|NumType=Card|POS=DET|PronType=Ind, Animacy=Anim|Case=Gen|Gender=Masc|Number=Plur|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Neut|Number=Sing|POS=NOUN, Case=Gen|Number=Plur|POS=ADJ, Animacy=Inan|Case=Gen|Gender=Neut|Number=Plur|POS=NOUN, Case=Loc|Number=Plur|POS=ADJ, POS=SCONJ, Aspect=Imp|Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Aspect=Perf|POS=VERB|VerbForm=Inf, Degree=Pos|POS=ADV, Aspect=Imp|Mood=Ind|Number=Sing|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Animacy=Anim|Case=Nom|Number=Plur|POS=PRON|Person=2|PronType=Prs, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Person=2|Tense=Fut|VerbForm=Fin, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=DET|Poss=Yes|PronType=Prs|Reflex=Yes, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=NOUN, Case=Loc|Gender=Neut|Number=Sing|POS=DET|PronType=Dem, Animacy=Inan|Case=Loc|Gender=Neut|Number=Sing|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=DET|PronType=Dem, Animacy=Inan|Case=Acc|Degree=Pos|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=ADJ, Aspect=Perf|Mood=Ind|POS=VERB|Person=0|VerbForm=Fin, Case=Gen|Gender=Masc|NumType=Ord|Number=Sing|POS=ADJ|Uninflect=Yes, Animacy=Inan|Case=Loc|Gender=Fem|Number=Sing|POS=PROPN, Aspect=Imp|Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Animacy=Anim|Case=Acc|Number=Plur|POS=DET|PronType=Tot, POS=PART|Polarity=Neg, Animacy=Inan|Case=Gen|Gender=Neut|Number=Plur|POS=NOUN|Uninflect=Yes, Animacy=Inan|Case=Gen|Gender=Fem|Number=Sing|POS=NOUN, POS=PUNCT|PunctType=Quot, POS=PUNCT|PunctType=Dash, Aspect=Perf|Gender=Masc|Mood=Ind|Number=Sing|POS=VERB|Tense=Past|VerbForm=Fin, POS=ADV|PronType=Dem, Animacy=Inan|Case=Nom|Gender=Fem|Number=Sing|POS=NOUN, Animacy=Inan|Case=Gen|Gender=Fem|Number=Sing|POS=PROPN, Case=Acc|POS=ADP, Animacy=Inan|Case=Acc|Gender=Fem|Number=Sing|POS=NOUN, Case=Gen|Gender=Masc|Number=Sing|POS=DET|PronType=Ind, Aspect=Perf|Case=Gen|Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Animacy=Inan|Case=Gen|Gender=Neut|Number=Sing|POS=NOUN, Foreign=Yes|POS=X, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Tense=Past|VerbForm=Fin, Case=Ins|POS=ADP, Animacy=Inan|Case=Ins|Gender=Fem|Number=Plur|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Neut|Number=Sing|POS=PRON|PronType=Dem, Case=Nom|Number=Plur|POS=ADJ, Animacy=Anim|Case=Nom|Gender=Fem|Number=Plur|POS=NOUN, Abbr=Yes|Animacy=Inan|Case=Gen|Gender=Neut|Number=Sing|POS=NOUN|Uninflect=Yes, Animacy=Inan|Case=Nom|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Inan|Case=Nom|Gender=Fem|Number=Sing|POS=PROPN, Animacy=Inan|Case=Acc|Number=Ptan|POS=NOUN, Case=Nom|Number=Plur|POS=DET|PronType=Rel, Case=Ins|Number=Plur|POS=PRON|Person=3|PronType=Prs, Aspect=Imp|Mood=Ind|Number=Plur|POS=AUX|Tense=Past|VerbForm=Fin, Aspect=Perf|Case=Nom|Number=Plur|POS=ADJ|VerbForm=Part|Voice=Pass, Aspect=Perf|Case=Nom|Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Case=Nom|Number=Plur|POS=DET|Person=3|Poss=Yes|PronType=Prs|Uninflect=Yes, Animacy=Inan|Case=Nom|Gender=Neut|Number=Plur|POS=NOUN, Aspect=Imp|Gender=Masc|Mood=Ind|Number=Sing|POS=AUX|Tense=Past|VerbForm=Fin, Case=Ins|Degree=Pos|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Inan|Case=Dat|POS=PRON|PronType=Neg, Case=Nom|Degree=Pos|Number=Plur|POS=ADJ, Animacy=Anim|Case=Acc|Gender=Masc|Number=Plur|POS=NOUN, Case=Nom|Gender=Fem|Number=Sing|POS=DET|PronType=Tot, Case=Ins|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Inan|Case=Ins|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Inan|Case=Nom|Gender=Neut|Number=Sing|POS=PRON|PronType=Dem, Case=Nom|Gender=Masc|Number=Sing|POS=ADJ, Case=Gen|Degree=Pos|Number=Plur|POS=ADJ, Aspect=Perf|Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin, Case=Ins|Gender=Masc|Number=Sing|POS=ADJ, Aspect=Perf|POS=VERB|Tense=Past|VerbForm=Conv, Animacy=Inan|Case=Acc|Gender=Fem|Number=Plur|POS=NOUN, Aspect=Imp|Case=Gen|Gender=Neut|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin, Case=Acc|Degree=Cmp|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Inan|Case=Loc|Gender=Fem|Number=Sing|POS=NOUN, Animacy=Inan|Case=Acc|Degree=Pos|Number=Plur|POS=ADJ, Case=Loc|Degree=Pos|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Anim|Case=Gen|Number=Plur|POS=PRON|Person=2|PronType=Prs, Case=Nom|NumType=Card|POS=DET|PronType=Dem, Animacy=Anim|Case=Gen|Number=Ptan|POS=NOUN, Animacy=Inan|Case=Loc|Gender=Masc|Number=Sing|POS=PROPN, Case=Gen|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Anim|Case=Acc|Gender=Fem|Number=Sing|POS=NOUN, Aspect=Perf|Case=Gen|Number=Plur|POS=ADJ|VerbForm=Part|Voice=Pass, Case=Nom|Gender=Fem|Number=Sing|POS=PRON|Person=3|PronType=Prs, Aspect=Perf|Gender=Fem|Mood=Ind|Number=Sing|POS=VERB|Tense=Past|VerbForm=Fin, Animacy=Inan|Case=Gen|Number=Ptan|POS=NOUN, Abbr=Yes|Animacy=Anim|Case=Nom|Gender=Masc|NameType=Giv|Number=Sing|POS=PROPN|Uninflect=Yes, Abbr=Yes|Animacy=Anim|Case=Nom|Gender=Masc|NameType=Sur|Number=Sing|POS=PROPN|Uninflect=Yes, Animacy=Anim|Case=Nom|Gender=Masc|Number=Sing|POS=PRON|PronType=Int, Animacy=Inan|Case=Nom|Gender=Neut|Number=Sing|POS=PRON|PronType=Int, Case=Acc|Gender=Neut|Number=Sing|POS=ADJ, Case=Nom|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Anim|Case=Nom|Number=Plur|POS=PRON|Person=1|PronType=Prs, Animacy=Inan|Case=Acc|Number=Plur|POS=ADJ, Animacy=Inan|Case=Nom|Gender=Masc|Number=Sing|POS=PROPN|Uninflect=Yes, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Fut|VerbForm=Fin, Case=Gen|Gender=Fem|Number=Sing|POS=ADJ, Case=Acc|Gender=Neut|NumType=Ord|Number=Sing|POS=ADJ|Uninflect=Yes, Animacy=Inan|Case=Dat|Gender=Neut|Number=Sing|POS=NOUN, Case=Nom|Gender=Fem|Number=Sing|POS=DET|PronType=Rel, `Animacy=Anim|Case=Gen|Gender=Fem|Number=S...
Read more

uk_core_news_sm-3.8.0

30 Sep 10:00
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: 777fdbcd482a6e66ef278e2f739cc2e5f934295f02df4c0ec8b9c060f2b59624
Checksum .whl: d20adb50b42c0dcfdedf4994dabcb96789a64983a9ab560d0c6c38a59e8efb58

Details: https://spacy.io/models/uk#uk_core_news_sm

Ukrainian pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, senter, ner, attribute_ruler, lemmatizer.

Feature Description
Name uk_core_news_sm
Version 3.8.0
spaCy >=3.8.0,<3.9.0
Default Pipeline tok2vec, morphologizer, parser, attribute_ruler, lemmatizer, ner
Components tok2vec, morphologizer, parser, senter, attribute_ruler, lemmatizer, ner
Vectors 0 keys, 0 unique vectors (0 dimensions)
Sources Ukr-Synth (e5d9eaf3) (Volodymyr Kurnosov)
License MIT
Author Explosion
Model size 14 MB

Label Scheme

View label scheme (1211 labels for 3 components)
Component Labels
morphologizer POS=CCONJ, Degree=Cmp|POS=ADV, Aspect=Imp|Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Animacy=Inan|Case=Nom|Gender=Fem|Number=Plur|POS=NOUN, Animacy=Inan|Case=Gen|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Inan|Case=Ins|Gender=Fem|Number=Sing|POS=NOUN, POS=PUNCT, Case=Gen|Number=Plur|POS=DET|PronType=Dem, Animacy=Inan|Case=Gen|Gender=Fem|Number=Plur|POS=NOUN, POS=ADV|PronType=Rel, POS=PART, Aspect=Imp|Mood=Ind|Number=Plur|POS=VERB|Tense=Past|VerbForm=Fin, Aspect=Imp|POS=VERB|VerbForm=Inf, Animacy=Inan|Case=Nom|Gender=Masc|Number=Plur|POS=NOUN, Animacy=Anim|Case=Nom|Gender=Masc|Number=Plur|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Masc|Number=Plur|POS=NOUN, Case=Loc|POS=ADP, Case=Loc|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Inan|Case=Loc|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Anim|Case=Nom|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Anim|Case=Nom|Gender=Masc|NameType=Giv|Number=Sing|POS=PROPN, Animacy=Anim|Case=Nom|Gender=Masc|NameType=Sur|Number=Sing|POS=PROPN, POS=ADV, Aspect=Imp|Gender=Masc|Mood=Ind|Number=Sing|POS=VERB|Tense=Past|VerbForm=Fin, Animacy=Inan|Case=Loc|Gender=Masc|Number=Plur|POS=NOUN, Case=Gen|POS=ADP, Animacy=Inan|Case=Gen|Gender=Neut|Number=Sing|POS=PRON|PronType=Dem, Case=Loc|Gender=Masc|NumType=Ord|Number=Sing|POS=ADJ|Uninflect=Yes, Abbr=Yes|Animacy=Inan|Case=Loc|Gender=Masc|Number=Sing|POS=NOUN|Uninflect=Yes, Case=Nom|NumType=Card|POS=DET|PronType=Ind, Animacy=Anim|Case=Gen|Gender=Masc|Number=Plur|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Neut|Number=Sing|POS=NOUN, Case=Gen|Number=Plur|POS=ADJ, Animacy=Inan|Case=Gen|Gender=Neut|Number=Plur|POS=NOUN, Case=Loc|Number=Plur|POS=ADJ, POS=SCONJ, Aspect=Imp|Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Aspect=Perf|POS=VERB|VerbForm=Inf, Degree=Pos|POS=ADV, Aspect=Imp|Mood=Ind|Number=Sing|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Animacy=Anim|Case=Nom|Number=Plur|POS=PRON|Person=2|PronType=Prs, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Person=2|Tense=Fut|VerbForm=Fin, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=DET|Poss=Yes|PronType=Prs|Reflex=Yes, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=NOUN, Case=Loc|Gender=Neut|Number=Sing|POS=DET|PronType=Dem, Animacy=Inan|Case=Loc|Gender=Neut|Number=Sing|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=DET|PronType=Dem, Animacy=Inan|Case=Acc|Degree=Pos|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=ADJ, Aspect=Perf|Mood=Ind|POS=VERB|Person=0|VerbForm=Fin, Case=Gen|Gender=Masc|NumType=Ord|Number=Sing|POS=ADJ|Uninflect=Yes, Animacy=Inan|Case=Loc|Gender=Fem|Number=Sing|POS=PROPN, Aspect=Imp|Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Animacy=Anim|Case=Acc|Number=Plur|POS=DET|PronType=Tot, POS=PART|Polarity=Neg, Animacy=Inan|Case=Gen|Gender=Neut|Number=Plur|POS=NOUN|Uninflect=Yes, Animacy=Inan|Case=Gen|Gender=Fem|Number=Sing|POS=NOUN, POS=PUNCT|PunctType=Quot, POS=PUNCT|PunctType=Dash, Aspect=Perf|Gender=Masc|Mood=Ind|Number=Sing|POS=VERB|Tense=Past|VerbForm=Fin, POS=ADV|PronType=Dem, Animacy=Inan|Case=Nom|Gender=Fem|Number=Sing|POS=NOUN, Animacy=Inan|Case=Gen|Gender=Fem|Number=Sing|POS=PROPN, Case=Acc|POS=ADP, Animacy=Inan|Case=Acc|Gender=Fem|Number=Sing|POS=NOUN, Case=Gen|Gender=Masc|Number=Sing|POS=DET|PronType=Ind, Aspect=Perf|Case=Gen|Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Animacy=Inan|Case=Gen|Gender=Neut|Number=Sing|POS=NOUN, Foreign=Yes|POS=X, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Tense=Past|VerbForm=Fin, Case=Ins|POS=ADP, Animacy=Inan|Case=Ins|Gender=Fem|Number=Plur|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Neut|Number=Sing|POS=PRON|PronType=Dem, Case=Nom|Number=Plur|POS=ADJ, Animacy=Anim|Case=Nom|Gender=Fem|Number=Plur|POS=NOUN, Abbr=Yes|Animacy=Inan|Case=Gen|Gender=Neut|Number=Sing|POS=NOUN|Uninflect=Yes, Animacy=Inan|Case=Nom|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Inan|Case=Nom|Gender=Fem|Number=Sing|POS=PROPN, Animacy=Inan|Case=Acc|Number=Ptan|POS=NOUN, Case=Nom|Number=Plur|POS=DET|PronType=Rel, Case=Ins|Number=Plur|POS=PRON|Person=3|PronType=Prs, Aspect=Imp|Mood=Ind|Number=Plur|POS=AUX|Tense=Past|VerbForm=Fin, Aspect=Perf|Case=Nom|Number=Plur|POS=ADJ|VerbForm=Part|Voice=Pass, Aspect=Perf|Case=Nom|Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Case=Nom|Number=Plur|POS=DET|Person=3|Poss=Yes|PronType=Prs|Uninflect=Yes, Animacy=Inan|Case=Nom|Gender=Neut|Number=Plur|POS=NOUN, Aspect=Imp|Gender=Masc|Mood=Ind|Number=Sing|POS=AUX|Tense=Past|VerbForm=Fin, Case=Ins|Degree=Pos|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Inan|Case=Dat|POS=PRON|PronType=Neg, Case=Nom|Degree=Pos|Number=Plur|POS=ADJ, Animacy=Anim|Case=Acc|Gender=Masc|Number=Plur|POS=NOUN, POS=SPACE, Case=Nom|Gender=Fem|Number=Sing|POS=DET|PronType=Tot, Case=Ins|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Inan|Case=Ins|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Inan|Case=Nom|Gender=Neut|Number=Sing|POS=PRON|PronType=Dem, Case=Nom|Gender=Masc|Number=Sing|POS=ADJ, Case=Gen|Degree=Pos|Number=Plur|POS=ADJ, Aspect=Perf|Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin, Case=Ins|Gender=Masc|Number=Sing|POS=ADJ, Aspect=Perf|POS=VERB|Tense=Past|VerbForm=Conv, Animacy=Inan|Case=Acc|Gender=Fem|Number=Plur|POS=NOUN, Aspect=Imp|Case=Gen|Gender=Neut|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin, Case=Acc|Degree=Cmp|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Inan|Case=Loc|Gender=Fem|Number=Sing|POS=NOUN, Animacy=Inan|Case=Acc|Degree=Pos|Number=Plur|POS=ADJ, Case=Loc|Degree=Pos|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Anim|Case=Gen|Number=Plur|POS=PRON|Person=2|PronType=Prs, Case=Nom|NumType=Card|POS=DET|PronType=Dem, Animacy=Anim|Case=Gen|Number=Ptan|POS=NOUN, Animacy=Inan|Case=Loc|Gender=Masc|Number=Sing|POS=PROPN, Case=Gen|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Anim|Case=Acc|Gender=Fem|Number=Sing|POS=NOUN, Aspect=Perf|Case=Gen|Number=Plur|POS=ADJ|VerbForm=Part|Voice=Pass, Case=Nom|Gender=Fem|Number=Sing|POS=PRON|Person=3|PronType=Prs, Aspect=Perf|Gender=Fem|Mood=Ind|Number=Sing|POS=VERB|Tense=Past|VerbForm=Fin, Animacy=Inan|Case=Gen|Number=Ptan|POS=NOUN, Abbr=Yes|Animacy=Anim|Case=Nom|Gender=Masc|NameType=Giv|Number=Sing|POS=PROPN|Uninflect=Yes, Abbr=Yes|Animacy=Anim|Case=Nom|Gender=Masc|NameType=Sur|Number=Sing|POS=PROPN|Uninflect=Yes, Animacy=Anim|Case=Nom|Gender=Masc|Number=Sing|POS=PRON|PronType=Int, Animacy=Inan|Case=Nom|Gender=Neut|Number=Sing|POS=PRON|PronType=Int, Case=Acc|Gender=Neut|Number=Sing|POS=ADJ, Case=Nom|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Anim|Case=Nom|Number=Plur|POS=PRON|Person=1|PronType=Prs, Animacy=Inan|Case=Acc|Number=Plur|POS=ADJ, Animacy=Inan|Case=Nom|Gender=Masc|Number=Sing|POS=PROPN|Uninflect=Yes, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Fut|VerbForm=Fin, Case=Gen|Gender=Fem|Number=Sing|POS=ADJ, Case=Acc|Gender=Neut|NumType=Ord|Number=Sing|POS=ADJ|Uninflect=Yes, Animacy=Inan|Case=Dat|Gender=Neut|Number=Sing|POS=NOUN, Case=Nom|Gender=Fem|Number=Sing|POS=DET|PronType=Rel, Animacy=Anim|Case=Gen|Gender=Fem|Number=Sing|POS=NOUN, Aspect=Perf|Case=Loc|Degree=Pos|Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Animacy=Anim|Case=Gen|Gender=Masc|Number=Sing|POS=NOUN, Case=Gen|Degree=Pos|Gender=Masc|Number=Sing|POS=ADJ, Case=Nom|NumType=Card|POS=NUM|Uninflect=Yes, Case=Nom|Degree=Pos|Gender=Neut|Number=Sing|POS=ADJ, Animacy=Inan|Case=Nom|Gender=Neut|Number=Sing|POS=NOUN, `Animacy=Inan|Case=Acc...
Read more

uk_core_news_md-3.8.0

30 Sep 10:00
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: 707b00a61684c9095fe7c56c5509833c152e85d0c60cea4ead5d71f0ed9d7457
Checksum .whl: 583c65ab55d6bdbe3b3ebe86978bf4f6a8aab0b6778ea8c4c5ab529722abaafd

Details: https://spacy.io/models/uk#uk_core_news_md

Ukrainian pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, senter, ner, attribute_ruler, lemmatizer.

Feature Description
Name uk_core_news_md
Version 3.8.0
spaCy >=3.8.0,<3.9.0
Default Pipeline tok2vec, morphologizer, parser, attribute_ruler, lemmatizer, ner
Components tok2vec, morphologizer, parser, senter, attribute_ruler, lemmatizer, ner
Vectors floret (50000, 300)
Sources Ukr-Synth (e5d9eaf3) (Volodymyr Kurnosov)
Explosion Vectors (OSCAR 2109 + Wikipedia + OpenSubtitles + WMT News Crawl) (Explosion)
License MIT
Author Explosion
Model size 65 MB

Label Scheme

View label scheme (1211 labels for 3 components)
Component Labels
morphologizer POS=CCONJ, Degree=Cmp|POS=ADV, Aspect=Imp|Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Animacy=Inan|Case=Nom|Gender=Fem|Number=Plur|POS=NOUN, Animacy=Inan|Case=Gen|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Inan|Case=Ins|Gender=Fem|Number=Sing|POS=NOUN, POS=PUNCT, Case=Gen|Number=Plur|POS=DET|PronType=Dem, Animacy=Inan|Case=Gen|Gender=Fem|Number=Plur|POS=NOUN, POS=ADV|PronType=Rel, POS=PART, Aspect=Imp|Mood=Ind|Number=Plur|POS=VERB|Tense=Past|VerbForm=Fin, Aspect=Imp|POS=VERB|VerbForm=Inf, Animacy=Inan|Case=Nom|Gender=Masc|Number=Plur|POS=NOUN, Animacy=Anim|Case=Nom|Gender=Masc|Number=Plur|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Masc|Number=Plur|POS=NOUN, Case=Loc|POS=ADP, Case=Loc|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Inan|Case=Loc|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Anim|Case=Nom|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Anim|Case=Nom|Gender=Masc|NameType=Giv|Number=Sing|POS=PROPN, Animacy=Anim|Case=Nom|Gender=Masc|NameType=Sur|Number=Sing|POS=PROPN, POS=ADV, Aspect=Imp|Gender=Masc|Mood=Ind|Number=Sing|POS=VERB|Tense=Past|VerbForm=Fin, Animacy=Inan|Case=Loc|Gender=Masc|Number=Plur|POS=NOUN, Case=Gen|POS=ADP, Animacy=Inan|Case=Gen|Gender=Neut|Number=Sing|POS=PRON|PronType=Dem, Case=Loc|Gender=Masc|NumType=Ord|Number=Sing|POS=ADJ|Uninflect=Yes, Abbr=Yes|Animacy=Inan|Case=Loc|Gender=Masc|Number=Sing|POS=NOUN|Uninflect=Yes, Case=Nom|NumType=Card|POS=DET|PronType=Ind, Animacy=Anim|Case=Gen|Gender=Masc|Number=Plur|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Neut|Number=Sing|POS=NOUN, Case=Gen|Number=Plur|POS=ADJ, Animacy=Inan|Case=Gen|Gender=Neut|Number=Plur|POS=NOUN, Case=Loc|Number=Plur|POS=ADJ, POS=SCONJ, Aspect=Imp|Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Aspect=Perf|POS=VERB|VerbForm=Inf, Degree=Pos|POS=ADV, Aspect=Imp|Mood=Ind|Number=Sing|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Animacy=Anim|Case=Nom|Number=Plur|POS=PRON|Person=2|PronType=Prs, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Person=2|Tense=Fut|VerbForm=Fin, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=DET|Poss=Yes|PronType=Prs|Reflex=Yes, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=NOUN, Case=Loc|Gender=Neut|Number=Sing|POS=DET|PronType=Dem, Animacy=Inan|Case=Loc|Gender=Neut|Number=Sing|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=DET|PronType=Dem, Animacy=Inan|Case=Acc|Degree=Pos|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=ADJ, Aspect=Perf|Mood=Ind|POS=VERB|Person=0|VerbForm=Fin, Case=Gen|Gender=Masc|NumType=Ord|Number=Sing|POS=ADJ|Uninflect=Yes, Animacy=Inan|Case=Loc|Gender=Fem|Number=Sing|POS=PROPN, Aspect=Imp|Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Animacy=Anim|Case=Acc|Number=Plur|POS=DET|PronType=Tot, POS=PART|Polarity=Neg, Animacy=Inan|Case=Gen|Gender=Neut|Number=Plur|POS=NOUN|Uninflect=Yes, Animacy=Inan|Case=Gen|Gender=Fem|Number=Sing|POS=NOUN, POS=PUNCT|PunctType=Quot, POS=PUNCT|PunctType=Dash, Aspect=Perf|Gender=Masc|Mood=Ind|Number=Sing|POS=VERB|Tense=Past|VerbForm=Fin, POS=ADV|PronType=Dem, Animacy=Inan|Case=Nom|Gender=Fem|Number=Sing|POS=NOUN, Animacy=Inan|Case=Gen|Gender=Fem|Number=Sing|POS=PROPN, Case=Acc|POS=ADP, Animacy=Inan|Case=Acc|Gender=Fem|Number=Sing|POS=NOUN, Case=Gen|Gender=Masc|Number=Sing|POS=DET|PronType=Ind, Aspect=Perf|Case=Gen|Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Animacy=Inan|Case=Gen|Gender=Neut|Number=Sing|POS=NOUN, Foreign=Yes|POS=X, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Tense=Past|VerbForm=Fin, Case=Ins|POS=ADP, Animacy=Inan|Case=Ins|Gender=Fem|Number=Plur|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Neut|Number=Sing|POS=PRON|PronType=Dem, Case=Nom|Number=Plur|POS=ADJ, Animacy=Anim|Case=Nom|Gender=Fem|Number=Plur|POS=NOUN, Abbr=Yes|Animacy=Inan|Case=Gen|Gender=Neut|Number=Sing|POS=NOUN|Uninflect=Yes, Animacy=Inan|Case=Nom|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Inan|Case=Nom|Gender=Fem|Number=Sing|POS=PROPN, Animacy=Inan|Case=Acc|Number=Ptan|POS=NOUN, Case=Nom|Number=Plur|POS=DET|PronType=Rel, Case=Ins|Number=Plur|POS=PRON|Person=3|PronType=Prs, Aspect=Imp|Mood=Ind|Number=Plur|POS=AUX|Tense=Past|VerbForm=Fin, Aspect=Perf|Case=Nom|Number=Plur|POS=ADJ|VerbForm=Part|Voice=Pass, Aspect=Perf|Case=Nom|Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Case=Nom|Number=Plur|POS=DET|Person=3|Poss=Yes|PronType=Prs|Uninflect=Yes, Animacy=Inan|Case=Nom|Gender=Neut|Number=Plur|POS=NOUN, Aspect=Imp|Gender=Masc|Mood=Ind|Number=Sing|POS=AUX|Tense=Past|VerbForm=Fin, Case=Ins|Degree=Pos|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Inan|Case=Dat|POS=PRON|PronType=Neg, Case=Nom|Degree=Pos|Number=Plur|POS=ADJ, Animacy=Anim|Case=Acc|Gender=Masc|Number=Plur|POS=NOUN, POS=SPACE, Case=Nom|Gender=Fem|Number=Sing|POS=DET|PronType=Tot, Case=Ins|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Inan|Case=Ins|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Inan|Case=Nom|Gender=Neut|Number=Sing|POS=PRON|PronType=Dem, Case=Nom|Gender=Masc|Number=Sing|POS=ADJ, Case=Gen|Degree=Pos|Number=Plur|POS=ADJ, Aspect=Perf|Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin, Case=Ins|Gender=Masc|Number=Sing|POS=ADJ, Aspect=Perf|POS=VERB|Tense=Past|VerbForm=Conv, Animacy=Inan|Case=Acc|Gender=Fem|Number=Plur|POS=NOUN, Aspect=Imp|Case=Gen|Gender=Neut|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin, Case=Acc|Degree=Cmp|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Inan|Case=Loc|Gender=Fem|Number=Sing|POS=NOUN, Animacy=Inan|Case=Acc|Degree=Pos|Number=Plur|POS=ADJ, Case=Loc|Degree=Pos|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Anim|Case=Gen|Number=Plur|POS=PRON|Person=2|PronType=Prs, Case=Nom|NumType=Card|POS=DET|PronType=Dem, Animacy=Anim|Case=Gen|Number=Ptan|POS=NOUN, Animacy=Inan|Case=Loc|Gender=Masc|Number=Sing|POS=PROPN, Case=Gen|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Anim|Case=Acc|Gender=Fem|Number=Sing|POS=NOUN, Aspect=Perf|Case=Gen|Number=Plur|POS=ADJ|VerbForm=Part|Voice=Pass, Case=Nom|Gender=Fem|Number=Sing|POS=PRON|Person=3|PronType=Prs, Aspect=Perf|Gender=Fem|Mood=Ind|Number=Sing|POS=VERB|Tense=Past|VerbForm=Fin, Animacy=Inan|Case=Gen|Number=Ptan|POS=NOUN, Abbr=Yes|Animacy=Anim|Case=Nom|Gender=Masc|NameType=Giv|Number=Sing|POS=PROPN|Uninflect=Yes, Abbr=Yes|Animacy=Anim|Case=Nom|Gender=Masc|NameType=Sur|Number=Sing|POS=PROPN|Uninflect=Yes, Animacy=Anim|Case=Nom|Gender=Masc|Number=Sing|POS=PRON|PronType=Int, Animacy=Inan|Case=Nom|Gender=Neut|Number=Sing|POS=PRON|PronType=Int, Case=Acc|Gender=Neut|Number=Sing|POS=ADJ, Case=Nom|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Anim|Case=Nom|Number=Plur|POS=PRON|Person=1|PronType=Prs, Animacy=Inan|Case=Acc|Number=Plur|POS=ADJ, Animacy=Inan|Case=Nom|Gender=Masc|Number=Sing|POS=PROPN|Uninflect=Yes, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Fut|VerbForm=Fin, Case=Gen|Gender=Fem|Number=Sing|POS=ADJ, Case=Acc|Gender=Neut|NumType=Ord|Number=Sing|POS=ADJ|Uninflect=Yes, Animacy=Inan|Case=Dat|Gender=Neut|Number=Sing|POS=NOUN, Case=Nom|Gender=Fem|Number=Sing|POS=DET|PronType=Rel, Animacy=Anim|Case=Gen|Gender=Fem|Number=Sing|POS=NOUN, Aspect=Perf|Case=Loc|Degree=Pos|Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Animacy=Anim|Case=Gen|Gender=Masc|Number=Sing|POS=NOUN, Case=Gen|Degree=Pos|Gender=Masc|Number=Sing|POS=ADJ, Case=Nom|NumType=Card|POS=NUM|Uninflect=Yes, `Case=Nom|Degree=...
Read more

uk_core_news_lg-3.8.0

30 Sep 10:00
Compare
Choose a tag to compare

Downloads Downloads (wheel)

Checksum .tar.gz: 491b50d0804262709108b5ae5a5058f70bc945aae54452993cb7e9ae1284a324
Checksum .whl: 60b56f8fa5525925cf8e6266a80e907431c3d475f874c75c68b62f3e12620b0b

Details: https://spacy.io/models/uk#uk_core_news_lg

Ukrainian pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, senter, ner, attribute_ruler, lemmatizer.

Feature Description
Name uk_core_news_lg
Version 3.8.0
spaCy >=3.8.0,<3.9.0
Default Pipeline tok2vec, morphologizer, parser, attribute_ruler, lemmatizer, ner
Components tok2vec, morphologizer, parser, senter, attribute_ruler, lemmatizer, ner
Vectors floret (200000, 300)
Sources Ukr-Synth (e5d9eaf3) (Volodymyr Kurnosov)
Explosion Vectors (OSCAR 2109 + Wikipedia + OpenSubtitles + WMT News Crawl) (Explosion)
License MIT
Author Explosion
Model size 220 MB

Label Scheme

View label scheme (1211 labels for 3 components)
Component Labels
morphologizer POS=CCONJ, Degree=Cmp|POS=ADV, Aspect=Imp|Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Animacy=Inan|Case=Nom|Gender=Fem|Number=Plur|POS=NOUN, Animacy=Inan|Case=Gen|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Inan|Case=Ins|Gender=Fem|Number=Sing|POS=NOUN, POS=PUNCT, Case=Gen|Number=Plur|POS=DET|PronType=Dem, Animacy=Inan|Case=Gen|Gender=Fem|Number=Plur|POS=NOUN, POS=ADV|PronType=Rel, POS=PART, Aspect=Imp|Mood=Ind|Number=Plur|POS=VERB|Tense=Past|VerbForm=Fin, Aspect=Imp|POS=VERB|VerbForm=Inf, Animacy=Inan|Case=Nom|Gender=Masc|Number=Plur|POS=NOUN, Animacy=Anim|Case=Nom|Gender=Masc|Number=Plur|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Masc|Number=Plur|POS=NOUN, Case=Loc|POS=ADP, Case=Loc|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Inan|Case=Loc|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Anim|Case=Nom|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Anim|Case=Nom|Gender=Masc|NameType=Giv|Number=Sing|POS=PROPN, Animacy=Anim|Case=Nom|Gender=Masc|NameType=Sur|Number=Sing|POS=PROPN, POS=ADV, Aspect=Imp|Gender=Masc|Mood=Ind|Number=Sing|POS=VERB|Tense=Past|VerbForm=Fin, Animacy=Inan|Case=Loc|Gender=Masc|Number=Plur|POS=NOUN, Case=Gen|POS=ADP, Animacy=Inan|Case=Gen|Gender=Neut|Number=Sing|POS=PRON|PronType=Dem, Case=Loc|Gender=Masc|NumType=Ord|Number=Sing|POS=ADJ|Uninflect=Yes, Abbr=Yes|Animacy=Inan|Case=Loc|Gender=Masc|Number=Sing|POS=NOUN|Uninflect=Yes, Case=Nom|NumType=Card|POS=DET|PronType=Ind, Animacy=Anim|Case=Gen|Gender=Masc|Number=Plur|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Neut|Number=Sing|POS=NOUN, Case=Gen|Number=Plur|POS=ADJ, Animacy=Inan|Case=Gen|Gender=Neut|Number=Plur|POS=NOUN, Case=Loc|Number=Plur|POS=ADJ, POS=SCONJ, Aspect=Imp|Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin, Aspect=Perf|POS=VERB|VerbForm=Inf, Degree=Pos|POS=ADV, Aspect=Imp|Mood=Ind|Number=Sing|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Animacy=Anim|Case=Nom|Number=Plur|POS=PRON|Person=2|PronType=Prs, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Person=2|Tense=Fut|VerbForm=Fin, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=DET|Poss=Yes|PronType=Prs|Reflex=Yes, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=NOUN, Case=Loc|Gender=Neut|Number=Sing|POS=DET|PronType=Dem, Animacy=Inan|Case=Loc|Gender=Neut|Number=Sing|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=DET|PronType=Dem, Animacy=Inan|Case=Acc|Degree=Pos|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing|POS=ADJ, Aspect=Perf|Mood=Ind|POS=VERB|Person=0|VerbForm=Fin, Case=Gen|Gender=Masc|NumType=Ord|Number=Sing|POS=ADJ|Uninflect=Yes, Animacy=Inan|Case=Loc|Gender=Fem|Number=Sing|POS=PROPN, Aspect=Imp|Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin, Animacy=Anim|Case=Acc|Number=Plur|POS=DET|PronType=Tot, POS=PART|Polarity=Neg, Animacy=Inan|Case=Gen|Gender=Neut|Number=Plur|POS=NOUN|Uninflect=Yes, Animacy=Inan|Case=Gen|Gender=Fem|Number=Sing|POS=NOUN, POS=PUNCT|PunctType=Quot, POS=PUNCT|PunctType=Dash, Aspect=Perf|Gender=Masc|Mood=Ind|Number=Sing|POS=VERB|Tense=Past|VerbForm=Fin, POS=ADV|PronType=Dem, Animacy=Inan|Case=Nom|Gender=Fem|Number=Sing|POS=NOUN, Animacy=Inan|Case=Gen|Gender=Fem|Number=Sing|POS=PROPN, Case=Acc|POS=ADP, Animacy=Inan|Case=Acc|Gender=Fem|Number=Sing|POS=NOUN, Case=Gen|Gender=Masc|Number=Sing|POS=DET|PronType=Ind, Aspect=Perf|Case=Gen|Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Animacy=Inan|Case=Gen|Gender=Neut|Number=Sing|POS=NOUN, Foreign=Yes|POS=X, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Tense=Past|VerbForm=Fin, Case=Ins|POS=ADP, Animacy=Inan|Case=Ins|Gender=Fem|Number=Plur|POS=NOUN, Animacy=Inan|Case=Acc|Gender=Neut|Number=Sing|POS=PRON|PronType=Dem, Case=Nom|Number=Plur|POS=ADJ, Animacy=Anim|Case=Nom|Gender=Fem|Number=Plur|POS=NOUN, Abbr=Yes|Animacy=Inan|Case=Gen|Gender=Neut|Number=Sing|POS=NOUN|Uninflect=Yes, Animacy=Inan|Case=Nom|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Inan|Case=Nom|Gender=Fem|Number=Sing|POS=PROPN, Animacy=Inan|Case=Acc|Number=Ptan|POS=NOUN, Case=Nom|Number=Plur|POS=DET|PronType=Rel, Case=Ins|Number=Plur|POS=PRON|Person=3|PronType=Prs, Aspect=Imp|Mood=Ind|Number=Plur|POS=AUX|Tense=Past|VerbForm=Fin, Aspect=Perf|Case=Nom|Number=Plur|POS=ADJ|VerbForm=Part|Voice=Pass, Aspect=Perf|Case=Nom|Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Case=Nom|Number=Plur|POS=DET|Person=3|Poss=Yes|PronType=Prs|Uninflect=Yes, Animacy=Inan|Case=Nom|Gender=Neut|Number=Plur|POS=NOUN, Aspect=Imp|Gender=Masc|Mood=Ind|Number=Sing|POS=AUX|Tense=Past|VerbForm=Fin, Case=Ins|Degree=Pos|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Inan|Case=Dat|POS=PRON|PronType=Neg, Case=Nom|Degree=Pos|Number=Plur|POS=ADJ, Animacy=Anim|Case=Acc|Gender=Masc|Number=Plur|POS=NOUN, POS=SPACE, Case=Nom|Gender=Fem|Number=Sing|POS=DET|PronType=Tot, Case=Ins|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Inan|Case=Ins|Gender=Masc|Number=Sing|POS=NOUN, Animacy=Inan|Case=Nom|Gender=Neut|Number=Sing|POS=PRON|PronType=Dem, Case=Nom|Gender=Masc|Number=Sing|POS=ADJ, Case=Gen|Degree=Pos|Number=Plur|POS=ADJ, Aspect=Perf|Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin, Case=Ins|Gender=Masc|Number=Sing|POS=ADJ, Aspect=Perf|POS=VERB|Tense=Past|VerbForm=Conv, Animacy=Inan|Case=Acc|Gender=Fem|Number=Plur|POS=NOUN, Aspect=Imp|Case=Gen|Gender=Neut|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin, Case=Acc|Degree=Cmp|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Inan|Case=Loc|Gender=Fem|Number=Sing|POS=NOUN, Animacy=Inan|Case=Acc|Degree=Pos|Number=Plur|POS=ADJ, Case=Loc|Degree=Pos|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Anim|Case=Gen|Number=Plur|POS=PRON|Person=2|PronType=Prs, Case=Nom|NumType=Card|POS=DET|PronType=Dem, Animacy=Anim|Case=Gen|Number=Ptan|POS=NOUN, Animacy=Inan|Case=Loc|Gender=Masc|Number=Sing|POS=PROPN, Case=Gen|Gender=Masc|Number=Sing|POS=ADJ, Animacy=Anim|Case=Acc|Gender=Fem|Number=Sing|POS=NOUN, Aspect=Perf|Case=Gen|Number=Plur|POS=ADJ|VerbForm=Part|Voice=Pass, Case=Nom|Gender=Fem|Number=Sing|POS=PRON|Person=3|PronType=Prs, Aspect=Perf|Gender=Fem|Mood=Ind|Number=Sing|POS=VERB|Tense=Past|VerbForm=Fin, Animacy=Inan|Case=Gen|Number=Ptan|POS=NOUN, Abbr=Yes|Animacy=Anim|Case=Nom|Gender=Masc|NameType=Giv|Number=Sing|POS=PROPN|Uninflect=Yes, Abbr=Yes|Animacy=Anim|Case=Nom|Gender=Masc|NameType=Sur|Number=Sing|POS=PROPN|Uninflect=Yes, Animacy=Anim|Case=Nom|Gender=Masc|Number=Sing|POS=PRON|PronType=Int, Animacy=Inan|Case=Nom|Gender=Neut|Number=Sing|POS=PRON|PronType=Int, Case=Acc|Gender=Neut|Number=Sing|POS=ADJ, Case=Nom|Gender=Fem|Number=Sing|POS=ADJ, Animacy=Anim|Case=Nom|Number=Plur|POS=PRON|Person=1|PronType=Prs, Animacy=Inan|Case=Acc|Number=Plur|POS=ADJ, Animacy=Inan|Case=Nom|Gender=Masc|Number=Sing|POS=PROPN|Uninflect=Yes, Aspect=Perf|Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Fut|VerbForm=Fin, Case=Gen|Gender=Fem|Number=Sing|POS=ADJ, Case=Acc|Gender=Neut|NumType=Ord|Number=Sing|POS=ADJ|Uninflect=Yes, Animacy=Inan|Case=Dat|Gender=Neut|Number=Sing|POS=NOUN, Case=Nom|Gender=Fem|Number=Sing|POS=DET|PronType=Rel, Animacy=Anim|Case=Gen|Gender=Fem|Number=Sing|POS=NOUN, Aspect=Perf|Case=Loc|Degree=Pos|Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part|Voice=Pass, Animacy=Anim|Case=Gen|Gender=Masc|Number=Sing|POS=NOUN, Case=Gen|Degree=Pos|Gender=Masc|Number=Sing|POS=ADJ, Case=Nom|NumType=Card|POS=NUM|Uninflect=Yes, `Case=Nom|Degre...
Read more