Page MenuHomePhabricator
Paste P48398

Sentences dataset
ActivePublic

Authored by santhosh on May 19 2023, 8:15 AM.
Tags
None
Referenced Files
F37019570: Sentences dataset
May 19 2023, 8:15 AM
Subscribers
None
ab 18870
ace 42278
ady 5137
af 1556861
ak 6055
als 623194
alt 31123
ami 27501
am 158040
ang 27812
anp 32185
an 349383
arc 8019
ar 12930938
ary 62510
arz 8262041
as 341684
ast 2948126
atj 13705
avk 165376
av 28723
awa 17558
ay 36800
azb 1250771
az 3108188
ban 141334
bar 308371
ba 1315248
bat_smg 87171
bcl 136727
be 3270290
be_x_old 1140843
bg 5556694
bh 65871
bi 4853
bjn 54584
blk 152371
bm 6429
bn 3386494
bo 137135
bpy 265807
br 638574
bs 1547650
bug 33056
bxr 30467
ca 12636855
cbk_zam 26046
cdo 39134
ceb 54557906
ce 3951919
chr 5349
ch 2211
chy 1921
ckb 424573
co 74122
crh 91994
cr 676
csb 35012
cs 11438308
cu 4999
cv 386795
cy 3065250
dag 49069
da 4302455
de 70270378
din 5308
diq 212983
dsb 25716
dty 28200
dv 76577
dz 7575
ee 8691
el 5177826
eml 31927
en 159482189
eo 4006001
es 37786759
et 3419905
eu 4258761
ext 31523
fa 9359812
ff 11016
fi 9338639
fiu_vro 42803
fj 5551
fo 131851
frp 26151
frr 125690
fr 49585051
fur 32676
fy 927380
gag 15149
gan 20275
ga 464974
gcr 12055
gd 121093
glk 43117
gl 3160491
gn 50396
gom 179220
gor 38162
got 5021
guc 6373
gur 12184
gu 456573
guw 14659
gv 45644
hak 32941
ha 448007
haw 11815
he 10004050
hif 47796
hi 2246005
hr 3415212
hsb 110436
ht 403644
hu 9987432
hy 4988967
hyw 226639
ia 140635
id 8070074
ie 70899
ig 278971
ik 2999
ilo 102825
inh 15781
io 334057
is 709374
it 29088805
iu 2978
jam 10243
ja 20691619
jbo 8453
jv 553533
kaa 39827
kab 42315
ka 2253281
kbd 19225
kbp 31060
kcg 5099
kg 4948
ki 6899
kk 2695066
kl 2607
km 187888
kn 1281953
koi 27632
ko 7857924
krc 22732
ksh 36350
ks 11975
ku 389573
kv 47498
kw 48304
ky 959008
lad 35376
la 893770
lbe 5246
lb 567845
lez 53400
lfn 80325
lg 59795
lij 75743
li 258222
lld 431391
lmo 364213
ln 18629
lo 38101
ltg 8211
lt 2925557
lv 1738726
mad 10550
mai 85192
map_bms 47650
mdf 15821
mg 733028
mhr 81024
min 1126599
mi 37176
mk 2936614
ml 1705166
mni 49191
mn 364025
mnw 263564
mrj 46901
mr 1104352
ms 3064886
mt 178632
mwl 124413
my 1870371
myv 64818
mzn 83089
nah 34624
nap 53226
na 1926
nds_nl 116685
nds 764852
ne 402046
new 866854
nia 16531
nl 22477112
nn 1986218
no 8107716
nov 10430
nqo 33448
nrm 28454
nso 23426
nv 149386
ny 13241
oc 905748
olo 36334
om 29933
or 306171
os 89913
pag 17900
pam 62743
pap 35127
pa 753402
pcd 37026
pcm 14264
pdc 15474
pfl 41216
pih 4975
pi 7534
pl 21604296
pms 354320
pnb 1371528
pnt 4364
ps 482774
pt 17820688
pwn 4332
qu 116295
rm 129020
rmy 4599
rn 4596
roa_rup 10069
roa_tara 51351
ro 5631373
rue 70095
ru 46860813
rw 84814
sah 253557
sa 370079
sat 151744
scn 153027
sco 307681
sc 78634
sd 226070
se 37221
sg 1743
shi 15454
shn 201547
sh 7499978
simple 2351109
si 490318
skr 147936
sk 3091570
sl 3269971
smn 47486
sm 7738
sn 59647
so 114376
sq 1389996
srn 7871
sr 12301353
ss 6005
stq 45958
st 7259
su 399260
sv 21050488
sw 573622
szl 252478
szy 70711
ta 2697403
tay 21993
tcy 48325
te 3670856
tet 13040
tg 671857
th 1601525
ti 3977
tk 116086
tl 642007
tn 44169
to 10304
tpi 5608
tr 6842055
trv 45238
ts 6948
tt 3403681
tum 41633
tw 48990
ty 3579
tyv 67134
udm 34184
ug 184083
uk 23002850
ur 1990038
uz 3431278
vec 259685
vep 116508
ve 3634
vi 10155028
vls 93180
vo 293980
war 5979827
wa 126408
wo 25129
wuu 120061
xal 10541
xh 19677
xmf 133935
yi 176809
yo 148286
za 10995
zea 52221
zh_classical 69259
zh_min_nan 1451915
zh 11663736
zh_yue 546971
zu 51276
total 897201783