@ScottPJones
Created March 17, 2018 02:24
Comparison of Str types to String, on a variety of different text files in different languages
File name Lines Chars Avg C/L Empty Min Max MaxType
ASCII-Dict.txt 235886 2257223 9.569 0 1 24 ASCIIStr
Character Types: ASCII Latin1 2-Byte UCS2 UTF32 Surrogate Invalid
Total characters: 2257223 0 0 0 0 0 0
Lines with > 0: 235886 0 0 0 0 0 0
File name Lines Chars Avg C/L Empty Min Max MaxType
ASCII-Dict.txt 1000 9500 9.500 0 7 12 ASCIIStr
Character Types: ASCII Latin1 2-Byte UCS2 UTF32 Surrogate Invalid
Total characters: 9500 0 0 0 0 0 0
Lines with > 0: 1000 0 0 0 0 0 0
Type B/Char sizeof length nextind reverse repeat 1 repeat 10 search search iteration sum isascii isvalid isascii isvalid islower isalnum uppercase textwidth
chars string string string char Char char vals string string chars chars chars chars string string
Results: 9500 9500 9500 1000 1000 1000 0 0 0 1039500 1000 1000 9500 9500 9000 9500 1000 9500
String 1.000 0.056 0.856 5.137 30.764 1.656 5.880 109.161 90.761 2.114 1.748 1.437 0.626 1.951 2.005 2.952 3.623 30.817 6.628
UTF8Str 1.000 0.900 2.503 3.296 12.508 2.889 0.832 1.107 10.591 1.714 1.307 3.524 11.074 1.693 1.779 1.965 1.832 6.317 2.183
UTF16Str 2.000 0.900 1.184 2.474 11.059 2.843 0.682 1.134 16.311 1.257 1.053 3.740 11.054 1.117 1.243 1.000 1.162 6.707 1.883
UTF32Str 4.000 1.008 15.145 31.202 13.858 2.941 0.653 1.056 74.196 1.411 1.532 2.573 11.074 1.365 0.273 1.305 1.210 6.480 1.855
UniStr 1.000 0.124 0.468 7.689 5.696 1.607 0.786 1.000 21.790 1.809 1.523 1.356 0.436 0.385 0.387 0.527 0.249 6.361 4.725
ASCIIStr 1.000 1.002 15.316 13.578 6.770 2.843 0.899 1.116 204.612 2.667 2.654 2.397 11.074 0.353 0.355 0.488 1.887 7.201 6.798
####################################################################################################
File name Lines Chars Avg C/L Empty Min Max MaxType
Chinese.txt 27932 877727 31.424 0 1 80 UCS2Str
Character Types: ASCII Latin1 2-Byte UCS2 UTF32 Surrogate Invalid
Total characters: 23172 0 1 854554 0 0 0
Lines with > 0: 124 0 0 27808 0 0 0
File name Lines Chars Avg C/L Empty Min Max MaxType
Chinese.txt 1000 25000 25.000 0 14 36 UCS2Str
Character Types: ASCII Latin1 2-Byte UCS2 UTF32 Surrogate Invalid
Total characters: 500 0 0 24500 0 0 0
Lines with > 0: 0 0 0 1000 0 0 0
Type B/Char sizeof length nextind reverse repeat 1 repeat 10 search search iteration sum isascii isvalid isascii isvalid islower isalnum uppercase textwidth
chars string string string char Char char vals string string chars chars chars chars string string
Results: 74000 25000 25000 1000 1000 1000 0 0 0 711397000 0 1000 500 25000 0 21000 0 49500
String 2.947 0.021 3.138 6.195 45.727 0.918 5.491 142.167 100.405 5.156 9.805 0.329 2.077 10.273 10.793 12.887 13.167 32.385 11.872
UTF8Str 2.947 0.912 7.787 1.990 18.969 4.223 1.153 0.981 26.627 2.538 4.843 3.916 92.737 4.680 6.050 2.887 2.742 8.259 2.825
UTF16Str 2.000 0.912 5.882 2.045 27.163 4.381 1.144 2.448 45.276 3.460 6.315 3.626 92.903 6.720 7.475 3.261 3.116 10.290 3.220
UTF32Str 4.000 1.000 140.318 99.088 31.417 4.543 0.688 2.370 105.632 4.088 8.450 3.634 88.171 7.481 4.064 3.537 3.333 9.046 3.636
UniStr 1.989 0.155 3.703 14.948 32.965 2.346 1.357 1.433 2.648 3.852 9.158 3.223 3.764 3.540 3.501 2.464 2.232 9.886 3.535
UCS2Str 2.000 1.017 145.796 20.494 36.456 4.152 1.128 2.382 25.700 4.088 9.877 3.937 88.022 8.104 2.967 3.739 3.540 10.700 3.376
####################################################################################################
File name Lines Chars Avg C/L Empty Min Max MaxType
English-1.txt 15952 974112 61.065 0 2 153 UCS2Str
Character Types: ASCII Latin1 2-Byte UCS2 UTF32 Surrogate Invalid
Total characters: 963564 2 0 10546 0 0 0
Lines with > 0: 10267 2 0 5683 0 0 0
File name Lines Chars Avg C/L Empty Min Max MaxType
English-1.txt 1000 69500 69.500 0 69 70 ASCIIStr
Character Types: ASCII Latin1 2-Byte UCS2 UTF32 Surrogate Invalid
Total characters: 69500 0 0 0 0 0 0
Lines with > 0: 1000 0 0 0 0 0 0
Type B/Char sizeof length nextind reverse repeat 1 repeat 10 search search iteration sum isascii isvalid isascii isvalid islower isalnum uppercase textwidth
chars string string string char Char char vals string string chars chars chars chars string string
Results: 69500 69500 69500 1000 1000 1000 0 0 0 6276000 1000 1000 69500 69500 52500 54000 1000 69500
String 1.022 0.008 0.839 4.606 21.036 0.327 1.797 57.114 90.390 1.658 1.369 1.208 0.541 1.629 1.515 2.070 2.597 18.961 5.689
UTF8Str 1.022 1.067 6.358 4.134 23.167 4.186 1.010 1.025 76.599 1.962 1.611 11.104 67.120 1.969 2.021 1.840 1.684 11.357 2.025
UTF16Str 2.000 1.055 2.038 1.434 18.570 3.769 0.425 1.032 79.942 1.353 1.120 8.476 63.815 1.302 1.360 1.094 1.120 10.153 1.482
UTF32Str 4.000 1.002 96.223 193.314 18.298 4.248 0.147 1.028 152.108 1.580 1.348 3.739 63.815 1.653 1.120 1.324 1.247 6.097 1.730
UniStr 1.325 0.139 3.257 9.451 26.804 2.319 1.006 1.015 158.730 2.364 2.040 1.859 2.510 1.302 1.095 1.661 1.199 17.782 8.949
UCS2Str 2.000 1.002 98.832 11.819 25.559 4.052 0.504 1.013 58.023 1.495 1.654 6.753 63.707 1.644 1.060 1.144 1.229 10.047 1.859
####################################################################################################
File name Lines Chars Avg C/L Empty Min Max MaxType
English.txt 51863 3142689 60.596 0 3 72 UCS2Str
Character Types: ASCII Latin1 2-Byte UCS2 UTF32 Surrogate Invalid
Total characters: 3104427 10666 3 27593 0 0 0
Lines with > 0: 30235 5221 1 16406 0 0 0
File name Lines Chars Avg C/L Empty Min Max MaxType
English.txt 1000 41000 41.000 0 10 72 ASCIIStr
Character Types: ASCII Latin1 2-Byte UCS2 UTF32 Surrogate Invalid
Total characters: 41000 0 0 0 0 0 0
Lines with > 0: 1000 0 0 0 0 0 0
Type B/Char sizeof length nextind reverse repeat 1 repeat 10 search search iteration sum isascii isvalid isascii isvalid islower isalnum uppercase textwidth
chars string string string char Char char vals string string chars chars chars chars string string
Results: 41000 41000 41000 1000 1000 1000 18000 0 0 4002500 1000 1000 41000 41000 34500 35500 1000 41000
String 1.021 0.014 0.870 4.927 21.726 0.488 2.662 42.561 89.201 1.649 1.491 1.211 0.550 1.755 1.596 2.164 2.752 19.309 6.084
UTF8Str 1.021 1.000 5.411 4.354 19.077 3.679 1.196 0.940 42.438 1.862 1.742 8.708 40.318 1.926 2.015 1.863 1.649 9.792 2.075
UTF16Str 2.000 1.067 1.949 1.696 17.457 3.615 0.678 0.965 58.621 1.395 1.166 7.794 38.265 1.381 1.408 1.119 1.060 9.008 1.767
UTF32Str 4.000 1.000 60.478 129.172 18.020 3.751 0.414 0.960 127.350 1.514 1.471 3.752 40.318 1.717 0.848 1.246 1.229 6.594 1.965
UniStr 1.309 0.131 1.998 10.189 22.270 1.936 1.198 0.997 92.387 2.070 1.941 1.728 1.701 0.967 0.869 1.213 0.987 12.863 8.481
UCS2Str 2.000 1.055 64.524 11.816 22.826 3.841 0.900 0.996 36.239 1.513 1.729 5.547 40.682 1.641 0.831 1.120 1.157 8.500 1.862
####################################################################################################
File name Lines Chars Avg C/L Empty Min Max MaxType
French.txt 9082 566851 62.415 0 1 72 UCS2Str
Character Types: ASCII Latin1 2-Byte UCS2 UTF32 Surrogate Invalid
Total characters: 549022 16433 171 1225 0 0 0
Lines with > 0: 1532 6835 161 554 0 0 0
File name Lines Chars Avg C/L Empty Min Max MaxType
French.txt 1000 69000 69.000 0 67 71 UCS2Str
Character Types: ASCII Latin1 2-Byte UCS2 UTF32 Surrogate Invalid
Total characters: 33500 1000 0 34500 0 0 0
Lines with > 0: 0 500 0 500 0 0 0
Type B/Char sizeof length nextind reverse repeat 1 repeat 10 search search iteration sum isascii isvalid isascii isvalid islower isalnum uppercase textwidth
chars string string string char Char char vals string string chars chars chars chars string string
Results: 139000 69000 69000 1000 1000 1000 0 0 0 329988000 0 1000 33500 69000 25000 27000 500 69000
String 1.034 0.009 2.087 5.407 32.392 0.466 4.238 84.206 92.434 3.367 5.541 0.182 1.084 6.363 6.316 8.147 8.114 26.003 9.472
UTF8Str 1.034 1.000 8.573 2.415 25.387 5.914 1.274 0.956 67.284 2.283 4.142 5.572 125.030 4.405 5.013 2.782 2.433 8.844 2.899
UTF16Str 2.000 1.002 5.293 1.786 28.044 5.814 1.219 1.831 85.890 2.755 4.529 4.949 88.798 5.215 5.964 2.746 2.484 10.798 2.334
UTF32Str 4.000 1.002 259.895 238.530 28.565 6.261 0.565 1.820 155.087 3.363 5.477 3.257 89.543 6.118 4.819 3.320 2.614 8.360 2.932
UniStr 1.079 0.150 7.292 13.194 40.296 2.952 1.536 2.520 9.451 3.775 7.002 2.018 4.890 4.096 4.778 2.785 2.378 9.483 5.035
UCS2Str 2.000 1.000 244.037 12.320 41.363 5.434 1.557 1.798 61.066 3.033 6.352 4.059 139.233 6.493 4.152 2.845 2.614 10.826 3.011
####################################################################################################
File name Lines Chars Avg C/L Empty Min Max MaxType
German.txt 9442 635698 67.327 0 3 75 LatinStr
Character Types: ASCII Latin1 2-Byte UCS2 UTF32 Surrogate Invalid
Total characters: 623889 11809 0 0 0 0 0
Lines with > 0: 2585 6857 0 0 0 0 0
File name Lines Chars Avg C/L Empty Min Max MaxType
German.txt 1000 72000 72.000 0 71 73 ASCIIStr
Character Types: ASCII Latin1 2-Byte UCS2 UTF32 Surrogate Invalid
Total characters: 72000 0 0 0 0 0 0
Lines with > 0: 1000 0 0 0 0 0 0
Type B/Char sizeof length nextind reverse repeat 1 repeat 10 search search iteration sum isascii isvalid isascii isvalid islower isalnum uppercase textwidth
chars string string string char Char char vals string string chars chars chars chars string string
Results: 72000 72000 72000 1000 1000 1000 0 0 0 6656500 1000 1000 72000 72000 54500 60000 1000 72000
String 1.019 0.008 0.869 4.842 19.477 0.327 1.659 47.803 87.691 1.651 1.361 1.210 0.581 1.642 1.507 2.176 2.732 19.514 5.677
UTF8Str 1.019 1.002 6.212 4.368 19.186 4.336 0.678 0.976 72.477 1.971 1.614 11.116 74.823 1.999 1.925 1.931 1.831 12.034 2.079
UTF16Str 2.000 1.002 2.021 1.518 17.476 4.260 0.675 1.017 80.179 1.438 1.119 8.892 71.012 1.433 1.362 1.153 1.118 10.444 1.411
UTF32Str 4.000 1.055 105.992 211.288 18.305 4.654 0.358 1.007 154.347 1.592 1.391 3.250 71.012 1.732 1.090 1.401 1.318 6.558 1.855
UniStr 1.000 0.131 3.323 10.085 23.934 2.279 0.888 0.982 154.585 2.322 1.889 1.988 3.159 1.225 1.038 1.649 1.210 17.375 8.361
LatinStr 1.000 1.099 111.670 11.733 28.190 4.698 0.934 0.947 1150.461 2.639 2.562 10.925 77.888 1.071 1.434 1.826 1.708 10.703 13.738
####################################################################################################
File name Lines Chars Avg C/L Empty Min Max MaxType
Hungarian.txt 8222 468065 56.928 0 1 72 UCS2Str
Character Types: ASCII Latin1 2-Byte UCS2 UTF32 Surrogate Invalid
Total characters: 428018 34550 3358 2139 0 0 0
Lines with > 0: 347 4187 2312 1376 0 0 0
File name Lines Chars Avg C/L Empty Min Max MaxType
Hungarian.txt 1000 31000 31.000 0 15 47 LatinStr
Character Types: ASCII Latin1 2-Byte UCS2 UTF32 Surrogate Invalid
Total characters: 29500 1500 0 0 0 0 0
Lines with > 0: 500 500 0 0 0 0 0
Type B/Char sizeof length nextind reverse repeat 1 repeat 10 search search iteration sum isascii isvalid isascii isvalid islower isalnum uppercase textwidth
chars string string string char Char char vals string string chars chars chars chars string string
Results: 32500 31000 31000 1000 1000 1000 0 0 1500 3228500 500 1000 29500 31000 24500 27000 1000 31000
String 1.090 0.019 0.958 5.549 23.584 0.582 2.581 59.053 90.768 2.030 2.485 0.702 0.529 2.184 2.193 3.189 3.870 19.490 6.415
UTF8Str 1.090 0.972 4.938 4.361 16.111 3.320 1.066 1.034 33.294 2.003 2.322 5.744 29.631 2.240 2.298 2.366 2.170 7.560 1.977
UTF16Str 2.000 1.063 2.006 1.889 16.348 3.050 0.626 0.995 48.944 1.469 1.696 5.528 27.773 1.502 1.675 1.378 1.291 7.564 1.561
UTF32Str 4.000 0.998 49.908 103.694 15.415 3.372 0.384 0.982 110.130 1.667 2.121 3.322 28.547 1.795 0.948 1.561 1.292 7.235 1.861
UniStr 1.476 0.143 1.573 14.067 18.586 1.615 1.187 2.362 11.937 2.337 3.171 2.735 1.235 0.935 1.099 1.284 1.224 6.543 10.231
UCS2Str 2.000 1.054 53.698 15.325 21.847 3.381 0.679 0.981 29.406 1.601 2.456 4.420 27.773 1.784 0.902 1.265 1.282 7.568 1.954
####################################################################################################
File name Lines Chars Avg C/L Empty Min Max MaxType
Japanese.txt 2037 292885 143.783 0 3 2191 UCS2Str
Character Types: ASCII Latin1 2-Byte UCS2 UTF32 Surrogate Invalid
Total characters: 1590 0 0 291295 0 0 0
Lines with > 0: 12 0 0 2025 0 0 0
File name Lines Chars Avg C/L Empty Min Max MaxType
Japanese.txt 1000 40000 40.000 0 34 46 UCS2Str
Character Types: ASCII Latin1 2-Byte UCS2 UTF32 Surrogate Invalid
Total characters: 0 0 0 40000 0 0 0
Lines with > 0: 0 0 0 1000 0 0 0
Type B/Char sizeof length nextind reverse repeat 1 repeat 10 search search iteration sum isascii isvalid isascii isvalid islower isalnum uppercase textwidth
chars string string string char Char char vals string string chars chars chars chars string string
Results: 120000 40000 40000 1000 1000 1000 0 0 0 634928500 0 1000 0 40000 0 31000 0 80000
String 2.989 0.015 3.294 6.599 43.048 0.720 4.845 126.977 97.324 5.278 9.800 0.217 1.936 11.053 10.843 13.282 12.849 32.277 12.876
UTF8Str 2.989 1.067 8.922 1.990 22.079 5.586 0.977 0.959 44.959 2.750 5.127 4.052 131.281 5.834 6.627 2.801 2.537 8.198 3.068
UTF16Str 2.000 1.000 6.885 2.008 31.999 5.213 1.326 2.432 58.792 3.973 7.336 3.825 131.504 7.934 8.229 3.358 3.208 11.077 3.722
UTF32Str 4.000 1.000 223.353 159.969 39.437 5.387 0.923 2.381 133.890 4.106 8.230 3.834 90.592 9.266 5.396 3.857 3.272 9.537 4.047
UniStr 1.998 0.171 7.073 13.157 25.993 2.781 1.266 1.686 4.170 4.194 8.958 3.401 5.864 4.840 4.550 2.906 2.550 10.560 3.728
UCS2Str 2.000 1.099 235.318 19.665 44.361 5.214 1.295 2.478 40.288 4.379 10.497 4.037 97.307 9.171 5.579 3.771 3.468 11.279 3.843
####################################################################################################
File name Lines Chars Avg C/L Empty Min Max MaxType
Portuguese.txt 3541 195387 55.178 0 1 72 LatinStr
Character Types: ASCII Latin1 2-Byte UCS2 UTF32 Surrogate Invalid
Total characters: 191175 4212 0 0 0 0 0
Lines with > 0: 1321 2220 0 0 0 0 0
File name Lines Chars Avg C/L Empty Min Max MaxType
Portuguese.txt 1000 68500 68.500 0 68 69 LatinStr
Character Types: ASCII Latin1 2-Byte UCS2 UTF32 Surrogate Invalid
Total characters: 66500 2000 0 0 0 0 0
Lines with > 0: 500 500 0 0 0 0 0
Type B/Char sizeof length nextind reverse repeat 1 repeat 10 search search iteration sum isascii isvalid isascii isvalid islower isalnum uppercase textwidth
chars string string string char Char char vals string string chars chars chars chars string string
Results: 70500 68500 68500 1000 1000 1000 0 0 2500 4784500 500 1000 66500 68500 29500 32500 1000 68500
String 1.022 0.009 0.950 5.122 19.959 0.301 1.725 47.062 88.069 1.903 2.015 1.014 0.542 2.000 2.008 2.705 3.336 15.997 5.800
UTF8Str 1.022 1.000 7.108 3.955 17.072 3.792 0.981 1.015 72.115 1.957 2.044 11.366 68.955 2.258 2.313 2.223 2.078 8.637 1.976
UTF16Str 2.000 1.000 2.214 1.603 17.503 3.738 0.693 1.016 76.540 1.550 1.646 8.824 62.878 1.634 1.793 1.287 1.419 8.483 1.509
UTF32Str 4.000 1.000 116.456 229.189 18.287 3.868 0.258 1.024 139.639 1.855 1.929 4.194 64.743 1.967 1.449 1.498 1.409 7.608 1.890
UniStr 1.000 0.131 3.204 11.151 24.895 1.899 0.979 1.678 27.710 2.727 3.059 2.894 2.425 1.450 1.559 1.673 1.457 7.733 9.765
LatinStr 1.000 1.065 110.337 12.829 30.625 3.998 0.832 0.958 1187.539 3.254 3.835 10.764 62.985 1.371 1.871 2.285 1.871 8.538 13.327
####################################################################################################
File name Lines Chars Avg C/L Empty Min Max MaxType
Russian.txt 5427 481891 88.795 0 4 605 UCS2Str
Character Types: ASCII Latin1 2-Byte UCS2 UTF32 Surrogate Invalid
Total characters: 100030 469 379784 1608 0 0 0
Lines with > 0: 3 0 4079 1345 0 0 0
File name Lines Chars Avg C/L Empty Min Max MaxType
Russian.txt 1000 96500 96.500 0 34 159 UCS2Str
Character Types: ASCII Latin1 2-Byte UCS2 UTF32 Surrogate Invalid
Total characters: 28500 1000 67000 0 0 0 0
Lines with > 0: 500 0 500 0 0 0 0
Type B/Char sizeof length nextind reverse repeat 1 repeat 10 search search iteration sum isascii isvalid isascii isvalid islower isalnum uppercase textwidth
chars string string string char Char char vals string string chars chars chars chars string string
Results: 164500 96500 96500 1000 1000 1000 0 0 3000 74525500 500 1000 28500 96500 75500 81500 1000 96500
String 1.796 0.006 2.281 5.920 28.741 0.360 2.346 72.785 95.291 3.353 7.130 0.314 0.938 7.673 7.192 9.736 9.812 23.779 10.339
UTF8Str 1.796 1.002 11.386 2.319 21.794 6.296 1.044 0.989 60.034 1.837 3.874 7.191 157.742 5.000 6.341 2.673 2.420 3.530 2.714
UTF16Str 2.000 0.864 5.148 1.970 31.163 6.643 0.896 1.569 100.768 2.901 6.172 6.617 107.280 6.658 6.942 2.603 2.240 4.004 3.066
UTF32Str 4.000 0.998 393.136 365.045 28.906 6.848 0.638 1.536 203.015 3.524 7.733 4.086 153.725 8.611 6.408 3.253 2.776 3.493 3.514
UniStr 2.000 0.146 10.814 17.314 41.686 3.327 0.876 1.308 16.926 3.554 8.346 2.187 6.584 3.918 4.763 2.881 2.553 4.100 3.774
UCS2Str 2.000 1.003 373.146 17.538 35.761 6.428 0.913 1.584 55.208 3.037 7.891 5.996 153.464 6.521 5.990 3.229 2.706 4.294 3.225
####################################################################################################
File name Lines Chars Avg C/L Empty Min Max MaxType
Spanish.txt 31622 2060688 65.166 0 3 78 LatinStr
Character Types: ASCII Latin1 2-Byte UCS2 UTF32 Surrogate Invalid
Total characters: 2017122 43566 0 0 0 0 0
Lines with > 0: 8470 23152 0 0 0 0 0
File name Lines Chars Avg C/L Empty Min Max MaxType
Spanish.txt 1000 38500 38.500 0 3 74 LatinStr
Character Types: ASCII Latin1 2-Byte UCS2 UTF32 Surrogate Invalid
Total characters: 35500 3000 0 0 0 0 0
Lines with > 0: 500 500 0 0 0 0 0
Type B/Char sizeof length nextind reverse repeat 1 repeat 10 search search iteration sum isascii isvalid isascii isvalid islower isalnum uppercase textwidth
chars string string string char Char char vals string string chars chars chars chars string string
Results: 41500 38500 38500 1000 1000 1000 0 0 0 3923000 500 1000 35500 38500 28500 30500 1000 38500
String 1.021 0.015 1.046 4.788 23.191 0.463 2.985 55.008 98.960 2.023 2.257 0.312 0.643 2.405 2.340 3.389 3.934 20.951 6.564
UTF8Str 1.021 1.000 6.118 4.156 17.550 3.281 1.293 0.996 45.004 1.944 2.131 2.767 41.939 2.354 2.188 2.557 2.067 9.643 2.035
UTF16Str 2.000 0.673 2.275 1.616 15.421 3.407 0.618 1.058 58.780 1.614 1.796 2.899 34.462 1.912 2.157 1.437 1.431 8.176 1.846
UTF32Str 4.000 1.063 68.382 74.745 17.680 3.333 0.512 1.035 140.139 1.973 2.122 2.665 44.745 2.382 1.300 1.702 1.525 6.394 1.947
UniStr 1.000 0.122 1.817 7.479 9.793 1.078 0.948 4.136 17.022 2.575 3.085 2.318 1.613 1.217 1.439 1.458 1.399 7.045 10.817
LatinStr 1.000 1.097 72.834 11.401 25.471 3.462 0.969 1.043 734.238 3.126 4.215 4.033 42.010 1.248 1.581 2.591 2.290 10.072 13.318
####################################################################################################
File name Lines Chars Avg C/L Empty Min Max MaxType
Swedish.txt 11160 591452 52.997 0 3 76 LatinStr
Character Types: ASCII Latin1 2-Byte UCS2 UTF32 Surrogate Invalid
Total characters: 564808 26644 0 0 0 0 0
Lines with > 0: 1518 9642 0 0 0 0 0
File name Lines Chars Avg C/L Empty Min Max MaxType
Swedish.txt 1000 44500 44.500 0 20 69 LatinStr
Character Types: ASCII Latin1 2-Byte UCS2 UTF32 Surrogate Invalid
Total characters: 43500 1000 0 0 0 0 0
Lines with > 0: 500 500 0 0 0 0 0
Type B/Char sizeof length nextind reverse repeat 1 repeat 10 search search iteration sum isascii isvalid isascii isvalid islower isalnum uppercase textwidth
chars string string string char Char char vals string string chars chars chars chars string string
Results: 45500 44500 44500 1000 1000 1000 0 0 0 4350500 500 1000 43500 44500 36000 37500 1000 44500
String 1.045 0.012 0.888 4.719 22.401 0.459 2.454 67.048 104.190 3.002 1.944 0.396 0.521 2.215 3.287 4.396 5.169 31.090 11.079
UTF8Str 1.045 0.838 4.791 3.827 15.763 3.286 0.894 1.047 49.655 2.893 1.652 3.961 26.587 2.190 3.251 2.947 2.780 15.125 3.259
UTF16Str 2.000 0.912 1.918 1.539 16.441 3.452 0.747 1.205 65.099 2.206 1.507 5.083 41.400 1.637 2.727 2.050 2.012 14.373 3.105
UTF32Str 4.000 0.522 47.699 83.763 11.923 3.816 0.227 0.939 140.108 2.530 1.730 2.194 26.197 1.932 1.562 2.149 2.002 8.468 2.421
UniStr 1.000 0.110 1.631 9.863 20.886 1.704 1.136 2.552 17.781 3.655 2.328 1.650 0.875 0.842 1.034 1.183 1.469 8.009 12.407
LatinStr 1.000 0.598 44.128 7.433 17.612 3.440 0.733 1.144 1025.536 5.019 3.344 4.813 41.474 1.153 2.225 3.176 2.286 10.585 23.384
####################################################################################################
File name Lines Chars Avg C/L Empty Min Max MaxType
Telugu.txt 7519 505822 67.273 0 3 123 UCS2Str
Character Types: ASCII Latin1 2-Byte UCS2 UTF32 Surrogate Invalid
Total characters: 76016 0 0 429806 0 0 0
Lines with > 0: 40 0 0 7479 0 0 0
File name Lines Chars Avg C/L Empty Min Max MaxType
Telugu.txt 1000 63500 63.500 0 19 108 UCS2Str
Character Types: ASCII Latin1 2-Byte UCS2 UTF32 Surrogate Invalid
Total characters: 10000 0 0 53500 0 0 0
Lines with > 0: 0 0 0 1000 0 0 0
Type B/Char sizeof length nextind reverse repeat 1 repeat 10 search search iteration sum isascii isvalid isascii isvalid islower isalnum uppercase textwidth
chars string string string char Char char vals string string chars chars chars chars string string
Results: 170500 63500 63500 1000 1000 1000 0 0 0 167279000 0 1000 10000 63500 0 30000 0 70000
String 2.699 0.009 3.090 6.890 39.809 0.489 4.462 111.525 97.045 4.617 9.291 0.163 1.946 9.722 8.758 11.966 12.649 28.687 12.176
UTF8Str 2.699 1.002 10.281 2.442 24.581 5.709 0.984 0.950 72.409 2.731 4.858 4.830 223.461 5.604 5.080 2.826 2.551 7.338 2.829
UTF16Str 2.000 1.099 7.781 2.438 26.992 5.930 1.357 2.183 77.652 3.705 7.466 4.574 210.160 8.051 7.679 3.062 2.957 10.671 3.645
UTF32Str 4.000 1.099 333.110 264.855 38.934 5.797 1.025 2.269 168.766 5.134 10.042 3.473 230.119 10.472 6.466 3.835 3.487 9.620 3.747
UniStr 1.999 0.175 11.178 17.183 45.587 3.250 1.371 1.707 6.550 4.604 11.615 4.059 7.079 5.720 5.247 3.092 2.897 10.086 3.670
UCS2Str 2.000 1.002 350.987 19.606 47.146 5.498 1.225 2.259 58.463 4.667 12.174 4.920 223.461 9.678 5.361 3.543 3.679 9.936 3.955
####################################################################################################
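A note on the B/Char column: for String and UTF8Str, the reported value appears to match the UTF-8 bytes per character of the full file (not the 1000-line sample), computed from the first "Total characters" row of each block. The sketch below is an illustration only, not part of the original benchmark code. The function name `utf8_bytes_per_char` is hypothetical, and mapping the column names to UTF-8 widths (ASCII = 1, Latin1 and 2-Byte = 2, UCS2 = 3, UTF32 = 4) is an assumption, though it reproduces the 2.947 reported for Chinese.txt and the 1.034 reported for French.txt.

```python
# Bytes needed in UTF-8 for one character of each class named in the
# "Character Types" rows. The widths are standard UTF-8 facts; applying
# them to these column names is an assumption made for illustration.
UTF8_WIDTH = {"ASCII": 1, "Latin1": 2, "2-Byte": 2, "UCS2": 3, "UTF32": 4}


def utf8_bytes_per_char(counts):
    """Average UTF-8 bytes per character given per-class character counts."""
    total_chars = sum(counts.values())
    total_bytes = sum(UTF8_WIDTH[name] * n for name, n in counts.items())
    return total_bytes / total_chars


# Full-file "Total characters" rows copied from the Chinese.txt and
# French.txt blocks above.
chinese = {"ASCII": 23172, "Latin1": 0, "2-Byte": 1, "UCS2": 854554, "UTF32": 0}
french = {"ASCII": 549022, "Latin1": 16433, "2-Byte": 171, "UCS2": 1225, "UTF32": 0}

print(f"Chinese.txt: {utf8_bytes_per_char(chinese):.3f}")  # prints 2.947
print(f"French.txt:  {utf8_bytes_per_char(french):.3f}")   # prints 1.034
```

The same arithmetic gives 2.989 for Japanese.txt and 1.796 for Russian.txt, matching the String rows of those blocks.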