Reports folder: Difference between revisions

70 bytes added ,  06:23, 1 July 2021
m
mNo edit summary
Line 163: Line 163:


== Encoding ==
== Encoding ==
Remember, most modern web pages (including the standard web templates provided with latest releases of both flavours of Cumulus) use UTF-8 encoding.  The only problem is MX was not defaulting to this for NOAA reports in earlier releases (default changed in 3.10.0).


=== Why does encoding matter?===
=== Why does encoding matter?===
Line 211: Line 213:
This means that when you read a file you probably find the letters A to Z where you expect them, but whether you see correct case cannot be guaranteed. Some encodings put capital letters at lower binary values than lower case letters, and some put capitals at higher binary values.  
This means that when you read a file you probably find the letters A to Z where you expect them, but whether you see correct case cannot be guaranteed. Some encodings put capital letters at lower binary values than lower case letters, and some put capitals at higher binary values.  


If you use 7 bits, you have 127 combinations, enough for standard 26 letters in both capitals, and lower case, plus 10 digits (0 to 9), some punctuation, and some control characters (like new line, end of file, and so on). If you use 8 bits, a whole byte, you have 254 combinations, and you can start coping with accented letters, with alphabets that don't have 26 letters, and even add some symbols. Obviously, once you start using more than one byte, you can have 16, 32, 64, or even more bits to use and can include lots more characters and the bigger character sets start including lots of symbols and the biggest add smilies or emotion icons.
If you use 7 bits, you have 127 combinations, enough for standard 26 letters in both capitals, and lower case, plus 10 digits (0 to 9), some punctuation, and some control characters (like new line, end of file, and so on). If you use 8 bits, a whole byte, you have 254 combinations, and you can start coping with accented letters, with alphabets that don't have 26 letters, and even add some symbols. Obviously, once you start using more than one byte, you can have 16, 32, 64, or even more bits to use and can include lots more characters and the bigger character sets start including lots of symbols and the biggest add smilies or emotion icons.
Remember, most modern web pages (including the standard web templates provided with both flavours of Cumulus) use UTF-8 encoding.  The only problem is MX not defaulting to this for NOAA reports.
5,838

edits