I have been working on this diagram to explain the Unicode encoding formats. I haven't released it yet, but I figured this video might help some people understand how the formats work.

UTF-8 encoder:
UTF-8 decoder:
Unicode Deep Dive with UTF-16: https://www.youtube.com/watch?v=ZQRAMHiVEzs
PDF:
0:00 - What Is Unicode
0:52 - Codepoints
2:30 - The Binary Blob
5:00 - UTF-32
6:24 - UTF-16 And UCS-2
13:51 - UTF-8
21:52 - Size Comparisons