Android 16 (SDK 36) is now out and it has added support for the following emoji:
(This is the only change in code point coverage versus SDK 35 that I could detect.)
I've updated Is It Tofu? with the new data:
https://tofu.quest/?q=%F0%9F%AA%89%F0%9F%AA%8F%F0%9F%AA%BE%F0%9F%AB%86%F0%9F%AB%9C%F0%9F%AB%9F%F0%9F%AB%A9
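For anyone whose client shows tofu for the emoji themselves: the query string of the link above carries them percent-encoded as UTF-8, so they can be recovered as code points. A minimal sketch using only the Python standard library; the helper name `query_code_points` is made up for illustration and assumes the text sits in the `q` parameter, as it does in this URL.

```python
from urllib.parse import parse_qs, urlsplit

URL = (
    "https://tofu.quest/?q=%F0%9F%AA%89%F0%9F%AA%8F%F0%9F%AA%BE"
    "%F0%9F%AB%86%F0%9F%AB%9C%F0%9F%AB%9F%F0%9F%AB%A9"
)


def query_code_points(url):
    """Return the code points in the `q` parameter as U+XXXX strings."""
    # parse_qs undoes the percent-encoding and decodes the bytes as UTF-8.
    query = parse_qs(urlsplit(url).query)
    text = query["q"][0]
    return [f"U+{ord(ch):04X}" for ch in text]


if __name__ == "__main__":
    print(" ".join(query_code_points(URL)))
```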
@doctorwhom Ha! At The Mag, remember Ops got this filtering software and we decided to see if we could break it? Tox added an "@" before the <html> tag.
Browser totally rendered the page but filter wouldn't parse it because of the At
I think it took us three hours?
Is there a #Unicode way to represent 100 in a single fullwidth character?
The macOS Character Viewer previews all Unicode space characters as a blank space, so finding any particular space can be a bit of a chore. Here's me trying to find PUNCTUATION SPACE.
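When the previews are all blank, listing the spaces by name can be easier than clicking through them. A rough sketch with Python's `unicodedata`, assuming "space character" means general category `Zs` (space separator); the Character Viewer may group more characters under that label, and the result depends on the Unicode version bundled with your Python. The function name `space_separators` is made up for illustration.

```python
import sys
import unicodedata


def space_separators():
    """Return (code point, name) pairs for every character in category Zs."""
    result = []
    for cp in range(sys.maxunicode + 1):
        ch = chr(cp)
        if unicodedata.category(ch) == "Zs":
            result.append((f"U+{cp:04X}", unicodedata.name(ch)))
    return result


if __name__ == "__main__":
    for code_point, name in space_separators():
        print(code_point, name)
```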
The iOS 18.5 SDK finally came out and the only change for Unicode coverage is the *removal* of a bunch of Sinhala codepoints:
ඁ෦෧෨෩෪෫෬෭෮෯𑇡𑇢𑇣𑇤𑇥𑇦𑇧𑇨𑇩𑇪𑇫𑇬𑇭𑇮𑇯𑇰𑇱𑇲𑇳𑇴
(Those of you on iOS 18.4: Enjoy seeing those glyphs while you can!)
I was doing some particular work on spaces for my font-in-progress, and as a result i extended the research on the 5 year old draft of my Unicode Spaces article and published it: https://home.octetfont.com/blog/unicode-space.html
#Unicode
(i now realise i should've made a click-baity title: Unicode’s top-ten space codepoints, you won’t believe number 7!)
#ASCII was a mistake: It could not even encode all the languages spoken in the USA. This mistake lasted for 19, maybe for years; depending on whether you consider ISO/IEC 8859 or #Unicode a solution.
Anyway: Unicode has existed for 34 years now.
Demanding compatibility with a short-lived mistake, while proper solutions have existed for at least 34 years, has nothing to do with technical debt or compatibility. Demanding it is an agenda. It is #Xenophobia that propagates the superiority of White English-speaking men.
How xenophobic does one have to be to introduce a new parameter `ensure_ascii` to Python's `json.dump()` function and to enable this xenophobic misfeature by default?
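For reference, a small sketch of what that default does in practice, using `json.dumps()` (the string-returning sibling of `json.dump()`). The sample data is made up. Both outputs parse back to the same object; the difference is only whether non-ASCII characters are written literally or as `\uXXXX` escapes.

```python
import json

data = {"greeting": "Grüße"}  # made-up sample value

# Default behaviour: ensure_ascii=True escapes every non-ASCII character.
escaped = json.dumps(data)

# Opting out keeps the characters as they are in the output string.
literal = json.dumps(data, ensure_ascii=False)

if __name__ == "__main__":
    print(escaped)
    print(literal)
```

When writing the literal form to a file, open it with an explicit `encoding="utf-8"` so the platform default does not get in the way.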
I just found out that #Unicode has segment-display digit characters. The below screenshot is all in one font (#JuliaMono). The characters are U+1FBF0 to U+1FBF9. Unicode is gorgeous
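To try them without a screenshot, here is a minimal sketch that maps ordinary ASCII digits onto that range. The helper name `to_segmented` is made up for illustration, and whether the result renders depends on your font covering U+1FBF0 to U+1FBF9.

```python
# The ten segmented digits, in order from zero to nine.
SEGMENTED_DIGITS = [chr(cp) for cp in range(0x1FBF0, 0x1FBFA)]

# Translation table: ASCII '0'..'9' -> U+1FBF0..U+1FBF9.
_TABLE = {ord("0") + i: 0x1FBF0 + i for i in range(10)}


def to_segmented(number):
    """Render an integer with segmented digits; a minus sign passes through."""
    return str(number).translate(_TABLE)


if __name__ == "__main__":
    print("".join(SEGMENTED_DIGITS))
    print(to_segmented(2025))
```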
Curl developer warns: Unicode trick endangers software projects on GitHub - Golem.de
https://www.golem.de/news/curl-entwickler-warnt-unicode-trick-gefaehrdet-softwareprojekte-auf-github-2505-196314.html #Unicode #Github
Detecting malicious #Unicode
https://daniel.haxx.se/blog/2025/05/16/detecting-malicious-unicode/
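A crude sketch of the general idea, not the check described in the linked post: report every character outside ASCII with its position and name, so that a look-alike letter in a URL stands out in review. This assumes the text being scanned is expected to be pure ASCII, which will not hold for projects with legitimate non-ASCII content; the function name `find_non_ascii` and the sample URLs are made up.

```python
import unicodedata


def find_non_ascii(text):
    """Return (line, column, code point, name) for each non-ASCII character."""
    findings = []
    for line_no, line in enumerate(text.splitlines(), start=1):
        for col, ch in enumerate(line, start=1):
            if ord(ch) > 0x7F:
                name = unicodedata.name(ch, "<unnamed>")
                findings.append((line_no, col, f"U+{ord(ch):04X}", name))
    return findings


# The second URL swaps the Latin "a" for the Cyrillic look-alike U+0430.
SAMPLE = "https://www.example.com/\nhttps://www.ex\u0430mple.com/"

if __name__ == "__main__":
    for finding in find_non_ascii(SAMPLE):
        print(finding)
```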