{"id":56081,"date":"2022-09-09T10:00:22","date_gmt":"2022-09-09T15:00:22","guid":{"rendered":"https:\/\/languagelog.ldc.upenn.edu\/nll\/?p=56081"},"modified":"2022-09-09T17:11:55","modified_gmt":"2022-09-09T22:11:55","slug":"the-mystery-of-song-u2aa0a-semen","status":"publish","type":"post","link":"https:\/\/languagelog.ldc.upenn.edu\/nll\/?p=56081","title":{"rendered":"The mystery of s\u00f3ng  (U+2AA0A) (\"semen\")"},"content":{"rendered":"<p>Matt Jenkins writes:<\/p>\r\n<div style=\"padding-left: 40px;\"><span style=\"font-family: arial, sans-serif;\">I am hoping you'll indulge a question that's been bugging me. I\u00a0have been trying to improve my fluency by watching as many Chinese online dramas as possible,\u00a0and s<span lang=\"ZH-CN\">\u00f3<\/span>n<span lang=\"ZH-CN\">g<\/span><span lang=\"ZH-CN\">\u00a0(<\/span>U+2AA0A) comes up in show after show. But the character is always quite obviously \"cut-and-pasted\" into the subtitles. I'm (generally) familiar with the character as a simplified form of\u00a0\u379e, and that people usually write\u00a0\u6002\u00a0instead. But <u>why<\/u> is the character practically completely absent from character sets and dictionaries? It's no more offensive than its progenitor\u00a0\u379e, but\u00a0\u379e is far easier to find in character sets. <br \/><\/span><\/div>\r\n<p style=\"padding-left: 40px;\"><span style=\"font-family: arial, sans-serif;\">Jichang Lulu wrote about\u00a0\u379e on the Language Log back in March [see \"Selected readings\" below], but that post didn't include any reference to\u00a0\u00a0\u00a0<span lang=\"ZH-CN\">\u00a0(<\/span>U+2AA0A). 
<\/span><\/p>\r\n<p><!--more--><\/p>\r\n<p>If you want to see what this mysterious character looks like, you can find it <a href=\"https:\/\/glyphwiki.org\/wiki\/u2aa0a\">here<\/a>, along with some really esoteric variants (N.B.).<\/p>\r\n<p>Matt asks a good question: <span style=\"font-family: arial, sans-serif;\">\"<u>why<\/u> is the character practically completely absent from character sets and dictionaries?\"\u00a0 That's hard to say, especially since it's a heck of a lot simpler to write than <\/span><span style=\"font-family: arial, sans-serif;\">\u379e, to which it is equivalent.\u00a0 One surmise I have is that the mystery character is somehow visually reminiscent of that very nasty character \"c<\/span>\u00e0o \u808f\" (\"f*ck\") (graphically \"enter\" + \"flesh \/ meat\") as in \"c\u00e0o n\u01d0 m\u0101 \u808f\u4f60\u5abd\" (\"f*ck your mom\"), which is beyond the pale for all but the most hardened garbage mouths.\u00a0 Of course, one can always, and usually does, avoid <span style=\"font-family: arial, sans-serif;\">\"c<\/span>\u00e0o \u808f\" by punningly writing it as c\u0101o \u64cd (\"exercise\").<\/p>\r\n<p>Another possibility is that <span style=\"font-family: arial, sans-serif;\">\u00a0<span lang=\"ZH-CN\">\u00a0(<\/span>U+2AA0A) echoes the shape of the notorious <\/span><span style=\"font-family: arial, sans-serif;\">b\u012b <\/span>\u5c44 (vulgar \"vulva and \/ or vagina\"), which is similarly nearly always replaced by the homophones \u903c or B.<\/p>\r\n<p>Whatever the reason people avoid writing the \"real\" glyph for \u00a0 (U+2AA0A), the enigma of its absence from character sets, dictionaries, and fonts points to an inalterable verity about the Sinographic writing system:\u00a0 it is essentially open-ended and the number of its potential discrete constituents is infinite.\u00a0 The artist <a href=\"https:\/\/en.wikipedia.org\/wiki\/Xu_Bing\">X\u00fa B\u012bng<\/a> <span lang=\"zh\">\u5f90\u51b0 <\/span>(b. 
1955) proved that with his monumental, epochal installation (1988), \"Ti\u0101n sh\u016b \u5929\u66f8 \/ <span lang=\"zh-CN\">\u5929\u4e66<\/span>\" (\"<a href=\"https:\/\/en.wikipedia.org\/wiki\/A_Book_from_the_Sky\">Book from the Sky<\/a>\") and other artworks (see also \"Selected readings\").\u00a0 Anyone, anywhere, anytime can create his or her very own hanzi \/ kanji \/ hanja \u6f22\u5b57 \/\u00a0 <span lang=\"zh-CN\">\u6c49\u5b57<\/span> (\"Sinoglyph\"), and many do.\u00a0 One can even advocate combining them with emoji.<\/p>\r\n<p>&nbsp;<\/p>\r\n<p><b>Selected readings<\/b><\/p>\r\n<ul>\r\n<li>\"<a href=\"https:\/\/languagelog.ldc.upenn.edu\/nll\/?p=53919\">Mandarin and Manchu semen<\/a>\" (3\/11\/22)<\/li>\r\n<li>\"<a href=\"https:\/\/languagelog.ldc.upenn.edu\/nll\/?p=50104\">Hokkien in Sino-Japanese script<\/a>\" (2\/2\/21)<\/li>\r\n<li>\"<a href=\"http:\/\/itre.cis.upenn.edu\/~myl\/languagelog\/archives\/001689.html\">Semen, green rice and the rate of internet decay<\/a>\" (12\/5\/04)<\/li>\r\n<li>\"<a href=\"https:\/\/languagelog.ldc.upenn.edu\/nll\/?p=22414\">Big WHAT hall<\/a>\" (12\/1\/15)<\/li>\r\n<li>\"<a title=\"Permanent link to Grass-Mud Horse Lexicon Classics\" href=\"http:\/\/languagelog.ldc.upenn.edu\/nll\/?p=6538\" target=\"_blank\" rel=\"bookmark noopener\">Grass-Mud Horse Lexicon Classics<\/a>\" (8\/30\/13)<\/li>\r\n<li>\"<a title=\"Permanent link to Franco-Croatian Squid in pepper sauce\" href=\"http:\/\/languagelog.ldc.upenn.edu\/nll\/?p=1225\" target=\"_blank\" rel=\"bookmark 
noopener\">Franco-Croatian Squid in pepper sauce<\/a> \" (3\/12\/09)<\/li>\r\n<li>\"<a title=\"Permanent link to Punning banned in China\" href=\"http:\/\/languagelog.ldc.upenn.edu\/nll\/?p=16011\" target=\"_blank\" rel=\"bookmark noopener\">Punning banned in China<\/a> \" (11\/29\/14)<\/li>\r\n<li>\"<a title=\"Permanent link to Banned in Beijing\" href=\"http:\/\/languagelog.ldc.upenn.edu\/nll\/?p=12717\" target=\"_blank\" rel=\"bookmark noopener\">Banned in Beijing<\/a> \" (6\/4\/14)<\/li>\r\n<li>\"<a title=\"Permanent link to When intonation overrides tone\" href=\"http:\/\/languagelog.ldc.upenn.edu\/nll\/?p=4673\" target=\"_blank\" rel=\"bookmark noopener\">When intonation overrides tone<\/a> \" (6\/4\/13)<\/li>\r\n<\/ul>\r\n<p>&#8212;&#8212;-<\/p>\r\n<ul>\r\n<li>\"<a title=\"Permanent link to The infinitude of Chinese characters\" href=\"https:\/\/languagelog.ldc.upenn.edu\/nll\/?p=48404\" rel=\"bookmark\">The infinitude of Chinese characters<\/a>\" (9\/9\/20)<\/li>\r\n<li>\"<a title=\"Permanent link to Hanmoji\" href=\"https:\/\/languagelog.ldc.upenn.edu\/nll\/?p=55983\" rel=\"bookmark\">Hanmoji<\/a>\" (9\/4\/22)<\/li>\r\n<li>\"<a title=\"Permanent link to How many more Chinese characters are needed?\" href=\"https:\/\/languagelog.ldc.upenn.edu\/nll\/?p=29034\" rel=\"bookmark\">How many more Chinese characters are needed?<\/a>\" (10\/25\/16)<\/li>\r\n<li>\"<a title=\"Permanent link to Sinographs by the numbers\" href=\"https:\/\/languagelog.ldc.upenn.edu\/nll\/?p=41496\" rel=\"bookmark\">Sinographs by the numbers<\/a>\" (1\/22\/19)<\/li>\r\n<li>\"<a title=\"Permanent link to The unpredictability of Chinese character formation and pronunciation\" href=\"https:\/\/languagelog.ldc.upenn.edu\/nll\/?p=3750\" rel=\"bookmark\">The unpredictability of Chinese character formation and pronunciation<\/a>\" (2\/6\/12) &#8212; includes a long section on the artist <a href=\"http:\/\/en.wikipedia.org\/wiki\/Xu_Bing\">Xu Bing<\/a>'s \"<a 
href=\"http:\/\/www.hanshan.com\/specials\/xubingts.html\">Book from the Sky<\/a>\"<\/li>\r\n<li>\"<a title=\"Permanent link to How to generate fake Chinese characters automatically\" href=\"https:\/\/languagelog.ldc.upenn.edu\/nll\/?p=23143\" rel=\"bookmark\">How to generate fake Chinese characters automatically<\/a>\" (12\/30\/15)<\/li>\r\n<li>\"<a title=\"Permanent link to Chinese characters formed from letters of the alphabet\" href=\"http:\/\/languagelog.ldc.upenn.edu\/nll\/?p=14150\" rel=\"bookmark\">Chinese characters formed from letters of the alphabet<\/a>\" (8\/20\/14)<\/li>\r\n<li>\"<a title=\"Permanent link to Is there a practical limit to how much can fit in Unicode?\" href=\"http:\/\/languagelog.ldc.upenn.edu\/nll\/?p=35136\" rel=\"bookmark\">Is there a practical limit to how much can fit in Unicode?<\/a>\" (10\/27\/17)<\/li>\r\n<li>\"<a title=\"Permanent link to Character crises\" href=\"http:\/\/languagelog.ldc.upenn.edu\/nll\/?p=38784\" rel=\"bookmark\">Character crises<\/a>\" (6\/15\/18)<\/li>\r\n<\/ul>\r\n<p>&nbsp;<\/p>\r\n","protected":false},"excerpt":{"rendered":"<p>Matt Jenkins writes: I am hoping you'll indulge a question that's been bugging me. I\u00a0have been trying to improve my fluency by watching as many Chinese online dramas as possible,\u00a0and s\u00f3ng\u00a0(U+2AA0A) comes up in show after show. But the character is always quite obviously \"cut-and-pasted\" into the subtitles. 
I'm (generally) familiar with the character as [&hellip;]<\/p>\n","protected":false},"author":13,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_exactmetrics_skip_tracking":false,"_exactmetrics_sitenote_active":false,"_exactmetrics_sitenote_note":"","_exactmetrics_sitenote_category":0,"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[316,19,250,57,18],"tags":[],"class_list":["post-56081","post","type-post","status-publish","format-standard","hentry","category-colloquial","category-semantics","category-slang","category-variation","category-writing-systems"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/languagelog.ldc.upenn.edu\/nll\/index.php?rest_route=\/wp\/v2\/posts\/56081","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/languagelog.ldc.upenn.edu\/nll\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/languagelog.ldc.upenn.edu\/nll\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/languagelog.ldc.upenn.edu\/nll\/index.php?rest_route=\/wp\/v2\/users\/13"}],"replies":[{"embeddable":true,"href":"https:\/\/languagelog.ldc.upenn.edu\/nll\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=56081"}],"version-history":[{"count":5,"href":"https:\/\/languagelog.ldc.upenn.edu\/nll\/index.php?rest_route=\/wp\/v2\/posts\/56081\/revisions"}],"predecessor-version":[{"id":56128,"href":"https:\/\/languagelog.ldc.upenn.edu\/nll\/index.php?rest_route=\/wp\/v2\/posts\/56081\/revisions\/56128"}],"wp:attachment":[{"href":"https:\/\/languagelog.ldc.upenn.edu\/nll\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=56081"}],"wp:term":[{"taxonomy":"category",
"embeddable":true,"href":"https:\/\/languagelog.ldc.upenn.edu\/nll\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=56081"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/languagelog.ldc.upenn.edu\/nll\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=56081"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}