snsabot: Logged on 2019-08-15 21:16:03 asciilifeform: ~pasting~ e.g. '2000000 ÷ (154 × 1088 × 1.25) = 9.549274255x' dun trigger the barfolade, btw. cuz on civilized box,
lobbes's txt is ALREADY utf8 when yer viewing it. only triggering lobbes's echo , where it pisses the malcoded garbage straight into irc, triggers.
snsabot: Logged on 2018-02-02 11:23:06 asciilifeform: the 'postel's law' nonsense, of silently forgiving people who send liquishit at the dusty disused corners of the protocol, enabling there to even ~be~ such a thing as dusty corners in a protocol!, MUST die.
snsabot: Logged on 2019-08-15 21:27:38 asciilifeform: unless someone supplies, in next coupla days, a simple AND WORKING pill, i'ma do this : anything coming into the bot that dun decode cleanly as utf8, will have the offending chars (i.e. any with 8th bit set) replaced with that glyph that depicts a steaming pile of shit.
lobbes: In either case, I'll be spending some time this evening working on genesis-ing my own current #e logotron. If anything just for posterity
snsabot: Logged on 2019-08-15 11:14:45 asciilifeform: loox like a schoolboy site of the '90s style ( 'learn haxxing! phreaking!' ) and dusty.
mp_en_viaje: but whatever, give kids more "self-esteem"
mp_en_viaje: you know i was at club last night, with the only two cunts anyone'd fuckl in the joint, and this moronic twentysomething (groomed beard, of course) sat next to me while one was off getting more drinks to explain to me what his needs were ?
mp_en_viaje: COMPLETELY the fuck lost in the galaxies & brownie ways witin their own colon
snsabot: Logged on 2019-08-15 14:05:50 asciilifeform: lobbes, mp_en_viaje , et al : strangely, having problem reproducing the unibarf discovered by lobbes . pasting the text from the barf samples into test chan, doesn't produce the expected barf, it gets eaten normally
snsabot: Logged on 2019-08-14 02:06:25 mp_en_viaje: All the CP1252 characters are also available in Unicode. For example the CP1252 character 146 that you mentioned (RIGHT SINGLE QUOTATION MARK) has the Unicode number 8217, therefore you should use this number in order to conform to the HTML standard. Modern HTML browsers like Netscape 4.0 understand Unicode, and will automatically convert the Unicode character ’ back into the character 146 on MS-Windows mach
mp_en_viaje: dangerz]" ? da fuck happened, treason never prospers because if it prosper we'll just call it coincidence ?
mp_en_viaje: pretty fucking epic qntra thursday, wtf! nice BingoBoingo !
snsabot: Logged on 2019-08-15 15:00:45 BingoBoingo: It amazes me how many "Anti-socialist" alt-alt rags are pointedly NOT highlighting Black American Patriot Maurice Hill as a warrior against the state
snsabot: Logged on 2019-08-15 20:31:27 asciilifeform: meanwhile, update re unitardation --
phf's pill doesn't cure -- decoder successfully falls back to 'latin-1', if set up as described, and dun barf ; but ~encoder~ barfs in pg ( cur.execute(query, args) )
snsabot: Logged on 2019-08-15 21:19:29 asciilifeform: why the fuck is this even a good idea ? to support
arbitrary liquishit in logs ? imho the Right Thing would be 'if you want yer logs imported or echoed, put'em into utf8 or stuff up arse' .
mp_en_viaje: anyway, supposedly i should be able to reproduce this, lessee. ×÷√
mp_en_viaje: and of course everyone's favourite from the old days, ¤
mp_en_viaje: the breakage more extensive than asciilifeform diagnosed.
mp_en_viaje: i originally thought one actually needs
http://www.cp1252.com/ to illustrate the breakage (whence i copied the original lines missing from log above), however this is not actually so.
mp_en_viaje: U+00D8 Ø 0216 Ø Latin Capital letter O with stroke
mp_en_viaje: U+0126 Ħ 294 Ħ Latin Capital Letter H with stroke
mp_en_viaje: U+0193 Ɠ 403 Latin Capital Letter G with hook
mp_en_viaje: 474 Latin Small Letter U with diaeresis and caron
mp_en_viaje: U+01DA ǚ 474 Latin Small Letter U with diaeresis and caron
mp_en_viaje: U+0211 ȑ 529 Latin Small Letter R with double grave
mp_en_viaje: U+0236 ȶ 566 Latin Small Letter T with curl
mp_en_viaje: U+0469 ѩ Cyrillic Small Letter Iotified Little Yus
mp_en_viaje: U+049C Ҝ Cyrillic Capital Letter Ka with vertical stroke
mp_en_viaje: ℕ ℤ ℚ ℝ hey wtf stylization is this, R for reals is a double bar-r not a double-r-superimposed! and rationals is Q with a single bar left, not two bars left and right. WTF!
mp_en_viaje: Ⓐ Ⓑ Ⓒ Ⓓ Ⓔ Ⓕ Ⓖ Ⓗ Ⓘ Ⓙ Ⓚ Ⓛ Ⓜ Ⓝ Ⓞ Ⓟ Ⓠ Ⓡ Ⓢ Ⓣ Ⓤ Ⓥ Ⓦ Ⓧ Ⓨ Ⓩ
mp_en_viaje: 🜷 alkali ; 🝤 putrefaction ; 🜀 aether ; 🜰 regular antimony and so following
mp_en_viaje: this then completes our foray into the unpleasant.
☟︎ a111: Logged on 2019-08-16 10:52 mp_en_viaje: let's do some selected testing then.
a111: Logged on 2019-08-16 11:13 mp_en_viaje: this then completes our foray into the unpleasant.
snsabot: Logged on 2019-08-16 07:33:56 mp_en_viaje: let's do some selected testing then.
snsabot: Logged on 2019-08-16 07:55:31 mp_en_viaje: this then completes our foray into the unpleasant.
mp_en_viaje: the only possible conclusion is that unicode support is actually broken on either the flask, the python outright, or some other library asciilifeform is using there. it's true that it was coincidentally discovered by cp1252 translation failure, but it is not limited to that. things missing from nsa log like the micron above are plain unicode and correctly quoted as such by my terminal.
mp_en_viaje: note that both loggers miss some glyphs, rendering them as eg 01f737 in a box. this is different behaviour from simply dropping the line.
mp_en_viaje: now that we're talking about this, i recall a similar (if more restrained) experiment with phf, maybe cca late 2017, and he went away and came back with a properly wroking logger a week or so later -- though i dunno that he ever wrote a blog article
detailing the problem and its solution
mp_en_viaje: trilema/2019-08-10#1927160][feelings] indicate was the rapist" or any such nonsense, neh ?
snsabot: Logged on 2019-08-15 21:12:20 asciilifeform: burned entire evening on this, and is no closer to a ~lead~ to a solution (not even speaking of actual solution) than before.
mp_en_viaje:
http://logs.nosuchlabs.com/log/trilema/2019-08-15#1928982 << there's nothing in principle wrong with this solution. there may be some practical concerns (ie, might want to get a working unicode set first) ; but other than that i don't see the advantage to having numbers-in-a-rectangle, for instance, in preference of a fixed turd.
snsabot: Logged on 2019-08-15 21:27:38 asciilifeform: unless someone supplies, in next coupla days, a simple AND WORKING pill, i'ma do this : anything coming into the bot that dun decode cleanly as utf8, will have the offending chars (i.e. any with 8th bit set) replaced with that glyph that depicts a steaming pile of shit.
snsabot: Logged on 2019-08-15 22:21:36 asciilifeform: btw just as i thought, lobbes's searchtron
fails if fed the liquishit in question.
a111: Logged on 2019-08-09 08:39 mircea_popescu: what can you do, alf just not as good at computing as phf :D
mp_en_viaje shall now be off attending to household dramaz, will bbl tho.
snsabot: asciilifeform: time since my last reconnect : 6d 11h 48m
snsabot: Logged on 2019-08-16 08:00:13 mp_en_viaje: the only possible conclusion is that unicode support is actually broken on either the flask, the python outright, or some other library asciilifeform is using there. it's true that it was coincidentally discovered by cp1252 translation failure, but it is not limited to that. things missing from nsa log like the micron above are plain unicode and correctly quoted as such by my terminal.
a111: Logged on 2016-09-21 14:44 phf: but not necessarily your special encoding. so client will guess the encoding of the line you're sending, and if it can be encoded as latin-1 it'll send it as such, otherwise utf-8
a111: Logged on 2016-09-21 14:45 phf: mircea_popescu's xchat specifically still does that. so his latin-1 encodable text comes out as such, otherwise it gets promoted to utf-8
snsabot: Logged on 2019-08-16 09:49:00 asciilifeform: mp_en_viaje when you wake up, plz describe how you actually got the barfola into yer irc client, so can reproduce.
snsabot: Logged on 2019-08-15 20:31:27 asciilifeform: meanwhile, update re unitardation --
phf's pill doesn't cure -- decoder successfully falls back to 'latin-1', if set up as described, and dun barf ; but ~encoder~ barfs in pg ( cur.execute(query, args) )
phf: asciilifeform: that is something else wrong with your code, i've tested with some local python and it works as expected
phf: well, correct hence "something else wrong", i've only verified what i already knew, that latin-1 decoded to "unicode" is otherwise indistinguishable from any other unicode, and can then be encoded to utf-8
phf: what does SHOW SERVER_ENCODING return for your pg?
phf: asciilifeform: you have a bug in your code, data.strip(b'\r\n').decode('latin-1') is not assigned to data, so ultimately payload is still a latin-1 bytestring
phf: fyi erc supports custom channel encodings
snsabot: Logged on 2019-08-15 21:23:52 asciilifeform: 'I mean literally, the guy's from Washitistan, they write things with their own excrement there, and the Unicode Foundation introduced actual excrement in the standard so now whenever someone asks for the networking code in your project they are delivered physical faeces on cardboard. About fifty eight acres of it. Where would you like this put, sir ?' (tm)(r)(mp)
mp_en_viaje: U+00D8 Ø 0216 Ø Latin Capital letter O with stroke
mp_en_viaje: U+0126 Ħ 294 Ħ Latin Capital Letter H with stroke
mp_en_viaje: U+0193 Ɠ 403 Latin Capital Letter G with hook
mp_en_viaje: 474 Latin Small Letter U with diaeresis and caron
mp_en_viaje: U+01DA ǚ 474 Latin Small Letter U with diaeresis and caron
mp_en_viaje: U+0211 ȑ 529 Latin Small Letter R with double grave
mp_en_viaje: U+0236 ȶ 566 Latin Small Letter T with curl
mp_en_viaje: U+0469 ѩ Cyrillic Small Letter Iotified Little Yus
mp_en_viaje: U+049C Ҝ Cyrillic Capital Letter Ka with vertical stroke
mp_en_viaje: ℕ ℤ ℚ ℝ hey wtf stylization is this, R for reals is a double bar-r not a double-r-superimposed! and rationals is Q with a single bar left, not two bars left and right. WTF!
mp_en_viaje: Ⓐ Ⓑ Ⓒ Ⓓ Ⓔ Ⓕ Ⓖ Ⓗ Ⓘ Ⓙ Ⓚ Ⓛ Ⓜ Ⓝ Ⓞ Ⓟ Ⓠ Ⓡ Ⓢ Ⓣ Ⓤ Ⓥ Ⓦ Ⓧ Ⓨ Ⓩ
mp_en_viaje: 🜷 alkali ; 🝤 putrefaction ; 🜀 aether ; 🜰 regular antimony and so following
lobbesbot: Logged on 2015-08-26 20:04:33: <mircea_popescu> "Or in my case, re-write a cooking book. So ladies and gentlorans, I give you Foxys Euloran Cookbook V1.1,"
snsabot: Logged on 2019-08-16 11:30:55 lobbesbot: Logged on 2015-08-26 20:04:33: <mircea_popescu> "Or in my case, re-write a cooking book. So ladies and gentlorans, I give you Foxys Euloran Cookbook V1.1,"
snsabot: Logged on 2019-08-16 10:42:11 mp_en_viaje: ℕ ℤ ℚ ℝ hey wtf stylization is this, R for reals is a double bar-r not a double-r-superimposed! and rationals is Q with a single bar left, not two bars left and right. WTF!
snsabot: Logged on 2019-08-16 10:41:53 mp_en_viaje: U+0469 ѩ Cyrillic Small Letter Iotified Little Yus
snsabot: Logged on 2019-08-14 13:52:44 mp_en_viaje: i expect large portions of lobbesbot actually salvageable ; spyked was making a lisp one too iirc.
snsabot: Logged on 2019-08-16 07:18:07 mp_en_viaje: pretty fucking epic qntra thursday, wtf! nice BingoBoingo !
BingoBoingo: Apparently Minnesota doesn't have enough room to resettle all of the warm climate "refugees" into proper cold climates?
BingoBoingo: Could also migrate from Alaska to Canada. More easily than Greenland to anywhere else. Gotta keep the inmates in.
auctionbot: Buy order # 1052 has ENDED: No sale. Attn: BingoBoingo
auctionbot: Buy order # 1053 has ENDED: No sale. Attn: BingoBoingo
auctionbot: Buy order # 1054 created by BingoBoingo: 2835 WFF, Wire Only Opening: 283mn ecu Ending: 2019-08-17 14:05:28.848350 UTC (25 hours)
auctionbot: Buy order # 1055 created by BingoBoingo: 500 WFF, WU esta bien Opening: 51mn ecu Ending: 2019-08-17 14:05:29.126412 UTC (25 hours)