Gmail marked this Google antitrust notice as spam. Only caught it because I was looking for another message.

thinkercharmercoderfarmer@slrpnk.net · 15 days ago

This is in some ways an easier problem than classifying LLM vs non-LLM authorship. That only has two possible outcomes, and it’s pretty noisy because LLMs are trained to emulate the average human. Here, you can generate an agreement score based on language features per comment, and cluster the comments by how they disagree with the model. Comments that disagree in particular ways (never uses semicolons, claims to live in Canada, calls interlocutors “buddy”, writes run-on sentences, etc.) would be clustered together more tightly. The more comments two profiles have in the same cluster(s), the more confident the match becomes. I’m not saying this attack is novel or couldn’t be accomplished without an LLM, but it seems like a good fit for what LLMs actually do.

thinkercharmercoderfarmer@slrpnk.net · 16 days ago

Why not? if LLMs are good at predicting mean outcomes for the next symbol in a string, and humans have idiosyncrasies that deviate from that mean in a predictable way, I don’t see why you couldn’t detect and correlate certain language features that map to a specific user. You could use things like word choice, punctuation, slang, common misspellings, sentence structure… For example, I started with a contradicting question, I used “idiosyncrasies”, I wrote “LLMs” without an apostrophe, “language features” is a term of art, as is “map” as a verb, etc. None of these are indicative on their own, but unless people are taking exceptional care to either hyper-normalize their style, or explicitly spiking their language with confounding elements, I don’t see why an LLM wouldn’t be useful for this kind of espionage.

I wonder if this will have a homogenizing effect on the anonymous web. It might become an accepted practice to communicate in a highly formalized style to make this kind of style fingerprinting harder.

thinkercharmercoderfarmer@slrpnk.net · 1 month ago

It depends a lot on what you want to do and a little on what you’re used to. It’s some configuration overhead so it may not be worth the extra hassle if you’re only running a few services (and they don’t have dependency conflicts). IME once you pass a certain complexity level it becomes easier to run new services in containers, but if you’re not sure how they’d benefit your setup, you’re probably fine to not worry about it until it becomes a clear need.

thinkercharmercoderfarmer@slrpnk.net · 2 months ago

Gmail marked this Google antitrust notice as spam. Only caught it because I was looking for another message.

thinkercharmercoderfarmer@slrpnk.net · 2 months ago

thanks!

thinkercharmercoderfarmer@slrpnk.net · 2 months ago

I find they’re faster too. Another benefit of them being distributed is the distributor doesn’t have to pay for all the bandwidth for every download, which I understand can be a considerable savings for smaller distros.

thinkercharmercoderfarmer@slrpnk.net · 2 months ago

what client is that?

thinkercharmercoderfarmer@slrpnk.net · 2 months ago

Haven’t looked at my ratios in a while

thinkercharmercoderfarmer@slrpnk.net · 5 months ago

The Post Office is secretly being controlled by the US Government. If you look at the actual laws of the US it allows the President to appoint someone called the Postmaster General who’s in charge of the whole thing.

I think it would be a pretty good prank to bring this up in a “favorite crazy conspiracy theory” conversation where all but one participant agrees that it’s a baseless conspiracy theory and see if the one other person insists that the Postmaster General Theory is real, or goes along with the crowd. But I really don’t think my friends are coordinated enough to pull it off.