• Lvxferre@mander.xyz
    3 months ago

    It’s actually worse.

    The video focuses on how you’re leaking personal info all the time through the software that you use and the connections that you make, and ways to mitigate it.

    However, have you guys heard about forensic linguistics? That’s how the Unabomber was caught. The way that you use your language(s) is pretty unique to you, and can be used to uncover your identity. This was done manually by two guys, Fitzgerald and Shuy; they were basically identifying patterns in how the Unabomber wrote to narrow down the suspects further and further, until they hit the right guy.

    Now, let’s talk about large “language” models, like Gemini or ChatGPT. Frankly, I believe that people who think that LLMs are “intelligent” or “comprehend language” themselves lack intelligence and language comprehension. But LLMs were made to find and match patterns in written text, and they are rather good at it.

    Are you getting the picture? What Fitzgerald and Shuy did manually 30 years ago can be automated now. And it gets worse: note how those LLMs “happen” to be developed by companies that you can’t trust to die properly (Google, Amazon, Facebook, Apple, Microsoft and its vassal OpenAI).

    So, while the video offers some solid advice regarding privacy, sadly it is not enough. If you’re in some deep shit, and privacy is a life-or-death matter for you, I strongly advise you to always be mindful of what you write and how you write it.

    And, for the rest of us: fighting individually for our right to privacy is not enough. We need to assemble and organise ourselves, to fight on legal grounds against those who are trying to kill it. You either fight for your rights or you lose them.

    Just my two cents. I apologise, as this is only tangentially related to the video, but I couldn’t help it.

      • brbposting@sh.itjust.works
        3 months ago

        “Revise generically: [manifesto]” (certainly not the best prompt)

        Folks seem to like Ollama, per Hacker News threads. Here’s one comment from a coding context:

        Not using Codestral (yet) but check out Continue.dev[1] with Ollama[2] running llama3:latest and starcoder2:3b. It gives you a locally running chat and edit via llama3 and autocomplete via starcoder2.

        It’s not perfect but it’s getting better and better.

        [1] https://www.continue.dev/ [2] https://ollama.com/
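
        A rough sketch of sending that prompt to a local model. It assumes an Ollama server is running on its default local port (11434) with the llama3 model already pulled; the helper names `build_request` and `revise` are made up for illustration, and the prompt is just the one quoted above.

```python
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(text, model="llama3", instruction="Revise generically:"):
    """Build (but do not send) a POST request asking the model to rewrite text."""
    payload = {
        "model": model,
        "prompt": f"{instruction} {text}",
        "stream": False,  # ask for one JSON object instead of a token stream
    }
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def revise(text, **kwargs):
    """Send the request and return the rewritten text (needs a running server)."""
    with urllib.request.urlopen(build_request(text, **kwargs)) as resp:
        return json.loads(resp.read())["response"]
```

        Since the model runs locally, the text never leaves your machine. Whether the rewrite actually hides your style is a separate question; this only shows the plumbing.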

        Please no unabombing though

        Oh wow he wrote a 35k word manifesto… feel like that’s so rare you’d still stand a solid chance at being identified somehow.

    • UmeU@lemmy.world
      3 months ago

      While forensic linguistics is pretty cool, the Unabomber was caught because they released his manifesto, and his brother and his brother’s wife recognized the unusual phrasing, such as ‘eat your cake and have it too’.

      If an author has a large body of known work, then it’s not too difficult to identify other writings by that same author. But if there is no large body of writing known to come from that individual, then the best we can do is determine an approximate age and the geographic location where the individual grew up, and that’s only when the unidentified writing is long enough, like in the case of the Unabomber, whose manifesto was 30k words.
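
      To make the "large body of known work" case concrete, here is a toy sketch of one common stylometric idea: compare how often each text uses common function words. The word list, the function names and the cosine-similarity scoring are all illustrative assumptions, not what investigators actually used, and real attribution needs far more text and far more features.

```python
import math
import re
from collections import Counter

# Illustrative assumption: a tiny set of common English function words.
FUNCTION_WORDS = ["the", "of", "and", "to", "a", "in",
                  "that", "is", "it", "but", "you", "too"]


def profile(text):
    """Relative frequency of each function word in the text."""
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter(words)
    total = len(words) or 1  # avoid dividing by zero on empty text
    return [counts[w] / total for w in FUNCTION_WORDS]


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def rank_candidates(unknown, known_by_author):
    """Rank candidate authors by how closely their known text matches the unknown one."""
    target = profile(unknown)
    scores = {author: cosine(target, profile(text))
              for author, text in known_by_author.items()}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
```

      With only a few sentences per author the scores are mostly noise, which is the point above: the method only gets useful when both the known and the unknown samples are long.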

  • SpookyCoffee@lemmy.world
    3 months ago

    It’s one of the videos on his channel that I deliberately didn’t watch. It’s just depressing to think about.