• swnt@feddit.de
    English
    arrow-up
    103
    arrow-down
    1
    ·
    1 year ago

    The EU AI Act would require the company to disclose details of its training methods and data sources.

    If we’re not going to make this apply to AI companies, which already have disproportionate power, then what else is there to talk about?

    • tryptaminev 🇵🇸 🇺🇦 🇪🇺@feddit.de
      English
      arrow-up
      20
      ·
      edit-2
      1 year ago

      Understandable AI is an important field in machine learning that is about understanding how a model came to its conclusions based on the data. This is crucial for applying AI tools to anything beyond writing silly haikus. An AI company that denies access to that basically wants its customers to use its tools like a fortune teller.

      “Yes, the computer read that in the stars. How, why, or how reliable is the result? Dunno, but it says so, so it must be true. And now off to prison, young black man with a good job and no criminal record. The AI predicted you would commit a crime in 10 years.”

      EDIT: To give an example from a lecture I had: the task was picture classification, and one model reliably recognized pictures of a horse in the training data set but failed to recognize horses outside of it. It turns out all the horse pictures in the training set had a watermark text at the bottom, which the model had picked up as the defining feature. And that is a very simple task in comparison.

      OpenAI not wanting to disclose its training methods and data sources indicates that there could be a lot of garbage like this in its models.

      • GregorGizeh@lemmy.ml
        English
        arrow-up
        8
        ·
        1 year ago

        This is a great point I hadn’t even considered yet, even though I am already very wary and sceptical of capitalism developing this next revolution.

        How can the user possibly trust an AI that is for all intents and purposes a secretive stranger with an agenda and values you don’t know? Especially because capitalism will only develop a slave to their profits, they would never create an actual intelligence with free will the user could actually get to “know” and trust, it would never constitute a person in the philosophical sense.

        The whole thing is creepy and dystopian come to think about it… we allow the worst of humanity to shape and bind what will essentially be a superhuman entity to their will.

  • uhh@lemmy.world
    English
    arrow-up
    45
    arrow-down
    1
    ·
    1 year ago

    keeping information like training methods and data sources secret was necessary to stop its work being copied by rivals.

    In addition to the possible business threat, forcing OpenAI to identify its use of copyrighted data would expose the company to potential lawsuits. Generative AI systems like ChatGPT and DALL-E are trained using large amounts of data scraped from the web, much of it copyright protected.

    These two paragraphs one after the other really brightened my day.

    • maynarkh@feddit.nl
      English
      arrow-up
      6
      ·
      1 year ago

      I wonder when the big copyright trolls will show up at their doorstep instead of pestering random retirees for pirating Matlock.

  • Phanatik@kbin.social
    arrow-up
    38
    arrow-down
    3
    ·
    1 year ago

    I’ve talked about this so much but nobody bloody listens. I sound like I’m crazy sometimes but it’s fucking real.

    You don’t know what the AI is doing so you have no reason to trust it beyond an expectation that it will give you accurate information but that’s not guaranteed.

    They don’t have permission to use the vast amounts of information they’ve scraped from the internet to train an AI model. No one gave OpenAI the permission to commercialise the use of their content in an AI model.

    It was all well and good when they were a non-profit but they’re selling products now. AI trained on our data and content we produced.

  • febra@lemmy.world
    English
    arrow-up
    29
    ·
    1 year ago

    Asking these companies to disclose the sources they’re training their algorithms on just makes sense. You can’t allow companies to build for-profit algorithms on copyrighted material that they haven’t paid for.

    • 50gp@kbin.social
      arrow-up
      2
      ·
      1 year ago

      hopefully legislation catches up and makes these models not commercially usable if they haven’t paid for any of the sources

  • Tashlan@kbin.social
    arrow-up
    28
    ·
    1 year ago

    What does the Open in the name stand for, then?

    Very, very tired of companies embracing openness and share-alike mentalities when it’s their turn to take and then skulking out when it’s their time to give. Reminds me of how Crunchyroll started off selling a subscription to stream other people’s pirated and fansubbed anime.

    • magic_lobster_party@kbin.social
      arrow-up
      6
      ·
      1 year ago

      AI was already one of the most publicly open scientific fields before OpenAI grew to dominance. Google, Microsoft and Facebook were all making significant open contributions to the field.

      OpenAI is reversing all that. They’re not releasing any code or data researchers could experiment with. Completely opposite of what their name suggests.

      Meta deserves a lot of the hate they get, but at least they’re still one of the most significant open contributors to the field, most recently with the release of the Llama models.

    • DarkThoughts@kbin.social
      arrow-up
      1
      ·
      1 year ago

      I’m not super into their history, but afaik they started out open & non-profit (allegedly; they had people like Musk on board, so take that with a grain of salt) and over the years basically did a full 180. lol

  • maynarkh@feddit.nl
    English
    arrow-up
    9
    ·
    1 year ago

    Can’t or won’t?

    “But your honor, that would be devastating to my client’s case!”

  • antonymous@feddit.de
    English
    arrow-up
    5
    ·
    edit-2
    1 year ago

    Well, from the looks of it, the US is not far behind in its efforts to regulate. It’s hard to see this as anything more than a negotiation tactic.

  • Cardinal@lemmy.pt
    English
    arrow-up
    2
    ·
    1 year ago

    Too much hype on that one, like “Italy to ban” bla bla bla, and everything is OK now.