Judge backs AI firm over use of copyrighted books

Davriellelouna@lemmy.world · 22 days ago

Grimy@lemmy.world · 22 days ago

80% of the book market is owned by 5 publishing houses.

They want to create a monopoly around AI and kill open source. The copyright industry is not our friend. This is a win, not a loss.

Sentient Loom@sh.itjust.works · 22 days ago

How exactly does this benefit “us”?

gaylord_fartmaster@lemmy.world · 22 days ago

Because books are used to train both commercial and open source language models?

Sentient Loom@sh.itjust.works · 21 days ago

> used to train both commercial

commercial training is, in this case, stealing people’s work for commercial gain

> and open source language models

so, uh, let us train open-source models on open-source text. There’s so much of it that there’s no need to steal.

> ?

I’m not sure why you added a question mark at the end of your statement.

gaylord_fartmaster@lemmy.world · 21 days ago

> I’m not sure why you added a question mark at the end of your statement.

I was questioning whether or not you would see that as a benefit. Clearly you don’t.

Are you also against libraries letting people borrow books since those are also lost sales for the authors, or are you just a luddite?

OmegaMouse@pawb.social · 22 days ago

What, how is this a win? Three authors lost a lawsuit to an AI firm using their works.

Grimy@lemmy.world · 22 days ago

The lawsuit would not have benefited their fellow authors, only their publishing houses and the big AI companies.

hendrik@palaver.p3x.de · 22 days ago

Keep in mind this isn’t about open-weight vs other AI models at all. This is about how training data can be collected and used.

bob_omb_battlefield@sh.itjust.works · 22 days ago

If you aren’t allowed to freely use data for training without a license, then the fear is that only large companies will own enough works or be able to afford licenses to train models.

Nomad Scry@lemmy.sdf.org · 22 days ago

If they can just steal a creator’s work, how do they suppose creators will be able to afford continuing to be creators?

Right. They think we have enough original works that the machines can just make any new creations.

😠

MudMan@fedia.io · 22 days ago

It is entirely possible that the entire construct of copyright just isn’t fit to regulate this and the “right to train” or to avoid training needs to be formulated separately.

The maximalist, knee-jerk assumption that all AI training is copying is feeding into the interests of, ironically, a bunch of AI companies. That doesn’t mean that actual authors and artists don’t have an interest in regulating this space.

The big takeaway, in my book, is copyright is finally broken beyond all usability. Let’s scrap it and start over with the media landscape we actually have, not the eighteenth century version of it.

bob_omb_battlefield@sh.itjust.works · 22 days ago

Yeah, I guess the debate is which is the lesser evil. I didn’t make the original comment but I think this is what they were getting at.

Grimy@lemmy.world · edit-2 22 days ago

Yes, precisely.

I don’t see a situation where the actual content creators get paid.

We either get open source AI, or we get closed AI where the big AI companies and copyright companies make bank.

I think people are having huge knee-jerk reactions and end up supporting companies like Disney, Universal Music and Google.

Grimy@lemmy.world · 22 days ago

Because of the vast amount of data needed, there will be no competitive, viable open source solution if half the data is kept in a walled garden.

This is about open weights vs closed weights.

the_q@lemmy.zip · 22 days ago

An 80-year-old judge on their best day couldn’t be trusted to make an informed decision. This guy was either bought or confused into his decision. Old people gotta go.

FaceDeer@fedia.io · 22 days ago

Did you read the actual order? The detailed conclusions begin on page 9. What specific bits did he get wrong?

AbouBenAdhem@lemmy.world · 22 days ago

IMO the focus should have always been on the potential for AI to produce copyright-violating output, not on the method of training.

Artisian@lemmy.world · edit-2 21 days ago

Plaintiffs made that argument and the judge shot it down pretty hard: that kind of competition isn’t what copyright protects against. He makes an analogy with teachers teaching children to write fiction: they are using existing fantasy to create MANY more competitors on the fiction market. Could an author use copyright to challenge that use?

Would love to hear your thoughts on the ruling itself (it’s linked by Reuters).

Sculptus Poe@lemmy.world · edit-2 21 days ago

If you try to sell “the new adventures of Doctor Strange, Jonathan Strange and Magic Man,” existing copyright laws are sufficient and will stop it. Really, training should be regulated by the same laws as reading. If they can get the material through legitimate means it should be fine, but pulling data that is not freely accessible should be theft, as it is already.

devfuuu@lemmy.world · 22 days ago

That “freely” there really does a lot of hard work.

Sculptus Poe@lemmy.world · edit-2 22 days ago

It means what it means; “freely” pulls its own weight. I didn’t say “readily” accessible. Torrents could be viewed as “readily” accessible, but they couldn’t be viewed as “freely” accessible because at the very least you bear the guilt of theft. Library books are “freely” accessible, and if somehow the training involved checking out books and returning them digitally, it should be fine. If it is free to read into neurons, it is free to read into neural systems. If payment for reading is expected, then it isn’t free.

Womble@lemmy.world · 21 days ago

Civil cases of copyright infringement are not theft, no matter what the MPAA have trained you to believe.

US Judge sides with AI firm Anthropic over copyright issue