Gentoo Forums
Gentoo Forums
Gentoo Forums
Quick Search: in
I saw a new article - Gentoo refusing AI generated updates
View unanswered posts
View posts from last 24 hours

 
Reply to topic    Gentoo Forums Forum Index Other Things Gentoo
View previous topic :: View next topic  
Author Message
lyallp
Veteran
Veteran


Joined: 15 Jul 2004
Posts: 1559
Location: Adelaide/Australia

PostPosted: Wed Apr 17, 2024 5:58 am    Post subject: I saw a new article - Gentoo refusing AI generated updates Reply with quote

If this is true, then....

GO Gentoo! Awesome!

I guess there will be some push back.

The other thing is I guess it will be kinda hard to differentiate AI generated and human generated.

:)
_________________
...Lyall
Back to top
View user's profile Send private message
rfx
Tux's lil' helper
Tux's lil' helper


Joined: 19 Apr 2023
Posts: 127
Location: de-by

PostPosted: Wed Apr 17, 2024 6:17 am    Post subject: Reply with quote

Do you have a source for this information?
Over the last 30 years I have often read that the world is about to end. Hasn't happened yet.
Back to top
View user's profile Send private message
lyallp
Veteran
Veteran


Joined: 15 Jul 2004
Posts: 1559
Location: Adelaide/Australia

PostPosted: Wed Apr 17, 2024 6:20 am    Post subject: Reply with quote

https://www.theregister.com/2024/04/16/gentoo_linux_ai_ban/

The end is getting close, with xz hack, middle east, inflation, climate change, cost of living, sheesh, my keyboard just wore out listing the issues....

:)
_________________
...Lyall
Back to top
View user's profile Send private message
Chiitoo
Administrator
Administrator


Joined: 28 Feb 2010
Posts: 2575
Location: Here and Away Again

PostPosted: Wed Apr 17, 2024 6:44 am    Post subject: Reply with quote

Here's a quote from the mail quoting the approved statement:

Quote:
Hello,

On 2024-04-14, the Gentoo Council has unanimously approved the new AI
policy. The original wording from the mailing list thread was approved:

"""
It is expressly forbidden to contribute to Gentoo any content that has
been created with the assistance of Natural Language Processing
artificial intelligence tools. This motion can be revisited, should
a case been made over such a tool that does not pose copyright, ethical
and quality concerns.
"""

I have started drafting a Wiki page detailing this at [1]. We will also
look into how best provide this new information to our contributors.

[1] https://wiki.gentoo.org/wiki/Project:Council/AI_policy

--
Best regards,
Michał Górny

- https://marc.info/?l=gentoo-dev&m=171324172908553&w=2
_________________
Kindest of regardses.
Back to top
View user's profile Send private message
kgdrenefort
Apprentice
Apprentice


Joined: 19 Sep 2023
Posts: 186
Location: Somewhere in the 77

PostPosted: Wed Apr 17, 2024 8:37 am    Post subject: Reply with quote

Hello,

That could sounds naive, but I have to strongly say before hands that I agree with the decision, I do not understand, as a baby wannabe dev (I'm bad and not skilled or experienced enough to have a proper idea of all of this) I'm wondering why not use AI for… Code checking.

I know that is not something that is set by snapping your fingers, of course.

I do understand that using AI is, mostly, a bad idea. Myself I could use it for Gentoo french translation, either full translation (what is the point of human, then ?), or partially (which I did a few times, because I wasn't able to translate properly a difficult sentence, simple as that), but I avoid it.

But, checking code for "obvious" but unsee error, bugs or worst: security hole.

As Linus said, about AI incoming into the dev IT field:

Quote:
Looking ahead, Hohndel said, we must talk about "artificial intelligence large language models (LLM). I typically say artificial intelligence is autocorrect on steroids. Because all a large language model does is it predicts what's the most likely next word that you're going to use, and then it extrapolates from there, so not really very intelligent, but obviously, the impact that it has on our lives and the reality we live in is significant. Do you think we will see LLM written code that is submitted to you?"

Torvalds replied, "I'm convinced it's gonna happen. And it may well be happening already, maybe on a smaller scale where people use it more to help write code." But, unlike many people, Torvalds isn't too worried about AI. "It's clearly something where automation has always helped people write code. This is not anything new at all."


It has to be seen, as he said (I agree), as an automation tool.

Refusing AI, maybe not now but in the future, to help peoples doing a better job (while not replacing them, for different reason and mostly the best argument would be morality), is in my opinion a problem.

It should never replace someone's job, otherwise than a really useless job (as we had with lamplighter, for example, it was just becoming useless and we had better alternatives with time). Maybe one day AI will do most of the works. That is something a bit scary, but humanity have to adapt to the new technology.

No tech is bad by itself, it's what you do with it.

Otherwise, let's just start the Butlerian Jihad :wink: !

Regards,
GASPARD DE RENEFORT Kévin
_________________
«Gentoo does not have problems, only learning opportunities.» - NeddySeagoon
«If your Gentoo installation isn't valuable to you, feel free to continue to ignore the instructions.» - figueroa
Back to top
View user's profile Send private message
saturnalia0
Tux's lil' helper
Tux's lil' helper


Joined: 13 Oct 2016
Posts: 136

PostPosted: Wed Apr 17, 2024 12:49 pm    Post subject: Reply with quote

Policy (WIP): https://wiki.gentoo.org/wiki/Project:Council/AI_policy

Let me start by saying I just contribute helping others online and donating when I can, I'm not the one actually maintaining the software, so take my opinion for what it's worth...

As for the stated quality concerns, I understand that some people blindly use whatever AI spits out, just like some people copy and paste from StackOverflow without understanding what they were copying. Should StackOverflow be banned? How would one even enforce either one of these? Human code reviews are still required to merge...

As for the stated ethical concerns, yes, AI requires computational power, so does thousands/millions of Gentoo boxes compiling things. Should Gentoo switch to binary distribution, then? Using an IDE and browser on X.org on a modern i5/i7 with a GPU instead of a console on an 8086 also uses more energy. Should contributions be restricted to the ones typed with vim on a tty? As long as it's productive usage, the energy use makes sense... As for the other concerns over spam and scams, I don't understand how accepting code partially generated with CoPilot contribute to it.

As for the stated copyright concerns, I'm no legal expert, but it seems like the horse has left the barn on this one. Every big tech company I know of already has code that had some sort of AI generation involved in the development process merged deeply into their trunks. I just don't see the entire software industry getting sued over this.

I'm usually a late adopter of things, and it was no different with AI code completion. Nowadays I use it daily and it increases productivity dramatically for me. Just like touch typing, visual/insert modes, autocompletion and an IDE. CoPilot is like auto-complete on steroids, you still need to read and understand what it's spitting out. So it seems to me that banning it is counterproductive.
Back to top
View user's profile Send private message
Rad
Guru
Guru


Joined: 11 Feb 2004
Posts: 401
Location: Bern, Switzerland

PostPosted: Wed Apr 17, 2024 1:36 pm    Post subject: Reply with quote

I see this as just as much of a mistake. It affects tooling and programmer productivity and it is mostly unenforceable anyhow. On claim seems to be that AI is pretty bad, but it already often is better than (above) average people and this trend will likely just become more extreme. No offense intended to the most skilled Gentoo devs and users who do their thing with the greatest skill and care and rarely/never forget anything while doing huge documentation works or test cases or filling in boilerplate code or other things.

Are you going to refuse particularly good contributions for fear that they might be AI-generated? How do you check for this anyhow, will Gentoo run an AI to check it's not accepting AI work...? Spam-wise, I guess we could also talk about the harm that Linux distros in general do by being able to send emails in any way other than slow human keyboard input. Will humans like AI have to prove they licensed their learning from patterns? What if AI is humanlike in intelligence and by all reasonable tests equally sentient, does such an AI have to forever comply/pay for what patterns it learned unless it stays 100% away from anything that is still copyrightable? And so on.

But realistically, all of this won't be solved on Gentoo's side anyhow. It would simply make far more sense to integrate into tooling or even operate open source AI and to take the productivity gains while doing a little something so that some of the most powerful tools ever -AI and AI models- stay open source. If national legislation requires x to be done, then maybe you do x. Else let people use what is reasonably shareable as open source between developers (for reasons of reproduction, shared tooling, and so on). Yes, simply use AI whenever and wherever it is effective, fun, useful, all the other things people might do with/on/for Gentoo.
Back to top
View user's profile Send private message
szatox
Advocate
Advocate


Joined: 27 Aug 2013
Posts: 3140

PostPosted: Wed Apr 17, 2024 1:49 pm    Post subject: Reply with quote

Quote:
As for the stated copyright concerns, I'm no legal expert, but it seems like the horse has left the barn on this one. Every big tech company I know of already has code that had some sort of AI generation involved in the development process merged deeply into their trunks. I just don't see the entire software industry getting sued over this.
The funniest part is that AI does a very similar thing to what humans do:
it discovers patterns in the data it's fed and fills gaps in other data with discovered patterns.
Sure, there is a difference in quantity and abstraction level, but both AI and humans need some input to learn common features before producing any useful output, so slapping "copyright" or "ethical concerns" sounds just like a bad joke and I wish people stopped legitimizing this line of thinking... It's about as sensible as charging kids for scribbling on the last page of their notebooks.

Quality is a real concern, since AI lacks the depth and understanding. There are useful automated tools specialized for a particular job, like e.g. static analyzers, but language models specifically are just not fit doing any work in tech, so banning them specifically is a good policy. Still, justifying the decision with wrong reasoning just to extend the list actually makes the argument weaker... And might have unintended consequences down the line, when said wrong reasoning is extrapolated to something else.
_________________
Make Computing Fun Again
Back to top
View user's profile Send private message
Rad
Guru
Guru


Joined: 11 Feb 2004
Posts: 401
Location: Bern, Switzerland

PostPosted: Wed Apr 17, 2024 2:09 pm    Post subject: Reply with quote

szatox wrote:
Quality is a real concern, since AI lacks the depth and understanding.

I'll again place my opinion that this is not an universal fact at all anymore. AI made remarkable progress, it's already *very* powerful in summarizing documents or even internet discussions, including open source tooling running LLM in the hands of users who specialized it a little to get short summaries or whatever they wanted. I say it would already *increase* rather than decrease availability-quality of documentation, or the understanding of bug reports. That it should not be completely trusted... isn't that the same for working with tired hobbyist humans?

IF you focus on running multiple AI within your control (->the ones your developers like best), you can get a rough understanding how reliable or not they are individually, and of course replace or disable or just be aware of and warn about what doesn't work well enough in the context where it doesn't quite work.
Back to top
View user's profile Send private message
pingtoo
l33t
l33t


Joined: 10 Sep 2021
Posts: 926
Location: Richmond Hill, Canada

PostPosted: Wed Apr 17, 2024 2:34 pm    Post subject: Reply with quote

Rad wrote:
szatox wrote:
Quality is a real concern, since AI lacks the depth and understanding.

I'll again place my opinion that this is not an universal fact at all anymore. AI made remarkable progress, it's already *very* powerful in summarizing documents or even internet discussions,

I wonder if we take Gentoo's Wiki to AI, will AI produce better document (and summary)? Will Gentoo team accept this result?
Back to top
View user's profile Send private message
Hu
Moderator
Moderator


Joined: 06 Mar 2007
Posts: 21651

PostPosted: Wed Apr 17, 2024 3:01 pm    Post subject: Reply with quote

I was not involved in the discussion, nor in the subsequent vote. My take on it is that:
  • Regarding copyright, most countries have copyright laws that are poorly suited to software, at best. For an organization that cannot afford to spend millions defending against a copyright lawsuit, extreme caution seems like the right approach to me. Once major jurisdictions have clear statutory or case law holding that AI output is not subject to the copyright of the input training data, this will be less of a concern. I am not aware of any jurisdictions that have said that it is subject, or is not subject, so the cautious approach is to assume a court might decide that the copyright on the input training data does transfer to the AI output. Most, if not all, AI output currently has poor attribution, so it is difficult to determine whether the input training data was subject to an enforceable copyright, and if it was, then whether the input had an acceptable license (such as BSD/MIT for any purpose, or GPL for projects that accept GPL contributions) that would permit its use even if a court does rule that the copyright is passed down.
  • Regarding quality, my opinion is that the submission's quality should be judged independent of the source. If the submitter's work is of good quality, whether because the submitter is a good author, because the submitter took poor quality AI output and manually improved it, or because the submitter used a high quality AI that produces naturally good output, does not matter. However, at present, AI output is often not of a quality that it can be submitted verbatim, and I would not want reviewers (whose time is often very precious) to waste their time picking through poor quality AI output. Establishing a rule that summarily rejects AI output is heavy-handed, but effective. Once submitters provide output that is not obviously poor quality, this point will become difficult to enforce, but also unnecessary.
  • Regarding the other concerns, I have not read enough to form an opinion.
Back to top
View user's profile Send private message
spica
Apprentice
Apprentice


Joined: 04 Jun 2021
Posts: 288

PostPosted: Wed Apr 17, 2024 6:28 pm    Post subject: Reply with quote

AI can enhance existing texts by providing a means for authors to have their work reviewed and improved, either by seeking feedback from other individuals or utilizing AI tools for refinement. This practice, when employed by knowledgeable individuals seeking to refine their work, is undoubtedly beneficial. This is undoublety beneficial for this forum, because AI helps to make text more understandable and clear.

However, a significant concern arises when individuals lacking expertise rely solely on AI to generate complex content, such as code. In such cases, if an inexperienced user utilizes AI-generated content and distributes it to others, who then must expend valuable time deciphering and correcting the nonsensical output? This practice obviously must be prohibited.

One critical issue surrounding this practice of AI usage pertains to licensing. It remains unclear how the responsibility for the generated content is attributed to the AI authors. Most AI tools are accessed via non-GUI APIs, which may not inherently include licensing information. To address this, it is imperative to tightly integrate licensing information with the output provided by AI. Without this cohesive integration, establishing a clear connection between the final product, crafted by a skilled engineer and submitted to AI for verification, becomes very challenging.

At the end, Google's search results are generated using AI algorithms. Does Michał's message mean we must avoid Google usage?

Absolutely, defining clear boundaries for AI usage is important for ensuring its ethical and responsible usage. Without such boundaries, AI could start to be more than aimless noise, lacking direction and purpose. By establishing well-defined limits and guidelines, we can harness the potential of AI while mitigating potential risks and ensuring that its impact aligns with our values and objectives.
Back to top
View user's profile Send private message
Display posts from previous:   
Reply to topic    Gentoo Forums Forum Index Other Things Gentoo All times are GMT
Page 1 of 1

 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum