Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> Microsoft char­ac­ter­izes the out­put of Copi­lot as a series of code "sug­ges­tions". Microsoft "does not claim any rights" in these sug­ges­tions. But nei­ther does Microsoft make any guar­an­tees about the cor­rect­ness, secu­rity, or exten­u­at­ing intel­lec­tual-prop­erty entan­gle­ments of the code so pro­duced. Once you accept a Copi­lot sug­ges­tion, all that becomes your prob­lem:

> "You are respon­si­ble for ensur­ing the secu­rity and qual­ity of your code. We rec­om­mend you take the same pre­cau­tions when using code gen­er­ated by GitHub Copi­lot that you would when using any code you didn’t write your­self. These pre­cau­tions include rig­or­ous test­ing, intel­lec­tual prop­erty scan­ning, and track­ing for secu­rity vul­ner­a­bil­i­ties."

I can't help but recall:

"Linux is a cancer that attaches itself in an intellectual property sense to everything it touches."

- Steve Ballmer, while CEO of Microsoft



intel­lec­tual prop­erty scan­ning

With "normal" code I can generally see (or figure out) who posted/published it and reach out for explicit permission. It's not uncommon for me to do this.

How is one supposed to do that for the generated stuff? Seems like an awefully hands-off attitude. As challenging as it is, they really ought to be qualifying the input samples of training code before ingesting.


There are some techniques used mostly to detect when students copy paste code. I've seen some of the tools in that space and they have varying degrees of accuracy. MOSS is a common one[0].

There are some vendors in this space too (BlackDuck comes to mind) but they're $$$ so only within the scope of large corporations.

If anybody has any ideas relating to this type of analysis, I'd be excited to chat. I am working on a project[1] in this space for "Software Composition Analysis" which could potentially overlap with snippet detection for code like Co-Pilot. (We basically just have a big pipeline of analysis jobs that run on code and store the results. I need to update the docs!)

0: https://yangdanny97.github.io/blog/2019/05/03/MOSS

1: https://github.com/lunasec-io/lunasec/tree/master/lunatrace


I don't think it's right to characterize it as hands off after they had their hands all up in the generated code. It's just malfeasant. They've produced a tool that is fundamentally (legally) unsafe to use and said that's not their problem.


Could you help me understand the link between the two?


It isn't so much a connection as an example of cognitive dissonance from the organisation.

On the one hand stating plainly that mixing in copy-left code and similar can be disastrously dangerous because it is a rampant virus. On the other hand not understanding why people think it might be a problem that their tool could encourage mixing in copy-left code.


Microsoft released a product which gives you cancer the moment you use it.

According to the opinions about what inclusion of open source code into your projects does, as per the ex-CEO of the company. That seems a bit of a far fetched conclusion, but then, Ballmer did say it.


The point is not clear, but if I were to guess, it's that Github Copilot should come with a California Prop 65 warning, because it can give your code "cancer" (GPL-licensed snippets from sources like Linux codebas).


Linux is open source and Ballmer is displaying Microsoft’s negative attitude towards open source that is demonstrated in the author’s arguments regarding copilot.


> rig­or­ous test­ing, intel­lec­tual prop­erty scan­ning, and track­ing for secu­rity vul­ner­a­bil­i­ties

Seems like best practice recommendation that everyone should apply when downloading a torrent.


Seems like we need MITpilot.


> Steve Ballmer

They have some really good blow in Redmond.

If anybody could win an award for being coked up and sweaty on stage...

https://www.youtube.com/watch?v=Vhh_GeBPOhs


Fun story: That was my first employee town hall, in 2000. I was concerned for the fellow (and so very glad when he left, Satya has been so so so much better for the company and morale). It was definitely an... interesting introduction to the company.

See also this Domo video that turned it into a song. :) https://www.youtube.com/watch?v=f7ZDH45OAt8


At the time, I was doing Linux, OpenBSD and FreeBSD stuff in Bellingham. The reaction from the local and regional non-Microsoft community was really like "Holy shit what is going on down there?!"




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: