More

gregkh · on Feb 10, 2021

So you want us to just stop fixing bugs and pushing out those fixes to users? That feels risky, if you do not want to upgrade to solve known problems, that's fine, feel free to skip upgrades. But why would you want to prevent those who want to run secure systems that ability?

stonesweep · on Feb 10, 2021

The answer doesn't have to be so tetchy as your ire puts it, it's as simple as reducing releases to once a week or once every other week. I respect your work greatly, but you've taken the "throw the baby out with the bathwater" approach to your answer here; two Production LTS kernels a week seems excessive to me.

gregkh · on Feb 10, 2021

Why is it "excessive"? We are running 30+ fixes a day in these kernel releases, who would benefit if we delayed in getting those known-bug/security fixes out to the world quickly and properly tested (as we are currently doing)?

Why wait? What is a slower cadence going to accomplish?

plorkyeran · on Feb 11, 2021

If you only want to upgrade your kernel once a week, then you are free to do so regardless of how many releases they make during the week. Unless they've been introducing regressions by releasing too aggressively, there's no upside to releasing less often.

gregkh · on Feb 6, 2021

And what would that help solve?

glandium · on Feb 6, 2021

MITM injecting code on your page that visitors'browsers will happily execute. That's not hypothetical, some ISPs do it.

gregkh · on Feb 6, 2021

Time to get a new ISP? :)

gregkh · on Dec 23, 2020

Where in the current CI that we have today is lacking that needs to be improved? We always want more testing and testers, what is preventing everyone from helping with this?

dlgeek · on Dec 23, 2020

I'll bite: How can I help?

I'm a software engineer who's not involved in Linux Kernel Dev... but I've got a stack of old laptops that I'd be happy to set up to run automated CI if that'd be helpful.

Is there a webpage or doc somewhere I can look at?

(I'm not trying to snark - the fact that you're you and you're here asking for help is making me want to dip my toe in).

gregkh · on Dec 24, 2020

Simplest thing to do, just run Linus's latest releases (the -rc releases), or from his git tree, on your machine and report any problem.

Second-simplest thing to do is to run the linux-next branch/tree on your machines and report any build warnings and runtime issues you find. That's what will be the "next" kernel releases and is where all of the developer/maintainer trees are merged together before they are sent to Linus.

Both of those should be very easy to do, and any problems found there should be easy to fix and resolve before they get to a "real" release.

nitrogen · on Dec 23, 2020

I haven't been following kernel dev for years; what does the CI setup look like? Did the Phoronix Test Suite ever find its way into widespread use?

Back when I was building kernels for embedded hardware (Sheevaplug) in the 2.6.33 timeframe, I found a USB audio regression between 2.6.33.7 and later versions. If there were a semi-turnkey way to set up a testbench that could automatically reboot hardware in every new kernel, run through some basic tests, and report any deviation, I probably would have been more likely to do so. At the time I was working solo trying to release a polished consumer product (sadly though the product was released the business didn't work out) and didn't have time to dig into and report bugs.

gregkh · on Dec 24, 2020

We have so many different CI systems running on the kernel on a hourly basis.

We have the 0-day bot from Intel that runs so many things on all developer trees. We have kernelci running on many many different hardware platforms, and we have Linaro test systems also running on many different branches and hardware platforms.

If you want to tie your own hardware into the system, kernelci is the best place to start, I recommend looking into that.

thanks!

gregkh · on Dec 23, 2020

I get a vacation? Hah!

tuldia · on Dec 23, 2020

Every software has bugs, and is easier to criticize than to help. Don't focus on the negativity of HN and keep it up!

Thanks for your hard work, Greg!

gregkh · on Dec 23, 2020

If people don't report bugs, we don't know they are there as it "works for me!".

This isn't "negativity", this is people not understanding how the process works :)

And you're welcome!

Skunkleton · on Dec 23, 2020

These are some attitude goals for me. It's so easy to take things personally. Being able to take things constructively even when they might be personal is a great skill.

mike256 · on Dec 23, 2020

As every patch nowadays means that yet another symbol gets his gpl-only tag, no I don't report bugs...

muxator · on Dec 23, 2020

Thanks for your work, Greg!

nasir_hm · on Jan 3, 2021

Thank You very much for the hard work you do Greg, You're awesome :D

gruturo · on Dec 23, 2020

Thank you for your hard work Greg, it is very much appreciated.

Merry Christmas/Isaac Newton's Birthday!

fatboy93 · on Dec 23, 2020

You should a small vacation sometime ;)

Bugs can always be fixed, mental health can't.

Thanks for all the awesome work Greg!

esgwpl · on Dec 23, 2020

Sorry if my post sounds like a jab, it wasn't meant to be, thanks for your work and Merry Christmas.

mr_sturd · on Dec 23, 2020

Merry Christmas, Greg!

malikolivier · on Dec 23, 2020

Merry Christmas, Greg!

Thanks for all the hard work!

gregkh · on Dec 14, 2020

Everyone gets older, the alternative isn't as attractive :)

Seriously, the kernel averages about 200-250 new contributors every release (i.e. every 2 1/2 months). We are not starved for new contributors at the moment at all, do you think we are somehow not attracting new developers compared to other open source projects?

yudlejoza · on Dec 14, 2020

Hi Greg! :)

I was mainly referring to something I read many years ago (regarding new kernel devs), e.g., this from 2013 [0]. However, from your response, looks like that's not a problem.

[0] https://www.zdnet.com/article/graying-linux-developers-look-...

sargun · on Dec 14, 2020

The media has talked about how the kernel development community has been slow to get new blood: https://forums.theregister.com/forum/all/2020/08/25/linux_ke...

I’m not sure that _this_ is the (a?) problem, but if someone were purely sourcing CNCF / Linux foundation / press releases they might think the project is heading for a day when the old guard keels over and we’re left high and dry.

gregkh · on June 24, 2020

"regular stable kernel" lives only about 3-4 months, just long enough for the next release from Linus to feel "good enough".

"longterm kernel" lives for 2+ years. I pick one each year (usually the last one released in a year) for that.

See the releases page on kernel.org for details on what the longterm kernels are, and for how long they are being maintained and by whom.

gregkh · on June 24, 2020

That "partially-ABI stable" is the same exact thing that Red Hat and SUSE and Debian have been doing for 20+ years now. Nothing major and exciting there, but see the presentations at the Linux Plumbers conferences for details on the tools being used if people are curious (hint this time everyone is working together on the same set of tools...)

gregkh · on June 23, 2020

And what would that "middle ground" look like?

With the current rate of change that the kernel community develops at, including the patches backported to the stable/longterm kernels, it's impossible to try to evaluate each and every patch for "is this something that could be exploited or not?"

Companies have tried, it was fun watching them, but they quickly gave up and declared it impossible and much safer to just take all stable patch updates instead.

I've also talked to MITRE about just applying for a CVE for ever stable kernel patch (20+ a day), and while they appreciated me not doing that, they agreed that the current model of CVEs just does not work at all for the Linux kernel and that what we are doing is fine.

See my Kernel Recipes talk last year for details about all of that if you are curious.

despera · on June 23, 2020

I understand that completely and i already know your thoughts on that and in some extend i do agree.

Still, i think we do have in hand a very characteristic issue that even without knowing the details, simply by searching commit messages for "crypto" "key" "buffer" etc it should alert somebody to give it a second and third look.

eqvinox · on June 23, 2020

Hell no. You can't just "guess by words."

The only thing (far!) worse than no impact marking is incomplete impact marking. That's just giving people a way to cop out, and they WILL use it.

despera · on June 23, 2020

How is no marking better than some marking?

If there is a commit that refers to a "memory leak" why shouldn't be, at least superfluously, checked, identified and have distros informed? (e.g 2ca068be09bf8e285036603823696140026dcbe7)

If the crypto fix was assigned early as a vulnerability would have stayed unpatched for that long?

eqvinox · on June 23, 2020

> How is no marking better than some marking?

With no marking it is clear what it means: commits have not been audited to identify security-relevant ones.

With partial, incomplete marking, unmarked commits can be one of two things: commits that have not been looked at, and commits that have been looked at and are believed to contain no security relevant changes.

The majority of commits will be in the "not looked at" category. And there's enough people around to have a significant subset of them be lazy, ignorant, unskilled or stupid and take that as "contains no security relevant changes."

P.S.: also, patches are already marked. By being included in the LTS series. Because that means they were important enough to get a backport — though not necessarily due to security impact.

despera · on June 24, 2020

I do agree with the premises, i don't agree with your conclusion.

Yes only a part of patches would be marked as such. That, major or minor, part would simply mean that people won't have to reinvent the particular wheel, as happened in this case. People won't be missing critical _discovered_ changes, the vulnerability would be discussed, recognized in its totality (PoC, documentation etc) and proper patches will be offered. There have been cases where LTS backports were old revisions of bad patches.

I think that baking LTS kernels is unnecessarily closer to an artistic approach of doing things.

gregkh · on June 23, 2020

They feel like they know better and do not want all of the fixes that the LTS kernels provide for some crazy reason.

I suggest you contact them if you rely on a RHEL kernel to ask them why they do this, it's always seemed crazy.

Note, I'm the person who does the LTS kernel releases, maybe they just don't like me :)

rwmj · on June 23, 2020

I'm certain that Red Hat and our kernel developers have no animosity towards you, in fact it's completely the opposite. You're well known in the community not just for this but for maintaining and writing countless drivers and loads of other great work in the kernel.

Fedora does use the LTS kernels.

iforgotpassword · on June 23, 2020

> They feel like they know better and do not want all of the fixes that the LTS kernels provide for some crazy reason.

It's even crazier; they sometimes backport changes to their kernel that the LTS kernels don't get. We use a custom kernel module that contains a bunch of #if #endif blocks that check the kernel version for stuff that changed. Doesn't work on RedHat since you actually need the branch that's for more recent kernels in some places.

orf · on June 23, 2020

Isn’t the whole autoconf stuff supposed to avoid the need for if macro soup, by using feature detection?

iforgotpassword · on June 23, 2020

There would be cleaner ways to achieve this, maybe not specifically autoconf since I think that's more tailored towards "normal" (user space) stuff.

Macros are convenient to quickly check the version in your code without adding another layer of tooling... Until you end up with said macro soup of course.

It's actually a legacy module we're about to phase out for 5.x so don't worry too much. The new and shiny replacement will probably use git branches for whenever something changes.

stonogo · on June 23, 2020

It could also be motivated by the fact that it's not entirely wise to base an entire multibillion dollar business around whatever text a non-employee happens to push to a repo you don't control.

gregkh · on June 23, 2020

Android doesn't seem to mind, they require the LTS updates to be taken for their devices (well, "require" is a strong word, they are pushing harder now than they were in the past, "required" will be happening in the future, hopefully...)

As the number of systems running RHEL is really just a rounding error compared to the number of Android systems out there, maybe it doesn't really matter :)

bonzini · on June 23, 2020

Android and RHEL are completely different scenarios.

The target of Android, which is who Google has to deal with, is multiple manufacturers creating kernels for custom hardware (often without upstream drivers), with very short product life, relatively little experience with upstream contribution and few needs for new features for a given major release of Android.

RHEL is developed by a single company with a 10 years life cycle, only 3-4 kernel versions to juggle but almost non-overlapping lifecycle (as far as the initial development-heavy phase is concerned). Development occurs upstream first and quite a few engineers are upstream developers or maintainers, so that the number of non-upstream features is very small and almost going down over time, see for example stuff like the secure boot lockdown patches that Matthew Garrett started when he was at Red Hat. And even though the product is not the kernel, we need to backport more features than what goes into LTS, because userspace needs them (user namespaces, driver updates, networking or virtualization optimizations, enablement for new processors, etc.)

So it's only natural that there are completely trade-offs to make.

azinman2 · on June 23, 2020

I would guess that RHEL has more stringent requirements than your average me-too Android manufacturer.

radiator · on June 23, 2020

But do the devices running Android generally operate with the same requirements as the servers running RHEL?

nordsieck · on June 23, 2020

> But do the devices running Android generally operate with the same requirements as the servers running RHEL?

I guess it depends on the device and the business.

How many days of downtime on Amazon.com is the cost of bricking 1M phones? Or 100M?

stonogo · on June 23, 2020

Yes, that's why all the servers in the datacenter are running Android, which has a stellar security record.

This sort of myopia is yet another excellent reason for Red Hat to take their kernel process in-house.

phh · on June 24, 2020

LTS' security record is definitely not to be proven. The example we're currently commenting on this thread is only one occurence. It is highly re-occuring.

RHEL fixes only CVEs. Linus Torvalds consider there is no such thing as a security bug, that actually every bug is a security bug. So RHEL's kernel can't be secure.

zozbot234 · on June 23, 2020

Debian also uses LTS-series kernels for their stable release - with their own patches on top. They don't actively backport features like RHEL does however.

baggy_trough · on June 23, 2020

Why doesn't that argument apply to the mainline kernel?

gregkh · on May 31, 2020

Citation needed :)

Seriously, all software has bugs, how do you judge the quality of an operating system that is not finished?

pjmlp · on May 31, 2020

Does Google itself help?

https://events19.linuxfoundation.org/wp-content/uploads/2017...

https://outflux.net/slides/2019/lss/kspp.pdf

Slides from 2018 and 2019 talks.

HN For You