https://lwn.net/SubscriberLink/858023/1caabaef50d4946b/

LWN.net Logo LWN
.net News from the source LWN

  * Content
      + Weekly Edition
      + Archives
      + Search
      + Kernel
      + Security
      + Distributions
      + Events calendar
      + Unread comments
      + -------------------------------------------------------------
      + LWN FAQ
      + Write for us

User: [        ] Password: [        ] [Log in]
|
[Subscribe]
|
[Register]
Subscribe / Log in / New account

Auditing io_uring

[LWN subscriber-only content]

    Welcome to LWN.net

    The following subscription-only content has been made available
    to you by an LWN subscriber. Thousands of subscribers depend on
    LWN for the best news from the Linux and free software
    communities. If you enjoy this article, please consider
    subscribing to LWN. Thank you for visiting LWN.net!

By Jonathan Corbet
June 3, 2021
The io_uring subsystem, first introduced in 2019, has quickly become
the leading way to perform high-bandwidth, asynchronous I/O. It has
drawn the attention of many developers, including, more recently,
those who are focused more on security than performance. Now some
members of the security community are lamenting a perceived lack of
thought about security support in io_uring, and are trying to remedy
that shortcoming by adding audit and Linux security module support
there. That process is proving difficult, and has raised the prospect
of an unpleasant fallback solution.

The Linux audit mechanism allows the monitoring and logging of all
significant activity on the system. If somebody wants to know, for
example, who looked at a specific file, an audit-enabled system can
provide answers. This capability is required to obtain any of a
number of security certifications which, in turn, are crucial if one
wants to deploy Linux in certain types of security-conscious
settings. It is probably fair to say that a relatively small
percentage of Linux systems have auditing turned on, but
distributors, almost without exception, enable auditing in their
kernels.

The audit mechanism relies, in turn, on a large array of hooks
sprinkled throughout the kernel source. Whenever an event that may be
of interest occurs, it is reported via the appropriate hook to the
audit code. There, a set of rules loaded from user space controls
which events are reported to user space.

When io_uring was being developed (which is still happening now, of
course), the developers involved were deeply concerned about
performance and functionality. Supporting security features like
auditing was not at the top of their list, so they duly neglected to
add the needed hooks -- or to think about how auditing could be
supported in a way consistent with the performance goals. Now that
io_uring is showing up in more distributor kernels (and, in
particular, the sorts of kernels where auditing is relatively likely
to be enabled), security-oriented developers are starting to worry
about it. Having io_uring serve as a way to circumvent the otherwise
all-seeing audit eye does not seem like a good way to maintain those
security certifications.

Adding security support

In late May, Paul Moore (a maintainer of the audit subsystem) posted
a set of patches adding Linux security module (LSM) and audit
capabilities to io_uring. The LSM side is relatively straightforward;
the operations performed by io_uring are already covered by LSM
hooks, so all that was needed was a pair of new hooks to pass
judgment on io_uring-specific actions. Specifically, these hooks
control the sharing (between processes) of credentials that are
stored with the ring buffer that is used to communicate operation
requests to the kernel; see this patch for details. This part of the
patch set does not seem to be controversial.

The audit code is another story. The core io_uring code has been
carefully optimized to dispatch requests and their results as quickly
as possible. Use cases for io_uring can involve performing millions
of I/O operations per second, so any added overhead will prove most
unwelcome. Adding the audit hooks to cover operations submitted
through the ring slowed down one of the most performance-critical
parts of io_uring, leading Pavel Begunkov to react negatively: "So,
it adds two if's with memory loads (i.e. current->audit_context) per
request in one of the hottest functions here... No way, nack".

Begunkov suggested that perhaps the audit hooks could be patched in
at run time when they are actually enabled, the way tracepoints and
kprobes work. Moore responded that the audit subsystem doesn't
support that sort of patching, and that doing so could raise problems
of its own: "I fear it would run afoul of the various security
certifications". So that does not appear to be the route to a
possible solution.

Meanwhile, Jens Axboe, the io_uring maintainer, ran some tests. A
simple random-read test slowed down by nearly 5% with the audit hooks
installed, even in the absence of any actual audit rules. Various
other benchmarks, even when run with an updated version of the patch
set (which was not posted publicly), gave the same results. Kernel
developers can work for months for a 5% performance gain; losing that
amount to audit hooks is a bitter pill for them to swallow.

Axboe pointed out that read and write operations are not audited when
they are initiated through the older asynchronous I/O system calls. "
In the past two decades, I take it that hasn't been a concern?" He
agreed that some operations (such as opening or removing files)
should be audited, but said that auditing read and write operations
was "just utter noise and not useful at all". Since those operations
are the ones where performance matters the most, taking the audit
hooks out of the fast path for them might be a possible solution.

Moore suggested an approach based on that idea; only a specific,
carefully chosen set of operations would have the audit hooks
applied. There is a handy switch statement in the io_uring dispatcher
that makes it easy to instrument just the desired operations. He
asked for feedback, but has not gotten much so far. The important
question, as Begunkov pointed out, is which operations in particular
need audit support. Adding an audit call when opening a file is
unlikely to bother anybody; a call added to, say, a poll operation
would be another story. Moore has posted an initial set of operations
that he thinks merit auditing.

Threats and grumbles

With luck, that solution will prove acceptable to everybody. The
alternative to adding audit support to io_uring is, according to
Moore, not particularly pleasant:

    If we can't find a solution here we would need to make io_uring
    and audit mutually exclusive and I don't think that is in the
    best interests of the users, and would surely create a headache
    for the distros.

"Headache" is not really the word for it. If the two features are
made exclusive, then it will not be possible to configure a kernel
containing both of them. So distributors would have to either ship
two different kernels (something they will go far out of their way to
avoid) or pick one of the two features to support. Hopefully it will
not come to that.

Meanwhile, there has been some disgruntlement expressed by developers
on both sides, but the security developers have made it especially
clear that they would have liked to see audit designed into io_uring
from the beginning. As Casey Schaufler put it:

    It would have been real handy had the audit and LSM requirements
    been designed into io_uring. The performance implications could
    have been addressed up front. Instead it all has to be retrofit.

Richard Guy Briggs also complained that "multiple aspects of security
appear to have been a complete afterthought to this feature,
necessitating it to be bolted on after the fact". The implication in
both cases is that, with adequate forethought, the difficulties being
encountered now could have been avoided.

That is arguably a fair criticism. Kernel developers working on new
features often leave security as something to be thought about later;
that is especially true for relatively niche features like auditing,
which is unlikely to be enabled on development systems. The kernel
community can be a bit unfriendly toward its security developers,
characterizing them as prioritizing security above anything else (and
above performance in particular). Such an environment seems like a
recipe for leaving security concerns by the wayside, to be fixed up
later.

It is also fair to point out, though, that io_uring has been
developed in public since early 2019. It has been heavily discussed
on the mailing lists (and in LWN), but the security community did not
see that as their cue to make suggestions on how features like
auditing could be supported. It is a rare kernel developer who can
summon the focus to implement a million-operation-per-second I/O
subsystem while simultaneously making provisions for security hooks
that won't kill performance. Perhaps the io_uring developers should
have been considering security from the beginning, but they should
also have had help from the beginning.

The kernel community has surprisingly few rules regarding the
addition of new features like io_uring. In theory, new system calls
should come with manual pages, but it's somewhat surprising when that
actually happens. In a project with a more bureaucratic process, it
would make sense to insist that new features do not go in until they
have proper support for mechanisms like LSM and auditing. That might
force earlier interactions with security developers and avoid this
kind of problem.

That is not the world that we live in, though; there is nobody with a
checklist making sure that all of the relevant boxes have been marked
before a new subsystem can be merged. So the kernel community will
have to continue to muddle along, supporting the needed features as
best it can. This is not the last time that a security mechanism will
have to be retrofitted into an existing kernel feature. It's arguably
not the best approach, but it generally gets the job done in the end.

Index entries for this article
Kernel          Auditing
Security        io_uring


[Send a free link]


-----------------------------------------
(Log in to post comments)

Auditing io_uring

Posted Jun 3, 2021 19:13 UTC (Thu) by willy (subscriber, #9762) [Link
]

FWIW, this is the kind of thing that leads people to say things like
"The kernel is too slow at FOO; I did a prototype in userspace and it
was eleventy-bajillion percent faster!"

Having worked on one or two of these projects, my first response is
"What functionality did your prototype leave out to achieve that
speedup, are real users actually ok without those features, and can
we make those features optional in the current implementation?"

Yeah, audit support is slow. But it's also mandatory in some
environments.
[Reply to this comment]
Auditing io_uring

Posted Jun 3, 2021 20:46 UTC (Thu) by JanC_ (subscriber, #34940) [
Link]

But how many would need both hyper-detailed audit support and
hyper-fast I/O?
[Reply to this comment]
Auditing io_uring

Posted Jun 3, 2021 20:50 UTC (Thu) by Cyberax ( supporter , #52523)
[Link]

A database server with important information might want fast IO and
good security/auditing support.
[Reply to this comment]
Auditing io_uring

Posted Jun 3, 2021 22:05 UTC (Thu) by andresfreund (subscriber, #
69562) [Link]

Seems unlikely that kernel level audit support for individual read/
write operations would be useful for a database server. The audit
information wouldn't contain enough context (which client, how did
they authorize, what statement), and the audit operations would often
end up far from the user actions due to the database buffer pools.
Including IO potentially happening in different processes/threads
from the user action.
[Reply to this comment]
Auditing io_uring

Posted Jun 3, 2021 22:39 UTC (Thu) by roc (subscriber, #30627) [Link]

Seems to me that people should push harder on the code-patching
approach. The objections to it amount to "might be a problem". Well,
find out if it IS a problem.
[Reply to this comment]

                  Copyright (c) 2021, Eklektix, Inc.
   Comments and public postings are copyrighted by their creators.
          Linux is a registered trademark of Linus Torvalds