6.S081 2020 Lecture 18: Operating System Organization, Microkernels
Topic:
What should a kernel do?
What should its abstractions / system calls look like?
Answers depend on the application, and on programmer taste!
There is no single best answer
This topic is more about ideas and less about specific mechanisms
The traditional approach
1) powerful abstractions, and
2) a "monolithic" kernel implementation
UNIX, Linux, xv6
The philosophy behind traditional kernels is powerful abstractions:
portable interfaces
files, not disk controller registers
address spaces, not MMU access
simple interfaces that hide complexity
all I/O via FDs and read/write, not specialized for each device &c
address spaces with transparent disk paging
abstractions help the kernel manage and share resources
process abstraction lets kernel be in charge of scheduling
file/directory abstraction lets kernel be in charge of disk layout
abstractions help the kernel enforce security
file permissions
processes with private address spaces
lots of indirection
e.g. FDs, virtual addresses, file names, PIDs
helps kernel virtualize, hide, revoke, schedule, &c
Powerful abstractions have led to big "monolithic" kernels
kernel is one big program, like xv6
easy for kernel sub-systems to cooperate -- no irritating boundaries
exec() and mmap() are part of both FS and VM system
relatively easy to add sym links, COW fork, mmap, &c
all kernel code runs with high privilege -- no internal security restrictions
What's wrong with traditional kernels?
big => complex => buggy/insecure
perhaps over-general and thus slow
how much code executes to send one byte via a UNIX pipe?
buffering, locks, sleep/wakeup, scheduler
many design decisions are baked in, can't be changed, may be awkward
maybe I want to wait for a process that's not my child
maybe I want to change another process's address space
maybe DB is better at laying out B-Tree files on disk than kernel FS
hard to create kernel "extensions" that others can use
new device drivers, file systems, &c
Microkernels -- a different approach
big idea: move most O/S functionality to user-space service processes
[diagram: h/w, kernel, services (FS disk VM TCP NIC display), apps]
kernel can be small
address spaces, threads, IPC (inter-process communication)
IPC lets threads send each other messages
1980s saw big burst of research on microkernel designs
CMU's Mach perhaps the most influential
used today in embedded systems, phone chips, car entertainment
ideas (esp. user-level servers and IPC) were influential, e.g. in Windows and MacOS
Why the interest in microkernels?
focused, elegant, clean slate
small -> more security -- less code means fewer bugs to exploit
small -> verifiable (see seL4)
small -> easier to optimize
you don't have to pay for features you don't use
small -> avoid forcing design decisions on applications
user-level -> may encourage modularity of O/S services
user-level -> easier to extend / customize / replace user-level services
user-level -> more robust -- restart individual user-level services
most bugs are in drivers, get them out of the kernel!
can run/emulate multiple O/Ses, like a VMM
Microkernel challenges
What's a minimum kernel API?
Need simple primitives on which to build exec, fork, mmap, &c
Need to build the rest of the O/S at user level
How to get good performance, despite IPC and less integration?
L4
has evolved over time, many versions and re-implementations
used commercially today, in phones and embedded controllers
representative of the micro-kernel approach
emphasis on minimality:
7 system calls (Linux has 300+, xv6 has 21)
13,000 lines of code
L4 basic abstractions
[diagram]
address space ("task")
thread
IPC
L4 system calls:
create an address space
create/destroy a thread in [another] address space
send/recv message via IPC (addresses are thread IDs)
map pages of your memory into another address space
the receiving task must agree
this happens via IPC -- one task can modify another task's page table
used to create new tasks, share memory
intercept another address space's page faults -- "pager"
kernel delivers via IPC
access device hardware (not a system call, happens directly)
handle device interrupts
kernel delivers via IPC
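A rough sketch of that interface in C -- hypothetical names and types, not the real L4 ABI:

    // Hypothetical declarations, just to make the shape of the API concrete.
    typedef unsigned long l4_threadid_t;

    struct l4_msg {
      unsigned long w[8];                   // short messages fit in registers
    };

    int task_create(l4_threadid_t pager);                   // new address space
    int thread_create(int task, void (*pc)(void), void *sp);// thread in [another] task
    int ipc_send(l4_threadid_t dst, struct l4_msg *m);      // dst is a thread ID
    int ipc_recv(l4_threadid_t *src, struct l4_msg *m);
    int page_map(int dst_task, void *va, unsigned len);     // grant pages; dst must agree, via IPC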
Note L4 kernel is missing almost everything that Linux or even xv6 has
file system, fork(), exec(), pipes, device drivers, network stack, &c
If you want these, they have to be user-level code
library or server process
how does L4 thread switching work?
current user-level thread can yield for 3 reasons:
IPC system call waits
timer interrupt
yield() system call
L4 kernel saves user thread registers,
picks a RUNNABLE thread to run,
restores user registers,
switches page table,
jumps to user space
no surprises here
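In outline, the kernel's switch path might look like this (illustrative C only; the helper functions here are made up, not L4 source):

    // Hypothetical: run when the current thread yields (IPC wait, timer, or yield()).
    void yield_current(struct trapframe *tf)
    {
      save_user_regs(current_thread, tf);     // stash the user registers
      struct thread *next = pick_runnable();  // choose another RUNNABLE thread
      switch_page_table(next->task);          // different task => different address space
      restore_user_regs(next, tf);
      return_to_user(tf);                     // jump back out to user space
    }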
how do L4 external pagers work?
every task has a pager task
1. page fault
2. kernel suspends thread
3. kernel sends fault info in IPC to pager
4. pager picks one of its own pages
5. pager sends virtual page address in IPC reply to faulting thread
6. kernel intercepts the IPC, maps the page into the target, resumes the target
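A pager is just an ordinary task sitting in an IPC loop. Roughly, reusing the hypothetical ipc_recv() wrapper sketched earlier (pick_own_page(), fill_page(), and reply_with_mapping() are also made up):

    for (;;) {
      l4_threadid_t faulter;
      struct l4_msg m;
      ipc_recv(&faulter, &m);            // kernel's fault IPC: faulting VA, access type
      void *pg = pick_own_page();        // one of the pager's own pages
      fill_page(pg, &m);                 // e.g. zero it, or read it from "disk"
      reply_with_mapping(faulter, pg);   // kernel maps pg into the faulter and resumes it
    }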
what can you use an L4 pager for?
allocating memory -- "sigma0" allocates on fault for early tasks
copy-on-write fork
coupled with a system call that revokes access
mmap of file
problem: IPC performance
Microkernel programs do lots of IPC!
Was expensive in early systems
multiple kernel crossings, TLB misses, context switches, &c
Cost of IPC caused many to dismiss microkernels
L4 designers put huge effort into IPC performance
Here's a slow IPC design
patterned on UNIX pipes
[diagram, message queue in kernel]
send(id, msg)
append msg to queue in kernel, return
recv(&id, &data)
if msg waiting in queue, remove, return
otherwise sleep()
called "asynchronous" and "buffered"
now the usual request-response pattern (RPC) involves:
[diagram: 2nd message queue for replies]
4 system calls (user->kernel->user)
send() -> recv()
recv() <- send()
each may disturb CPU's caches (TLB, data, instruction)
four message copies (two for request, two for reply)
two context switches, two general-purpose schedulings
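Here is a toy user-level model of that buffered, asynchronous design, with pthreads standing in for the kernel's locks and sleep/wakeup (not how a real kernel is written, but the moving parts are the same):

    #include <pthread.h>
    #include <string.h>

    #define QSIZE 16

    struct msg { int from; char data[64]; };

    static struct msg q[QSIZE];
    static int head, tail, count;
    static pthread_mutex_t mu = PTHREAD_MUTEX_INITIALIZER;
    static pthread_cond_t nonempty = PTHREAD_COND_INITIALIZER;

    // append msg to queue, return (a real kernel would also handle a full queue)
    void send(int from, const char *data)
    {
      pthread_mutex_lock(&mu);
      q[tail].from = from;
      strncpy(q[tail].data, data, sizeof q[tail].data - 1);
      q[tail].data[sizeof q[tail].data - 1] = '\0';
      tail = (tail + 1) % QSIZE;
      count++;
      pthread_cond_signal(&nonempty);           // wakeup()
      pthread_mutex_unlock(&mu);
    }

    // remove msg if one is waiting, else sleep; data must have room for 64 bytes
    void recv(int *from, char *data)
    {
      pthread_mutex_lock(&mu);
      while (count == 0)
        pthread_cond_wait(&nonempty, &mu);      // sleep()
      *from = q[head].from;
      strcpy(data, q[head].data);
      head = (head + 1) % QSIZE;
      count--;
      pthread_mutex_unlock(&mu);
    }

Note how much machinery even the toy needs: a queue, a lock, sleep/wakeup, plus two copies per message (sender's buffer -> queue, queue -> receiver's buffer).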
L4's fast IPC
"Improving IPC by Kernel Design," Jochen Liedtke, 1993
* synchronous
[diagram]
send() waits for target thread's recv()
common case: target is already waiting in recv()
send() jumps into target's user space, as if returning from recv()
no real context switch, no scheduler loop
* unbuffered
no queue in kernel
since synchronous, kernel can copy directly between user buffers
* small messages in registers
kernel send() path does not disturb many of the registers
e.g., no context switch
no copying required for small messages
since send() jumps into the target's user space with the message still in the registers
* huge messages as virtual memory grants
again, no copy required, though kernel send() code must change page table
* combined call() and sendrecv() system calls
[diagram]
IPC almost always used as request-response RPC
thus wasteful to use separate send() and recv() system calls
client: call(): send a message, wait for response
server: sendrecv(): reply to one request, wait for the next one
2x reduction in user/kernel crossings
* careful layout of kernel code to minimize cache footprint
result: 20x reduction in IPC cost
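Putting the pieces together, request/response code ends up with this shape, using hypothetical call() and sendrecv() wrappers and the struct l4_msg from the earlier sketch (not the real L4 function names):

    // client: one kernel entry per RPC
    struct l4_msg req = {{0}}, resp;
    call(server_tid, &req, &resp);             // send request, wait for the reply

    // server: after the initial recv(), one kernel entry per request
    l4_threadid_t client;
    struct l4_msg in, out;
    recv(&client, &in);                        // wait for the first request
    for (;;) {
      handle(&in, &out);                       // do the work (made-up helper)
      sendrecv(client, &out, &client, &in);    // reply, then wait for the next request
    }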
How to build a full operating system on a microkernel?
Remember the idea was to move most features into user-level servers.
File system, device drivers, network stack, process control, &c
For embedded systems this can be fairly simple.
What about services for general-purpose use, e.g. workstations, web servers?
Really need compatibility for existing applications.
E.g. the system needs to mimic something like UNIX.
Re-implement UNIX kernel services as lots of user-level services?
Or: run existing Linux kernel as a process on top of the microkernel.
An "O/S server".
Perhaps not elegant, but pragmatic.
Part of a path to adoption:
Users might start by just running Linux apps.
Then gradually exploit possibilities of the underlying microkernel.
Which brings us to today's paper:
"The Performance of micro-Kernel-Based Systems",
by Hartig et al, 1997
basic picture
[diagram]
L4 kernel
Linux kernel server
one L4 task per Linux process
IPC for system calls
What does it mean to run a Linux kernel at user-level?
The Linux kernel is just a program!
The authors modified Linux in a number of ways,
replacing hardware access with L4 system calls or IPC.
Process creation, configuring user page tables, memory allocation,
system call handling, interrupt handling.
L4/Linux's use of threads
Each Linux process has one or more L4 threads for its user code
Linux server has just one L4 thread (plus L4 threads waiting for interrupts)
At rest it is waiting for IPCs with system calls
Linux server switches its own L4 thread among kernel threads for its processes
When e.g. file system code sleep()s waiting for disk read
Or pipe read() sleep()s waiting for someone to write the pipe
Much as xv6 switches among kernel threads.
But an L4/Linux kernel thread switch has
no relation to user process switching
Instead, L4 separately switches among runnable L4 threads that
implement the Linux processes
So Linux kernel server can be running a kernel thread for process P1,
while L4 is running process P2 on another core
Why not use L4 threads to implement Linux server's kernel threads?
Because that would cause pain without any benefit.
Would introduce parallelism inside Linux.
But Linux 2.0 did not have SMP support -- e.g. no spinlocks.
And their hardware had only one core, so could be no parallel speedup anyway.
Drawback: L4 is in charge of scheduling user threads
So L4/Linux couldn't enforce Linux's notions of priority &c
L4/Linux server maps all user memory into its address space
(really, it allocates lots of memory, then gives its own memory to user
processes)
uses this for copyin()/copyout(), to dereference user pointers from sys calls
this keeps system call IPCs small -- data address, not the data itself
Linux server also uses its memory access for fork() and exec()
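A sketch of what copyin() can then look like inside the Linux server (struct lx_process and its fields are made up; the point is just that a user pointer turns into address arithmetic plus memcpy):

    // p->mem_base is where all of p's memory appears in the server's own address space.
    int copyin(struct lx_process *p, void *dst, unsigned long user_va, size_t n)
    {
      if (user_va < p->user_start || user_va + n > p->user_end)
        return -1;                                           // bad user pointer
      char *src = p->mem_base + (user_va - p->user_start);   // server-side alias of user memory
      memcpy(dst, src, n);
      return 0;
    }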
Example: how does fork() work?
process P1 calls fork() (P1 is really an L4 task)
P1's libc library turns fork() into an IPC to L4/Linux server
L4/Linux asks L4 to create a new task and thread -- P2
L4/Linux allocates memory pages (as many as P1 has)
L4/Linux uses IPC to tell L4 to map pages into P2
L4/Linux copies data from P1's pages to P2's pages
L4/Linux sends special IPC to P2 with SP and PC to cause it to run
L4/Linux sends reply to P1 via IPC
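In code form, the server's fork() handler might look roughly like this (all of the helpers, fields, and L4 wrappers here are hypothetical; it just restates the steps above):

    void handle_fork(struct lx_process *p1, struct l4_msg *reply)
    {
      struct lx_process *p2 = alloc_process();
      p2->task = task_create(linux_server_tid);          // new L4 task + thread for P2
      p2->mem_base = alloc_pages(p1->npages);            // server-owned memory for P2
      for (int i = 0; i < p1->npages; i++) {
        unsigned long uva = p1->user_start + i * PGSIZE;
        page_map(p2->task, (void *)uva, PGSIZE);         // IPC asking L4 to map this page into P2
        memcpy(p2->mem_base + i * PGSIZE,                // copy P1's memory to P2's
               p1->mem_base + i * PGSIZE, PGSIZE);
      }
      start_thread(p2, p1->saved_pc, p1->saved_sp);      // special IPC: set SP/PC, start P2
      reply->w[0] = p2->pid;                             // fork()'s return value, sent back to P1
    }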
L4/Linux server acts as the pager for user processes
so L4 turns process page faults into IPC to Linux server
for e.g. copy-on-write fork, lazy allocation, memory mapped files
Drawback: L4 doesn't allow direct control over page tables
so the Linux server can't map user virtual addresses into its own page table
until recently native Linux used that trick to avoid page-table switches on system calls,
and for convenience in dereferencing syscall arguments
L4/Linux server uses Linux device drivers unchanged!
since L4 allows it direct access to device registers
except interrupts arrive via L4 IPC
How to evaluate?
What are some questions that the paper might answer?
It's not really about whether microkernels are a good idea.
Its main goal is to show that they can have good performance.
What kind of performance do we care about?
Is IPC fast?
-> microbenchmark
Is there some other performance obstacle?
-> whole-system benchmarks
IPC microbenchmarks
Table 2
getpid() is one system call on native Linux
and two L4 system calls (IPC send, IPC recv) on L4/Linux
nice result: takes only somewhat more than 2x as long on L4/Linux
and FAR faster than Mach+LinuxServer
What do we think the impact of syscalls taking 2x as long might be?
Disaster?
Hardly noticeable?
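A back-of-the-envelope way to think about it: if an application spends, say, 10% of its time in system calls, doubling system-call cost adds only about 10% to its total run time; a syscall-heavy program that spends 80% of its time in the kernel would slow down far more. So the answer depends on the workload -- hence the whole-system benchmarks below.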
Whole-system benchmark: AIM
AIM forks a bunch of processes
Each randomly uses the disk, allocates memory, uses pipe, computes, &c
To do a fixed amount of total work
Figure 8 x-axis shows [some function of] number of concurrent AIM processes
y-axis shows time for all processes to complete
Only the slope really matters
slope is time per unit of work, so lower is better
Native Linux is best, but L4Linux is only a little slower
Mach+Linux is noticeably less efficient
Conclusions:
2x IPC time doesn't seem to make much overall difference
L4+Linux is only somewhat slower than Linux
L4+Linux is significantly faster than Mach+Linux
These results are not by themselves an argument for using L4
But they are an argument against rejecting L4 due to performance worries
What's the current situation?
Microkernels are sometimes used for embedded computing
Microcontrollers, Apple "enclave" processor
Running custom software
Microkernels, as such, never caught on for general computing
No compelling story for why one should switch from Linux &c
Many ideas from microkernel research have been adopted into modern UNIXes
Mach spurred adoption of sophisticated virtual memory support
Virtual machines are partially a response to the O/S server idea
Loadable kernel modules are a response to the need for extensibility
Client/server e.g. DNS server, window server
MacOS has microkernel-style IPC
References:
The Fiasco.OC Microkernel -- a current L4 descendant
https://l4re.org/doc/
fast IPC in L4
https://cs.nyu.edu/~mwalfish/classes/15fa/ref/liedtke93improving.pdf
later evolution of L4
https://ts.data61.csiro.au/publications/nicta_full_text/8988.pdf