Project 2: Non-Preemptive Kernel

Overview

With the bootup mechanism from the first project, we can start developing an operating system kernel. The main goal of this project is to extend the provided kernel to support multiprogramming. Although the kernel will be very primitive, you will be dealing with several core mechanisms and applying techniques that are important for building more advanced, multiprogrammed operating systems in the future.

The kernel of this project and future projects supports both processes and threads. Note that the processes in this project are different from those in traditional operating systems (like Unix). Our processes have their own protection domains, but they share a flat address space. In the future, we will provide processes with separate address spaces. Our threads are kernel-level threads that always share the same address space with the kernel; they will never run in user mode. The kernel threads are linked together with the kernel, so they can access kernel data and functions directly. Processes, on the other hand, use the single entry-point mechanism described in lecture.

Students have traditionally found this project to be significantly more complex than the first, so please begin now and work at a steady pace.

Scott Erickson (scottme@cs.princeton.edu) is the TA in charge of this project.

Design Review

Please cover these points at your design review; in particular, be prepared to explain how execution unfolds when the following four threads are run:

Thread 1:
    lock_init(&lock);
    lock_acquire(&lock);
    do_yield();
    lock_release(&lock);
    do_exit();

Thread 2:
    while (1) {
        do_yield();
    }

Thread 3:
    do_yield();
    lock_acquire(&lock);
    lock_release(&lock);
    do_exit();

Thread 4:
    lock_acquire(&lock);
    lock_release(&lock);
    do_exit();

Use the signup sheet to sign up for a design review time.

Getting Started

Use the start code which you can get from Blackboard or the fishbowl file server. Compiling this project is tricky, so we have provided the Makefile. If you type make (or use the all target) it will compile all sources and prepare an image, floppy.img, for use with bochs. We've also supplied a configuration file, bochsrc, that instructs bochs to use this image as a floppy. A make target, boot, is provided in the Makefile to load the image onto your USB stick if you'd like to test on a real machine, but make sure it is cat'ing to the correct path before using.

Kernel Threads and Scheduling

The kernel proper consists of a collection of kernel threads. These threads run in the most privileged protection ring, and their code is part of the kernel loaded by the boot block. Kernel threads share a flat address space, but they do not share register values or stacks.

After the boot block has loaded the kernel, it calls the function kernel.c:_start(). This function performs the setup necessary to run the tasks listed in tasks.c by initializing process control blocks (PCBs) and allocating a kernel stack for each task, plus a user stack for processes. You are encouraged to allocate space for the PCBs statically with pcb_t pcbs[NUM_TASKS]. Make sure to loop on NUM_TASKS when setting up PCBs as the number of tasks may be different during testing.

Note that in this project, processes need twice as much context information as threads. It's OK to use the same PCB structure for both processes and threads, and just waste space on threads by letting the user context information go unused. Stacks are STACK_SIZE bytes and are allocated contiguously at increasing addresses starting with STACK_MIN. Use the macro common.h:ASSERT([assertion]) to enforce the invariant that all stacks lie in the interval STACK_MIN exclusive to STACK_MAX inclusive.
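The PCB and stack setup can be sketched in user-space C. The constants and the pcb_t fields below are illustrative stand-ins (the real values and types come from kernel.h and common.h in the start code); only the layout invariant is the point:

```c
#include <assert.h>
#include <stdint.h>

/* Illustrative values; the real constants come from the start code. */
#define NUM_TASKS   4
#define STACK_SIZE  0x1000
#define STACK_MIN   0x10000
#define STACK_MAX   0x20000
#define ASSERT(x)   assert(x)   /* stand-in for common.h:ASSERT */

typedef struct pcb {
    uint32_t kernel_stack;  /* initial kernel stack pointer (top of stack) */
    uint32_t user_stack;    /* unused for threads */
} pcb_t;

static pcb_t pcbs[NUM_TASKS];

/* Hand out stacks contiguously from STACK_MIN upward.  x86 stacks grow
 * downward, so the saved pointer is the high end of each region. */
static uint32_t next_stack_base = STACK_MIN;

static uint32_t alloc_stack(void)
{
    uint32_t top = next_stack_base + STACK_SIZE;
    /* Every stack must lie in (STACK_MIN, STACK_MAX]. */
    ASSERT(top > STACK_MIN && top <= STACK_MAX);
    next_stack_base = top;
    return top;
}

/* Loop on NUM_TASKS so the setup still works if the task count changes. */
void init_pcbs(void)
{
    for (int i = 0; i < NUM_TASKS; i++) {
        pcbs[i].kernel_stack = alloc_stack();
        pcbs[i].user_stack   = 0;   /* only filled in for processes */
    }
}
```

The ASSERT fires as soon as an allocation would spill past STACK_MAX, which catches off-by-one errors in the stack layout early.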

Since you are writing a non-preemptive scheduler, kernel threads run without interruption until they yield or exit by calling scheduler.c:do_yield() or scheduler.c:do_exit(). These functions in turn call entry.S:scheduler_entry(), which saves the contents of the general purpose and flags registers in the PCB of the task that was just running. scheduler.c:scheduler() is then called to choose the next task, restore the next task's registers, and return to it. scheduler.c:scheduler() chooses the next task to run in a strict round-robin fashion, and in the first round, the tasks are run in the order in which they are specified in tasks.c.
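The round-robin selection policy can be modeled in plain C. The pcb_t fields and the pick_next() name here are illustrative, not the start code's actual interface; only the strict rotation order matters:

```c
#include <assert.h>
#include <stddef.h>

#define NUM_TASKS 3

/* Hypothetical per-task state; the real PCB lives in the start code. */
typedef enum { READY, EXITED } status_t;

typedef struct pcb {
    int id;
    status_t status;
} pcb_t;

static pcb_t pcbs[NUM_TASKS];
static int current = -1;     /* so the first round runs tasks in array order */
int scheduler_count = 0;     /* bumped on every scheduling decision */

/* Pick the next READY task after the current one, wrapping around.
 * Exited tasks are skipped but keep their slot, preserving the rotation. */
static pcb_t *pick_next(void)
{
    scheduler_count++;
    for (int i = 1; i <= NUM_TASKS; i++) {
        int cand = (current + i) % NUM_TASKS;
        if (pcbs[cand].status == READY) {
            current = cand;
            return &pcbs[cand];
        }
    }
    return NULL;  /* no runnable task left */
}
```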

scheduler.c:scheduler() increments the global variable scheduler.c:scheduler_count every time it runs. Do not change this.

Processes and System Calls

In future projects, processes will have their own protected address spaces and run in a less privileged protection ring than the kernel. For now, however, there are only slight differences between threads and processes. Unlike threads, processes are allocated two stacks: one user stack, for the process proper, and one kernel stack, for system calls made by the process. Moreover, process code is not linked with the kernel, although for this project it is part of the image loaded by the boot block.

There are two system calls for processes: syslib.S:yield() and syslib.S:exit(). These correspond in principle to the do_yield() and do_exit() calls that threads use. Processes cannot invoke do_yield() and do_exit() directly, however, because they don't know the addresses of these functions, and in later projects they won't have the requisite privileges to modify the PCBs.

This project uses a dispatch mechanism to overcome the difficulty of processes not knowing the addresses of do_yield() and do_exit(). kernel.h defines a location in memory called ENTRY_POINT, at which _start() writes the address of the function label kernel_entry (in entry.S) at run time. When a process calls yield() or exit() (in syslib.S), SYSCALL(i) (in the same file) is invoked with an argument indicating which system call the caller wants (yield or exit), similar to how a number was stored in %ah before initiating a BIOS interrupt. SYSCALL(i) in turn calls kernel_entry, whose address was saved at ENTRY_POINT. kernel_entry saves the process's context to its PCB and calls kernel_entry_helper(), which then calls the appropriate function that was previously "unreachable" from the process: do_yield() or do_exit(). When control returns to kernel_entry the next time the process is scheduled, it must restore the process's context.
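The dispatch chain can be modeled in user-space C, with the register save/restore reduced to comments. The entry_point variable, start_setup(), and invoke_syscall() are illustrative stand-ins for the ENTRY_POINT memory slot, _start(), and syslib.S:SYSCALL(i); only the indirection pattern is the point:

```c
#include <assert.h>
#include <stddef.h>

#define SYSCALL_YIELD 0
#define SYSCALL_EXIT  1

static int yields, exits;   /* counters so the sketch is observable */

static void do_yield(void) { yields++; }
static void do_exit(void)  { exits++; }

/* kernel_entry_helper dispatches on the syscall number the stub passed in. */
static void kernel_entry_helper(int fn)
{
    if (fn == SYSCALL_YIELD)
        do_yield();
    else if (fn == SYSCALL_EXIT)
        do_exit();
}

static void kernel_entry(int fn)
{
    /* Real code: save the process's registers to its PCB here. */
    kernel_entry_helper(fn);
    /* Real code: restore the process's registers before returning. */
}

/* Stand-in for the ENTRY_POINT slot in memory. */
static void (*entry_point)(int) = NULL;

/* What _start() does: publish the kernel's entry address. */
void start_setup(void) { entry_point = kernel_entry; }

/* What syslib.S:SYSCALL(i) does in spirit: call through the saved pointer. */
static void invoke_syscall(int i) { entry_point(i); }
```

The process side never names do_yield() or do_exit(); it only knows the fixed address where the kernel published its entry point.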

Mutual Exclusion

lock.c contains three functions for use by kernel threads: lock_init(l), lock_acquire(l), and lock_release(l). lock_init(l) initializes lock l; initially no thread holds the specified lock. If an uninitialized lock structure is passed to the other functions, the results are up to you. lock_acquire(l) must not block the calling thread when no other thread is holding l.

Threads call lock_acquire(l) to acquire lock l. If no thread currently holds l, it marks the lock as being held and returns. Otherwise it uses scheduler.c:block() to indicate to the scheduler that the thread is now waiting for the lock and should be put on the blocked queue. If there are one or more threads on the blocked queue when lock_release(l) is called by the thread holding the lock, the lock remains locked and the thread at the head of the blocked queue is put on the ready queue at the back, through a regular enqueue operation. If no threads are waiting, it marks the lock as being not held and returns. The results of a thread attempting to acquire a lock it already holds or to release a lock it doesn't hold are up to you.
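A sketch of this policy in plain C, using simple linked-list queues in place of the kernel's blocked and ready queues, and a hypothetical explicit self parameter (the real lock_acquire(l) takes only the lock, since the kernel knows the current task):

```c
#include <assert.h>
#include <stddef.h>

/* Illustrative thread type; the real kernel queues PCBs. */
typedef struct thread {
    int id;
    struct thread *next;
} thread_t;

typedef struct lock {
    int held;
    thread_t *wait_head, *wait_tail;  /* FIFO blocked queue for this lock */
} lock_t;

static thread_t *ready_head, *ready_tail;  /* stand-in for the ready queue */

static void enqueue(thread_t **h, thread_t **t, thread_t *th)
{
    th->next = NULL;
    if (*t) (*t)->next = th; else *h = th;
    *t = th;
}

static thread_t *dequeue(thread_t **h, thread_t **t)
{
    thread_t *th = *h;
    if (th) { *h = th->next; if (!*h) *t = NULL; }
    return th;
}

void lock_init(lock_t *l) { l->held = 0; l->wait_head = l->wait_tail = NULL; }

/* Non-preemptive kernel: no atomicity needed around these updates. */
void lock_acquire(lock_t *l, thread_t *self)
{
    if (!l->held)
        l->held = 1;
    else  /* stand-in for block(): park the caller on the lock's queue */
        enqueue(&l->wait_head, &l->wait_tail, self);
}

void lock_release(lock_t *l)
{
    thread_t *next = dequeue(&l->wait_head, &l->wait_tail);
    if (next)  /* lock stays held; hand it to the earliest waiter */
        enqueue(&ready_head, &ready_tail, next);
    else
        l->held = 0;
}
```

Note that on handoff the lock is never marked free, so a thread that was never blocked cannot sneak in ahead of the waiters; this is what gives FIFO ordering of acquire()s.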

We have put a non-conforming implementation of mutual exclusion in lock.c so that you can write and test the kernel threads part in isolation. When you are ready to test your own implementation, change the constant SPIN in lock.c to FALSE.

Note: your lecture notes describe how to synchronize access to the fields associated with a lock. Since there is no preemption, if lock_acquire(l) and block() don't call do_yield() in the middle of manipulating data structures, you don't need to worry about race conditions while these functions do their job. In particular, it is not necessary to protect the scheduler data structures with a test-and-set lock.

Timing a Context Switch

The function util.c:get_timer() returns the number of instructions that have been executed since boot. Using util.c:print_int() and util.c:print_str(), use the first line of the display to show the number of instructions required by a context switch between threads and between processes. We've left space in th3.c and process3.c for your code.
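One way to structure the measurement, shown here with get_timer() and do_yield() stubbed out so the sketch is self-contained (in the kernel you would call the real functions, and the cost would of course not be a constant):

```c
#include <assert.h>

/* Stand-ins for util.c:get_timer() and do_yield(), so this compiles
 * outside the kernel; the fake switch "costs" a fixed 1234 instructions. */
static unsigned long fake_time;
static unsigned long get_timer(void) { return fake_time; }
static void do_yield(void) { fake_time += 1234; }

/* Bracket a yield with timer readings; the difference bounds the cost
 * of switching away and being switched back in. */
unsigned long time_one_switch(void)
{
    unsigned long before = get_timer();
    do_yield();
    return get_timer() - before;
}
```

In the real kernel the interval also includes whatever the other tasks executed before you were rescheduled, so a common approach is to have just two tasks yield back and forth many times and divide the total by the number of switches.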

Please turn in your measurement code and have it run when I compile and boot your OS. We don't need or want you to state the results in the README.

Extra Credit

There is 1 point of extra credit available for this assignment. To get it, you must keep track of how much time each thread and/or process has run via get_timer(). When scheduler() is called, the thread/process with the least amount of time on the CPU so far gets scheduled.
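A sketch of that selection policy, assuming a hypothetical cpu_time field accumulated from get_timer() readings (the field name and pick_fair() are illustrative, not part of the start code):

```c
#include <assert.h>
#include <stddef.h>

#define NUM_TASKS 3

typedef struct pcb {
    int id;
    unsigned long cpu_time;  /* total instructions charged so far */
    int runnable;
} pcb_t;

static pcb_t pcbs[NUM_TASKS];

/* Scan all runnable tasks and pick the one with the least CPU time.
 * A linear scan is fine at this scale; ties go to the lower index. */
pcb_t *pick_fair(void)
{
    pcb_t *best = NULL;
    for (int i = 0; i < NUM_TASKS; i++) {
        if (!pcbs[i].runnable)
            continue;
        if (best == NULL || pcbs[i].cpu_time < best->cpu_time)
            best = &pcbs[i];
    }
    return best;
}
```

Charging each task on every switch with get_timer() deltas (now minus the reading taken when it was dispatched) keeps the accounting simple.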

Hints

(Borrowed from previous years. Hints will be updated as questions come out from office hours / design reviews.)

Grading Rubric

The project is worth 15 points plus one for extra credit. The breakdown is as follows:

Description                                                                      Points
Design review                                                                         5
OS can start a single user program                                                    1
yield() can store and restore the stack correctly                                     1
Without locks, the scheduler obeys FIFO order                                         1
yield() can store and restore the stack, general registers, and flags correctly       1
OS correctly unfolds execution for the 4 threads from the design review               1
Your lock guarantees mutual exclusion and FIFO order (earlier acquire() gets
  the lock first)                                                                     2
th3.c and process3.c can time context switches                                        1
OS successfully loads and runs all tasks provided in the template code on bochs       2

For 1 extra credit point, implement the scheduler so that it achieves fairness. Show me that it’s working in your readme and include test cases if possible. In other words, "it looks like it's working" is not enough.

Submission

Don't forget to complete the general requirements for every project!

Submit via Dropbox. Only one person per group should submit. When submitting the assignment, please turn in a zipped folder containing the starter code with your modifications, along with your readme. We should be able to run your Makefile to compile everything.


Glossary

address line 20
When PCs boot up, addresses 0x100000 to 0x1fffff are aliases for 0x00000 to 0xfffff to accommodate badly written real mode programs. Programs that use more than one megabyte of memory must disable this behavior. See http://www.win.tue.nl/~aeb/linux/kbd/A20.html.
kernel mode
Synonym for protection ring 0.
process control block
The process control block (PCB) contains all of the state the kernel needs to keep track of a process or kernel thread.
protected mode
In (16-bit) real mode, an address ds:si refers to byte 16*ds + si. In 32-bit protected mode, the segment registers contain offsets into the global descriptor table, whose entries control the translation of memory accesses into physical addresses. The boot block provided to you sets up the global descriptor table and segment registers so that no translation is applied; you can ignore the segment registers for this project. See http://www.internals.com/articles/protmode/introduction.htm.
protection ring
Intel processors since the 286 have four privilege levels, ring 0 through ring 3. Ring 0 is the most privileged; ring 3 is the least. Typically the kernel runs in ring 0, and user processes run in ring 3. For this project, all processes run in ring 0.
system call
In modern operating systems, processes are enjoined from accessing memory that they do not own. System calls are how processes communicate with the kernel. Typically, the process writes the system call number and its arguments in its own memory, and then it transfers control to the kernel with a special kind of far call. The kernel then performs the desired action and returns control to the process.
task
Following Linux, we use "task" to mean process or kernel thread.