Tuesday, November 27, 2007
Chapter 6 - Lecture
Ready queue is FIFO: process served in order of arrival at the queue
Shortest-Job-First (SJF) Scheduling
CPU is assigned to the process with the smallest next CPU burst, use FCFS to break ties
SJF is optimal: give the minimum average waiting time for a given set of processes.
Two schemes:
Nonpreemptive: once the CPU is given to a process, the process cannot be preempted until it finishes its CPU burst.
Preemptive: if a new process arrives with next CPU burst shorter than the remaining time of the current executing process, preempt the current executing process.
Also called Shortest-Remaining-Time-First (SRTF).
Priority Scheduling
A priority number (integer) is associated with each process
The CPU is allocated to the process with the highest priority (smallest integer ≡ highest priority). FCFS used to break ties
Preemptive: preempt the CPU if the priority of the newly arrived process is higher than the priority of the currently running process
Nonpreemptive: allow the currently running process to complete its CPU burst
SJF is a priority scheduling where priority number is the predicted next CPU burst time.
Problem: Starvation – low-priority processes may never execute.
Solution: Aging –gradually increase the priority of the processes as time progresses.
Round Robin (RR) Scheduling
Designed for time-sharing systems
The algorithm:
Each process gets a small unit of CPU time (time quantum or time slice), usually 10-100 milliseconds.
After the time quantum has elapsed, the process is preempted and added to the end of the ready queue (an FIFO queue).
Essentially FCFS scheduling with preemption
If there are n processes in the ready queue and the time quantum is
q, then response time is no more than (n-1)q
Smaller q smaller response time, but more context switches
Performance of RR depends on size of time quantum
q large: same as FCFS
q small: each of the n processes has the impression that it has its own
processor running at 1/n the speed of the real processor
q must be large with respect to context switch time, otherwise overhead is too high.
Multilevel Queue Scheduling
Ready queue is partitioned into separate queues; each process is permanently assigned to one queue based on some property of the process (e.g., process type)
E.g., separate queues for foreground (interactive) processes and background (batch) processes
Each queue has its own scheduling algorithm
E.g., foreground – RR, background – FCFS
Scheduling must be done between the queues.
Fixed priority preemptive scheduling
E.g., foreground queue has absolute priority over background queue
Possibility of starvation.
Time slice between the queues: each queue gets a certain amount of CPU time
E.g., 80% CPU time to foreground in RR, 20% to background in FCFS
Chapter 5 - Lecture
Overview
Multithreading Models
Threading Issues
Linux Threads
Pthreads
Threads Overview
A process can contain multiple threads of control
A thread (or lightweight process) contains:
A thread ID
A program counter
A register set
A stack
Threads belonging to the same process share:
Code section
Data section
Open files
A process with multiple threads of control can do multiple tasks at a time
Single and Multithreaded Processes
Benefits of Threads
Increase responsiveness to user
Thread creation faster than process creation
Thread context switch faster than process context switch
Utilization of multiprocessor architectures: threads belonging to the same process may run in parallel on different processors
User Threads
Supported by a user-level thread library
Library handles thread creation, scheduling, and management with no support from kernel
Advantage: user-level threads are fast to create and manage
Drawback: a user thread performing a blocking system call will cause entire process to block
Example user thread libraries: POSIX Pthreads, Mach Cthreads, Solaris 2 UI-threads
Kernel Threads
Supported by the operating system
Kernel handles thread creation, scheduling, and management
Advantages:
If a thread performs a blocking system call, the kernel can schedule another thread in the application for execution
Kernel can schedule threads on different processors in a multiprocessor system
Drawback: slower to create and manage than user threads
Most modern operating systems support kernel threads
Multithreading Models
Many systems support both user level and kernel level threads
How to map user level threads to kernel level threads?
Many-to-One
One-to-One
Many-to-Many
Many-to-One
Many user-level threads mapped to one kernel thread.
Allow the developer to create as many user threads as she wishes
Entire process will block if a thread makes a blocking system call
Multiple threads can’t run in parallel on multiprocessors
Example: green threads (a user-level thread library) in Solaris 2
Many-to-One Model
One-to-One
Each user-level thread mapped to a kernel thread
Allow another thread to run when a thread blocks
Allow multiple threads to run in parallel on multiprocessors
Most systems limit the number of threads supported due to overhead of creating kernel level threads
Examples: Windows NT/2000, OS/2
One-to-one Model
Many-to-Many Model
Many user level threads mapped to a smaller or equal number of kernel threads.
Avoid limitations of both many-to-one and one-to-one models
Developers can create as many user threads as necessary,
Kernel threads corresponding to the user threads can run in parallel on multiprocessors
Kernel can schedule another thread for execution if a thread blocks
Examples: Solaris 2, Windows NT/2000 with the fiber library
Many-to-Many Model
Threading Issues: fork and exec
What if one thread in a program calls fork?
Some UNIX systems have two versions of fork:
The new process duplicates all threads
The new process duplicates only the thread that invoked fork
What if one thread in a program calls exec?
The new program replaces the entire process (including all threads)
If exec is called immediately after fork, duplicating all threads is unnecessary
Threading Issues: Cancellation
Thread cancellation: killing a thread before it has completed
Asynchronous cancellation: the target thread is immediately cancelled
Can cause problems if the target thread is in the middle of allocating resources, performing I/O, or updating shared data
Deferred cancellation: the target thread periodically checks if it should terminate
Allow cancellation at safe points
Pthreads refer to safe points as cancellation points
Threading Issues: Signal Handling
A signal is used to notify a process about the occurrence of an event
Synchronous signal: sent from a process to itself
E.g., division by 0, illegal memory access
Asynchronous signal: sent from external source
E.g., timer expire, ctrl-c
A signal may be handled by
Default signal handler:
Every signal has one
Run by kernel
User-defined signal handler: override the default signal handler
Threading Issues: Signal Handling
If a process has several threads, where to deliver a signal?
To the thread to which the signal applies (e.g. division by 0)
To all the threads in the process (e.g. ctrl-c)
To certain threads in the process, i.e., those that are not blocking the signal
Typically a signal is delivered only to the first thread that is not blocking the signal
Assign a thread to receive and handle all signals for the process
Threading Issues: Thread Pools
Motivating example: a web server creates a new thread to service each request
Two concerns:
Overhead to create thread; thread discarded after it has completed its work
No limit on the number of threads created, may exhaust system resources
One solution: thread pools
Create a pool of threads at process startup
Request comes in: wake up a thread from pool, assign the request to it. If no thread available, server waits until one is free
Service completed: thread returns to pool
Benefits:
Faster to wake up an existing thread than creating a new thread
Pool limits the number of threads in system
Linux Threads
clone system call used to create kernel level threads
Allow child to share resources with parent (e.g. memory space, open files, signal handlers)
Parameter ‘flags’ indicates the extent of sharing
How to allow sharing?
Kernel data structure for a process created by clone contains pointers to the data structures of the parent
process
Pthreads
The POSIX standard (IEEE 1003.1c) API for thread creation and synchronization.
The standard specifies interface only, implementation is up to developer.
Common in UNIX operating systems.
On Linux machines
#include
Link with the pthread library
g++ threads.cc -lpthread
Pthreads: Creating
int pthread_create(pthread_t * thread, pthread_attr_t * attr,
void * (* start_routine) (void *), void * arg);
On success
A new thread is created with attributes “attr” (NULL=default), id of new thread is placed in “thread”.
New thread starts executing function “start_routine” with parameter “arg”
New thread executes concurrently with the calling thread
New thread runs until it returns from “start_routine” or it calls “pthread_exit”
Return 0
On failure
Return non-zero error code
pthread_create Example
void* my_thread (void *arg)
{
char *msg = (char *) arg;
cout << “Thread says “ << msg <<“\n”;
}
int main()
{
pthread_t t;
char *msg = “Hello World”;
pthread_create(&t, NULL, my_thread, msg);
return 0;
}
Pthreads: Waiting
int pthread_join(pthread_t th, void **thread_return); //wait for
a thread to terminate
th: the thread to wait for
thread_return: NULL or address of buffer to store return value of th
When a thread terminates, its memory resources (thread ID and stack) are not deallocated until another thread performs pthread_join on it.
At most one thread can wait for the termination of a given thread.
Return 0 on success, non-zero on failure
Chapter 4 - Lecture
How do we select a CPU-scheduling algorithm?
1. Define desired criteria
E.g., maximize CPU utilization with the constraint that max response time < 1 second
2. Evaluate algorithms under consideration
Four algorithm evaluation methods:
1. Deterministic modeling
2. Queuing models
3. Simulations
4. Implementation
Deterministic Modeling
Take a predetermined workload and determine the performance of each algorithm for that workload
Simple and fast
Results only valid for specific workloadnot useful
Queuing Models
The computer system contains a number of servers, each has a queue of waiting processes.
E.g., CPU with ready queue, I/O device with device queue
Queuing- network analysis: given distribution of process arrival and distribution of service time, we can compute utilization, average queue length, average waiting time, etc.
Require inaccurate/unrealistic assumptions for numerical solutionsresults may not be accurate
E.g. assume process arrival follows Poisson distribution, service time follows exponential distribution, processes are independent
Simulations
Write a simulator to simulate the activities in the system
Gather statistics of the algorithm performance as the simulation executes
Two ways to generate the data to drive the simulation:
1. Generate workload according to certain probability distribution
2. Use the sequence of actual events in the real system (recorded on a trace tape)
Advantage: results are accurate
Drawback:
Non-trivial to design, code, and debug a simulator
Simulations are time consuming
Implementation
Implement the scheduling algorithm in a real system
Cost is tremendous: coding algorithm, modifying kernel data structures, system must be taken down
Ideally, we want flexible scheduling algorithms
During OS build-time, boot time, or run time, variables used by the scheduler can be changed by system manager or by user
Scheduling in Practice
Solaris 2
Windows 2000
Linux
Solaris 2 Scheduling
Priority based, preemptive scheduling
Four scheduling classes, each includes a set of priorities:
Time sharing: default class for a process
Use multilevel feedback queue with time slicing in each queue
The higher the priority, the smaller the time slice
Interactive processes have higher prioritygood response time
CPU-bound processes have lower prioritygood throughput
Interactive:
Same scheduling policy as time sharing class
Give a priority boost to the task in the active window
System: for system threads
Threads have fixed priority
No time-slicing: thread runs until it blocks or is preempted by a higher
priority thread
Real-time: for processes that require immediate system access
Threads have fixed priority
Time slicing among threads with the same priority.
Windows 2000 Scheduling
Priority-based, preemptive scheduling
32 priority levels, large integer = high priority
One queue for each priority, RR scheduling in each queue
Six priority classes: real-time, high, above normal, normal, below normal, idle
Seven levels of relative priority per class
An integer priority is assigned to a thread based on the (class, relative priority) pair
For processes not in the real-time priority class, relative priority may be raised or lowered
Lowered when a thread’s time quantum runs outlimit CPU consumption of CPU-bound threads
Increased when a thread is released from a wait operation
give good response time to interactive threads
When a process moves into the foreground (i.e., currently selected on screen), increase time quantum by some factor (typically by 3)
Windows 2000 Priorities
Linux Scheduling
Two scheduling classes: time-sharing and real-time
Real-time class has priority over time-sharing class
Only processes running in user mode may be preempted
Time-sharing scheduling
Each process has a priority and a certain number of credits
Scheduler chooses the process with the most credits
Running process loses 1 credit when a timer interrupt occurs
Running process is suspended when its credits becomes 0
If all processes in ready queue have no credit, perform a recrediting operation
For every process, credits = credits/2 + priority (large integer =high priority)
The crediting system automatically gives high priority to interactive or I/O bound processes
Linux Scheduling
Real-time scheduling
Two scheduling classes
FCFS (no preemption)
RR (preemption when time-slice expires)
Each process has a priority and belongs to one scheduling class
Scheduler always runs the process with the highest priority.
Among processes of equal priority, use FCFS or RR
Process Creation
A process (parent) may create new processes (children) during execution
Resource sharing
Child obtains resources from OS
Parent and child share resources (such as memory or files)
Execution
Parent and children execute concurrently.
Parent waits until some or all children have terminated
Address space
Child is a duplicate of parent.
Child has a program loaded into it.
Process Creation in UNIX
int fork() : create a child process identical to parent
Child process has a copy of the address space of the parent process
On success:
Both parent and child continue execution at the instruction after fork
For parent: return code is the process ID (PID) of child
For child: return code is 0
On failure:
Child not created
Return code is –1 for parent
Waiting for Process Termination in UNIX
int wait(int *status) //wait for any child to terminate
int waitpid(int pid, int *status, int options) //wait for the specified child to terminate
If status is not NULL, status information of child is stored in the location pointed to by status.
Options: 0 - wait until child has exited
WNOHANG - return immediately if no child has exited
Return value: the process ID of the child which exited, or zero if
WNOHANG was used and no child was available, or -1 on error
If a child terminates before parent calls wait:
When parent calls wait, it returns immediately and child’s PCB is deallocated.
Loading Programs in UNIX
int execl(const char *path, const char *arg, ...);
int execlp(const char *file, const char *arg, ...);
int execv(const char *path, char *const argv[]);
int execvp(const char *file, char *const argv[]);
Used after a fork to replace a process’ memory space with a new program.
Does not return on success, return –1 on error
Second parameter:
l - list of arguments
v – array of arguments
Both list and array must be terminated by a NULL pointer
First parameter:
empty – full path of executable
p – name of executable, will search the search path for executable
Process Termination in UNIX
A process terminates when it makes the exit system call.
May return data to parent, parent can collect the data using wait.
Resources deallocated by operating system.
If a parent terminates without waiting for its children:
The init process becomes the children’s parent
The init process will wait for the children to terminate
Cooperating Processes
Independent process - cannot affect or be affected by
the execution of another process
Do not share data with any other process
Cooperating process - can affect or be affected by the
execution of another process
Share data with other processes
Concurrent execution of cooperating processes requires
mechanisms for process communication and
synchronization
Producer-Consumer Problem
A common paradigm for cooperating processes
A producer process produces information that is consumed by a consumer process.
Have a buffer of items filled by the producer and emptied by the consumer
unbounded-buffer - no limit on the size of the buffer
bounded-buffer - assume a fixed buffer size
Producer and consumer run concurrently and must be synchronized
Bounded-Buffer Shared-Memory Solution
Shared data
#define BUFFER_SIZE 10
typedef struct {
. . .
} item;
item buffer[BUFFER_SIZE];
int in = 0; //next free position in buffer
int out = 0; //first full position in buffer
Shared buffer implemented as a circular array
Buffer empty when in == out
Buffer full when (in+1) % BUFFER_SIZE == out
Allow at most BUFFER_SIZE-1 items in buffer
Producer Process
item nextProduced;
while (1) {
while (((in + 1) % BUFFER_SIZE) == out)
; /* do nothing */
buffer[in] = nextProduced;
in = (in + 1) % BUFFER_SIZE;
}
Consumer Process
item nextConsumed;
while (1) {
while (in == out)
; /* do nothing */
nextConsumed = buffer[out];
out = (out + 1) % BUFFER_SIZE;
}
Interprocess Communication (IPC)
Allow processes to communicate and to synchronize their actions without sharing the same address space
Message-Passing System:
IPC facility provides two operations:
send(message)
receive(message)
If P and Q wish to communicate, they need to:
Establish a communication link between them
Exchange messages via send/receive
Implementation of communication link
UNIX pipes, sockets
Direct Communication
Processes must name each other explicitly:
send (P, message) – send a message to process P
receive(Q, message) – receive a message from process Q
Properties of communication link
A link is established automatically between a pair of processes that want to communicate.
A link is associated with exactly two processes.
Between each pair there exists exactly one link.
The link may be unidirectional, but is usually bi-directional.
Indirect Communication
Messages are sent to and received from mailboxes (or ports).
Processes can communicate only if they share a mailbox.
Primitives:
send(A, message) – send a message to mailbox A
receive(A, message) – receive a message from mailbox A
Properties of communication link
A link is established between a pair of processes only if they
have a shared mailbox
A link may be associated with more than two processes.
Multiple links may exist between a pair of processes, each
link corresponds to one mailbox.
A link may be unidirectional or bi-directional.
Synchronization
Message passing may be either blocking or nonblocking (also knows as synchronous and
asynchronous)
Blocking send: sending process is blocked until the message is received by the receiving process or by the mailbox
Nonblocking send: sending process continues execution after sending a message
Blocking receive: receiver blocks until a message is available
Nonblocking receive: receiver gets (immediately) either a valid message or a null.
Buffering
Messages exchanged by communicating processes reside in a queue implemented in one of three ways.
1. Zero capacity: queue has maximum length 0
Link cannot have a message waiting
Sender must block until recipient receives message
2. Bounded capacity: queue has finite length n
A link can have at most n messages waiting.
Sender blocks if queue is full.
3. Unbounded capacity: queue has infinite length
No bound on number of waiting messages
Sender never blocks.
Friday, November 16, 2007
Assignment # 2 (11-17-07)
2. Whatare the five major activities of an operating system in regards to file management?
3. What is the purpose of commmand interpreter?
4. What is the puspose of the system Call?
5. What is the main advantage of layered pproach to system design?
Reference: Operating System Concepts 5th edition by Silberschatz/Galvan
Deadline: Wednesday Nov 23, 2007
Yellow Paper
Chapter 3
System Components
System Calls
System Programs
System Structure
Common System Components
Process Management
Main-Memory Management
File Management
I/O-System Management
Secondary-Storage Management
Networking
Protection System
Command-Interpreter System
Process Management
A process is a program in execution.
A process needs resources - CPU time, memory, files,and I/O devices
The operating system is responsible for
Process creation and deletion.
Process suspension and resumption.
Provision of mechanisms for:
process synchronization
process communication
deadlock handling
Ch4 - Ch8
Main-Memory Management
A program must be loaded into memory to be executed
Keep multiple processes in memory
Improve CPU utilization
Improve response speed to users
The operating system is responsible for:
Keeping track of who is using what parts of memory
Deciding which processes to load when memory space becomes available.
Allocating and de-allocating memory space as needed.
Ch9, Ch10
File Management
A file is a collection of related information defined by its creator.
OS maps files onto physical media and accesses files via the devices that control the physical media
OS is responsible for:
File creation and deletion.
Directory creation and deletion.
Supporting primitives for manipulating files and directories.
Mapping files onto secondary storage.
Backing up files on nonvolatile storage media
File protection: who can access a file in what ways
Ch11, Ch12
I/O-System Management
Manage and control I/O operations and I/O devices
The I/O subsystem consists of:
A general device-driver interface
Drivers for specific hardware devices
Buffering, caching, and spooling
Ch13 (skip)
Secondary-Storage Management
The operating system is responsible for:
Free space management
Storage allocation
Disk scheduling
Ch14 (skip)
Networking
Processors in a distributed system are connected through a communication network.
Communication among processors takes place using a set of protocols implemented by OS.
Ch15 - Ch17
Protection System
Protection refers to a mechanism for controlling access by processes or users to system resources.
Ch18
Command-Interpreter System
Command interpreter (or shell): get the next command statement and execute it
Graphical shell: Microsoft Windows and Mac OS
Command-line shell: UNIX
Can be part of kernel or just an application program
System Calls
System calls provide the interface between a process and the operating system.
Usually available as assembly-language instructions.
Some languages allow system calls to be made directly (e.g., C, C++)
Three methods to pass parameters to OS:
Pass parameters in registers.
Store the parameters in a table in memory, the table address is passed as a parameter in a register.
Parameters are pushed onto the stack by the program, and popped off the stack by operating system
Categories of System Calls
Process control: load & execute, create, end, abort, get/set process attributes, wait, etc.
File management: open, close, create, delete, read, write, reposition, get/set file attributes, etc.
Device management: request, release, read, write, reposition, get/set file attributes, etc.
UNIX: devices handled as virtual files, identified by special file names
Information maintenance: get/set time or date, get/set system data, get/set process/file/device attributes, etc.
Communications: open/close connection, send/receive message, etc.
Communication Models
Message-passing model: processes exchange info though an interprocess-communication facility provided by OS
Open connection before communication
Communication done by exchanging messages
Close connection after communication
Easier to implement for intercomputer communication
Shared-memory model: processes exchange info by reading/writing data in shared areas in memory
Fast and convenient within a computer
Must ensure processes do not write to the same location simultaneously
System Programs
System programs provide a convenient environment for program development and execution.
Categories of system programs:
File management: cp, ls, rm
Status information: df, date
File modification: vi, pico, emacs
Programming language support: compilers, assemblers,interpreters
Program loading and execution: linkers, loaders, debuggers
Communications: web browsers, email, remote login
System programs define the user interface
Shell
A system program that gets and executes the next user specified command
Two ways to implement commands
Shell contains the code to execute the command
Size of shell determined by number of commands
Difficult to add a new command
Commands are system programs
+ Shell can be small
+ Can easily add commands
- Slower
- Need pass parameters from shell to the system program
- Interpretation of parameters may be inconsistent
Unix: some commands are built in, the rest are system programs
System Structure
Simple structure
MS-DOS, Unix
Layered approach
Windows NT, OS/2
Microkernels
Mach, MacOS X server
UNIX System Structure
OS consists of two parts:
System programs
The kernel
Consists of everything below the system-call interface and above the physical hardware
Provides the file system, CPU scheduling, memory management, I/O, etc.
A large number of functions in one leveldifficult to enhance
Layered Approach
The operating system is divided into a number of layers
The bottom layer (layer 0) is the hardware
The highest (layer N) is the user interface
Each layer uses functions and services of only lower-level layers.
Benefits:
Simplify debugging and system verification: can debug one layer at a time
Can change the implementation of a layer without affecting other layers, provided that the interface and functions of the layer do not change
Problems:
Design of layers can be tricky
Less efficient: each layer may add overhead to a system call
Microkernel System Structure
Remove all non-essential components from kernel, implementing them as services run in user space
Usually, microkernel provides:
Minimal process and memory management
Communication facility
Communication between client program and various services provided by message passing
System calls become messages sent via kernel to appropriate service
Benefits:
Adding services doesn’t require kernel modification
Kernel codifications simpler when required
More secure: less code is running in kernel mode
More reliable: kernel OK when a service fails
Examples: Mach, MacOS X server
Assignment # 1 (11-17-07)
a. Batch
b. Interactive
c. Time Sharing
d. Real Time
e. Distributed
2. Whatare the 3 main purpose of the Operating System
3. Describe the 3 advantage of symmetric and asymetric multiprocessing
4. Why are destributed system desirable
Reference: Operating System Concepts 5th edition by Silberschatz/Galvan
Deadline: Wednesday Nov 21, 2007
Yellow Paper
Chapter 2
Computer System Operation
I/O Structure
Storage Structure
Storage Hierarchy
Hardware Protection
Computer-System Architecture
CPU and device controllers can execute concurrently, competing for memory access.
Interrupts
Modern operating systems are interrupt driven.
An interrupt can be from hardware (i.e., completion of I/O) or software
A software-generated interrupt (also called trap) is caused either by an error or a system call
For each type of interrupt, OS provides an interrupt service routine
When CPU is interrupted:
Complete current instruction
Transfer execution to interrupt service routine (ISR)
When ISR completes, resume interrupted computation
Interrupt Handling
How to locate an ISR?
Use interrupt vector - an array of pointers to ISRs
Interrupt vector indexed by device number
Interrupt vector stored at fixed location, usually in low memory
How to resume?
Save return address
If ISR needs to use registers, save old values in registers
After completing, ISR restores register values and load return address into program counter
I/O Interrupts
A device controller
is in charge of a specific type of device
maintain some local buffer and a set of special-purpose registers
is responsible for moving data between device and buffer
To start an I/O operation:
CPU loads registers in device controller
Device controller examines registers and takes appropriate action
When device controller completes I/O, triggers interrupt
Two I/O Methods
Synchronous I/O: kernel returns control to user program only upon I/O completion.
Asynchronous I/O: kernel returns control to user program without waiting for I/O completion
I/O Structure
How does kernel wait for I/O completion?
Use wait instruction or wait loop
If kernel always waits for I/O completion, at most one I/O request is outstanding at a time
Advantage: when an interrupt occurs, OS knows which device is interrupting
Disadvantage: can’t overlap I/O with I/O, I/O with computation
A better approach: kernel continues processing after starting I/O
If there are user programs ready to run, execute one
If nothing else to do: use wait instruction or wait loop
Can have multiple I/O requests outstanding
use device status table to keep track
If device busy, type of request and other parameters stored in table entry for device
Each device has a wait queue
I/O Structure (cont’d)
When an I/O device interrupts:
Operating system indexes into device-status table to determine device status and modify table entry
If the interrupt signals I/O completion, kernel processes next request if wait queue not empty
Direct Memory Access (DMA) Structure
Used for high-speed I/O devices
Device driver sets up source/destination addresses and transfer length in registers of DMA controller
DMA controller transfers a block of data between I/O device and memory without CPU intervention.
DMA controller generates an interrupt when entire block is transferred.
Storage Structure
Registers
Main memory
Magnetic disks
Magnetic tapes
Registers and Main Memory
Registers and main memory are the only storage CPU can access directly
Instruction in execution and data used by instructions must be in main memory
Registers
Built into CPU
Accessible by CPU within one clock cycle
Main memory
Array of words, each word has own address
Memory speeds much slower than CPU speeds: memory access may take many CPU cycles to complete
Remedy: add fast memory (cache) between CPU and main memory
Secondary Storage
Not possible to have programs and date reside in main memory permanently because:
Main memory is too small
Main memory is volatile, loses contents without power
Secondary storage – extension of main memory that provides large nonvolatile storage capacity.
Magnetic Disks
Magnetic disk – most common secondary-storage device
Two surfaces of a disk platter covered with magnetic recording material
Surface of a platter logically divided into circular tracks, which are subdivided into sectors.
Magnetic disk speed
Positioning time/random access time (several ms): seek time + rotational latency
Seek time: time to move the disk arm to the desired cylinder
Rotational latency: time for the desired sector to rotate to the disk head
Transfer rate (megabytes per sec): rate data flow between the disk drive and the computer
Magnetic Tapes
Larger capacity than magnetic disks
Random access much slower than magnetic disks, transfer rate similar to magnetic disks
Mainly used for backup
Storage Hierarchy
Storage systems organized in hierarchy.
Different storage systems have different speed, cost, size, and volatility
Caching – improve performance where a large access-time or transfer-rate disparity exists between two components
Memory caching: add cache (faster and smaller memory) between CPU and main memory
When need some data, check if it’s in cache
If yes, use the data from cache
If not, use data from main memory and put a copy in cache
Disk caching: main memory can be viewed as a cache for disks.
Cache management: selection of cache size and replacement policy
Must ensure data that is simultaneously stored in more than one level to be consistent.
Migration of A From Disk to Register
• An integer A is located in a file residing on magnetic disk
• Want increment A by 1
Cache Coherency and Consistency
Cache coherency in multiprocessor systems
Each CPU has a local cache
A copy of X may exist in several cachesmust make sure that an update of X in one cache is immediately reflected in all other caches where X resides
Hardware problem
Cache consistency in distributed systems
A master copy of the file resides at the server machine
Copies of the same file scattered in caches of different client machines
Must keep the cached copies consistent with the master file
OS problem
Hardware Protection
In multiprogramming systems, must protect OS and all other programs and their data from any erroneous program
Programming errors are detected by hardware and handled by OS
Hardware protection mechanisms:
Dual-Mode Operation
I/O Protection
Memory Protection
CPU Protection
Dual-Mode Operation
Add a mode bit to computer hardware to indicate two modes of operations.
User mode (1) – execution done on behalf of a user.
Monitor mode (0) – execution done on behalf of operating system.
User programs are executed in user mode
When an interrupt or trap occurs, hardware switches to monitor
mode.
Monitor
Dual-Mode Operation
Certain machine instructions that can harm system are designated as privileged instructions
Hardware allows privileged instructions to be executed only in monitor mode
If a user program executes a privileged instruction, hardware traps to OS
User can ask OS to execute privileged instructions by executing a system call
Dual-Mode Operation
When a system call is executed:
Control passes to a service routine, mode bit set to 0
Determine if request is OK
Execute request
Return control to the instruction following the system call
I/O Protection
Protect system from illegal I/O
All I/O instructions are privileged instructions.
I/O must be performed via system calls
Memory Protection
Protect OS memory and memory of user programs
Add two registers that determine the range of legal addresses a program may access:
Base register – holds the smallest legal memory address.
Limit register – contains the size of the range
CPU hardware compares every address generated in user mode with the registers, trap to OS if address is illegal
When executing in monitor mode, OS has unrestricted access to both monitor and user’s memory.
Special privileged instructions used to load base and limit registers
CPU Protection
Prevent a user program from never returning control to the OS
Timer – interrupts computer after a specified period to ensure operating system maintains control.
OS sets a counter
Counter decremented every clock tick.
When counter reaches 0, an interrupt occurs.
OS sets timer before turning over control to the user
Load-timer is a privileged instruction.
CPU Protection
Timer commonly used to implement time sharing:
Each job gets a time slice of N ms
Kernel sets timer for N ms, starts a job
At end of time slice, kernel switches to next job (i.e., sets timer for N ms and starts job)
Monday, November 12, 2007
Syllabus
Course Title: Operating system
Pre-Requisite: Cos 1, Cos 2
Credit: 3 Units
Course Description
Instructional Methodologies
1. Lectures
2. Exercises
3. Quizzes
4. Case Study
5. Research
6. Group Project
7. Baby Thesis
8. Term Exam
IV. Grading System
Attendance 10%
Seatwork/Assignments 20% (Case Study/Baby Thesis)
Quizzes 40%
Term Exams 30%
Grade=((Total x 2)+Term Exam) /3
Final Grade=Prelim Grade+Midterm Grade + Final Grade----------------------PRELIM-----------------------
I. Introduction
a. What is an operating system
b. Batch system
c. Time-sharing system
d. Personal Computer system
e. Parallel system
f. Real-time system
g. Distributed system
II. Computer System Structure
a. Computer system operation
b. I/O Structure
c. Storage structure
d. Storage Hierarchy
e. Hardware protection
f. General system architecture
III. OS Structure
a. System components
b. Operating system services
c. System calls
d. System programs
e. System structure
f. Virtual machines
g. System design and implementation
h. System generation
IV. Process
a. Process control
b. Process scheduling
c. Operation on process
d. Cooperating process
e. Interprocess communication
V. Thread
a. Overview
b. Benefits
c. User and kernel
d. Multithreading models
e. Solaris 2 threads
VI. CPU Scheduling
a. Basic concepts
b. Scheduling criteria
c. Scheduling algorithms
d. Multiple processor scheduling
e. Real-time scheduling
f. Thread scheduling
g. Algorithms
----------------------MIDTERM-----------------------
VII. Process Synchronization
a. Background
b. Critical-section problem
c. Two-task solution
d. Synchronization problems
e. Semaphores
f. Classical Synchronization problems
g. Monitors
h. OS Synchronization
VIII. Deadlocks
a. System model
b. Deadlock for handling
c. Methods for handling
d. Deadlock prevention
e. Deadlock avoidance
f. Deadlock detection
g. Recovery from deadlock
IX. Memory Management
a. Background
b. Swapping
c. Contigous memory allocation
d. Paging
e. Segmentation
f. Segmentation with paging
X. Virtual memory
a. Background
b. Demand paging
c. Page replacement
d. Allocation of frames
e. Thrashing
f. Operating system examples
g. Other considerations
XI. File Structure
a. File concepts
b. Access methods
c. Directory structure
d. Protection
e. File-system structure
f. Allocation methods
g. Free-space management
h. Directory implementation
i. Efficiency and performance
j. Recovery
XII. I/O system
a. Overview
b. I/O Hardware
c. Application to interface
d. Kernel I/O Subsystem
e. I/O request handling
f. Performance
XIII. Mass Storage
a. Disk structure
i. Disk Scheduling
ii. Disk Management
iii. Swap-management
iv. Disk Reliability
v. Stable-storage implementation
vi. Tertiary Structure
----------------------FINALS-----------------------
XIV. Network Structures
a. Background
b. Network types
c. Communication
d. Comm. Process
e. Robustness
f. Design issues
g. Network examples
XV. Distributed Communication
a. Sockets
b. Remote procedure Calls
c. Remote Method invocation
d. COBRA
e. Object registration
XVI. Distributed Coordination
a. Event Ordering
b. Mutaul exclusion
c. Deadlosk handling
d. Election algorithms
XVII. Distributed File System
a. Background
b. Naming and transparency
c. Remote access
d. State full vs. stales services
e. File Replication
f. Example of System NFS
XVIII. Protection
a. Goal protection
b. Dominion of protection
c. Access matrix
d. Implementation of acces matrix
e. Revocation of access rights
f. Lanaguage based protection
XIX. Security
a. The security probkem
b. Authentication
c. Program threats
d. System threat
e. Threat modeling
f. Encryption
g. Computer security classification
h. An example security model windows NT
XX. The Linux System
a. History
b. Design
c. Kernel modules
d. Process mgnt.
e. Scheduling
f. Mem. Mgnt.
g. Files system
h. I/O
i. Interprocess com.
j. Net. Struct.
k. Security
XXI. Windows 2000
a. History
b. Design prin.
c. Syste. Comp.
d. Environment subsystem
e. Networking
f. Programmer interface
XXII. Apple Macintosh OS
a. History
b. Design Principles
c. Kernel Modules
d. Process Mgnt.
e. Scheduling
f. Memory mgnt.
g. File Systems
h. I/O
i. Interprocess comm..
j. Network structures
k. security
Chapter 1-Introduction
Mainframe Systems
Desktop Systems
Multiprocessor Systems
Distributed Systems
System View of Operating System
Resource allocator: decide how to allocate resources to specific programs and users efficiently and fairly
Control program: mange the execution of user programs to prevent errors and improper use of the computer
What Makes Up an OS?
Common definition: the one program running at all times (usually called kernel).
System programs are sometimes considered part of OS
Provide a convenient environment for program development and execution
E.g., text editors, compilers, assemblers, linkers, debuggers
System Goals
Convenience for the user (PCs)
Efficient operation of the computer system (large multiuser systems)
maximize resource utilization
fair use of resources
Mainframe Systems
Simple batch systems
Multiprogrammed batch systems
Time-sharing systems
Simple Batch Systems
User prepare a job and submit it to a computer operator
User get output some time later
No interaction between the user and the computer system
Operator batches together jobs with similar needs to speedup processing
Task of OS: automatically transfers control from one job to another.
OS always resident in memory
Disadvantages of one job at a time:
CPU idle during I/O
I/O devices idle when CPU busy
Multiprogrammed Batch Systems
Keep more than one job in memory simultaneously
When a job performs I/O, OS switches to another job
Increase CPU utilization
All jobs enter the system kept in the job pool on a disk, scheduler brings jobs from pool into memory
Time-Sharing Systems
Like multiprogrammed batch, except CPU switches between jobs occur so frequently that the users can interact with a running program
User give instructions to the OS or a program, and wait for immediate results
Require low response time
Allow many users to share the computer simultaneously, users have the impression that they have their own machine
CPU is multiplexed among several jobs that are kept in memory and on disk
Desktop Systems
Personal computers – computer system dedicated to a single user.
CPU utilization not a prime concern, want maximize user convenience and responsiveness.
Can adopt technologies developed for mainframe operating systems: virtual memory, file systems, multiprogramming
File protection needed due to interconnections of computers
Operating systems for PCs: Windows, Mac OS, Linux
Multiprocessor Systems
Also known as parallel systems or tightly coupled systems
More than one processor in close communication, sharing computer bus, clock, memory, and usually peripheral devices
Communication usually takes place through the shared memory.
Advantages
Increased throughput: speed-up ratio with N processors <>
Economy of scale: cheaper than multiple single-processor systems
Increased reliability: graceful degradation, fault tolerant
Multiprocessor Systems
1. Symmetric multiprocessing (SMP)
Each processor runs an identical copy of the operating system
All processors are peers: any processor can work on any task
OS can distribute load evenly over the processors.
Most modern operating systems support SMP
2. Asymmetric multiprocessing
Master-slave relationship: a master processor controls the system, assigns works to other processors
Each processor is assigned a specific task
Don't have the flexibility to assign processes to the leastloaded CPU
More common in extremely large systems
Distributed Systems
Loosely coupled system – each processor has its own local memory and clock; processors communicate with one another through a network.
Need protocol to communicate: TCP/IP is the most common and best supported protocol
Three types of networks (based on distances between nodes)
1. Local area network (LAN): exists within a room/floor/building/campus
2. Wide area network (WAN): span a country or continent
3. Metropolitan area network (MAN): span a city
Transmission media: copper wires, optical fiber, radio
Types of Distributed Systems
1. Client-server systems
Client sends requests to server
Server sends reply to client
Two types of servers
2. File servers: allow clients to create, read, update, and delete files
Compute servers: execute actions requests by clients and send back results to clients
Peer-to-peer systems: all machines are equals, serve both as clients and servers
File-sharing networks: Gnutella, eDonkey 2000