03 - IPC (Inter-Process Communication)

Problem 1: Shared Memory Allocation

Why is it necessary for two processes to explicitly allocate shared memory to share a data structure such as an array?

Instructor Solution

It is necessary because, by default, modern operating systems enforce Process Isolation. Each process operates within its own Private Virtual Address Space, meaning the same memory address in Process A and Process B points to different physical locations. Without explicitly allocating Shared Memory:

Hardware Protection: The CPU’s Memory Management Unit (MMU) prevents one process from accessing another’s memory to ensure security and stability.
Independent Page Tables: Each process has its own set of page tables; therefore, changes made to an array in Process A are physically invisible to Process B.

Elaboration (by LLM)

Process isolation is a cornerstone of operating system security and stability:

The Problem Without Shared Memory:

Each process operates in its own virtual address space, isolated from all others:

Process A’s virtual address 0x1000 maps to physical address 0x4000 (via page table)
Process B’s virtual address 0x1000 maps to physical address 0x5000 (different page table)
Even though both use the same virtual address, they access different physical memory
Modifications in Process A are completely invisible to Process B

Memory Management Unit (MMU) Role:

The MMU hardware enforces this isolation:

Virtual-to-Physical Translation: Translates virtual addresses using per-process page tables
Access Control: Prevents access to physical memory not owned by the current process
Privilege Checking: Ensures user-mode processes cannot access kernel memory

Process A: write to address 0x1000
           ↓ (MMU translation)
           Actually writes to physical 0x4000

Process B: read from address 0x1000
           ↓ (different page table)
           Actually reads from physical 0x5000 (different data!)
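
The isolation described above can be observed directly with fork(). The following is a minimal sketch, assuming a POSIX system; the function name parent_view_after_child_write is hypothetical and chosen only for this illustration. The child modifies an ordinary variable, and the parent checks whether it can see the change.

```c
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Returns the parent's view of `value` after the child has modified
 * its own copy, or -1 if fork()/waitpid() fails. */
int parent_view_after_child_write(void) {
    int value = 1;                 /* ordinary (private) memory */

    pid_t pid = fork();
    if (pid < 0)
        return -1;                 /* fork failed */
    if (pid == 0) {
        value = 42;                /* changes only the child's copy */
        _exit(0);
    }
    if (waitpid(pid, NULL, 0) < 0)
        return -1;
    return value;                  /* unchanged in the parent */
}
```

Calling the function from main() and printing the result should show the original value, because the child's write lands in the child's own copy of the page.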

Why Isolation is Critical:

Security: One process cannot steal another’s data
Stability: A crash in Process A doesn’t corrupt Process B’s memory
Privacy: User applications cannot access kernel data
Fairness: Processes cannot interfere with each other

Explicit Shared Memory Allocation:

To share data, processes must explicitly allocate a shared memory segment:

Create Segment: Call shmget() to allocate kernel-managed shared memory
Attach Segment: Both processes call shmat() to map it to their virtual address spaces
Same Physical Location: Now both virtual addresses point to the same physical memory

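// One process creates the segment. Every other process must attach the SAME shmid,
// for example by inheriting it across fork() or by agreeing on a key (ftok()).
// With IPC_PRIVATE, two unrelated processes that each call shmget() get two different segments.
int shmid = shmget(IPC_PRIVATE, 1024, IPC_CREAT | 0666);
char *shared_data = (char *)shmat(shmid, NULL, 0);

// Every process attached to this shmid reads and writes the same bytes
*shared_data = 'X';  // Visible to all attached processes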

Key Point:

Process isolation is enforced by hardware (MMU) and kernel page table management. Sharing requires explicit kernel involvement to coordinate virtual address mappings.

Problem 2: Pipe Mechanisms

What’s a pipe? What is it used for?

Instructor Solution

A pipe is a unidirectional communication channel between two related processes that acts as a temporary data buffer managed by the operating system kernel.

Mechanism: It follows a First-In, First-Out (FIFO) structure.
Buffer: If the pipe fills up, the writing process is blocked; if the pipe is empty, the reading process is blocked until data arrives.

Elaboration (by LLM)

Pipes are one of the oldest and most fundamental IPC mechanisms in Unix systems:

Basic Pipe Structure:

A pipe is essentially a kernel-managed circular buffer:

Writer Process                Pipe Buffer              Reader Process
     |                    (Circular FIFO)                   |
     v                                                      v
[Write data]  →→→→→→→→→→→→→→→→→→→→→→→→→→→→→→→→→→→  [Read data]
   fd[1]                  kernel memory                     fd[0]

The pipe has two file descriptors:

fd[0]: Read end (where data exits)
fd[1]: Write end (where data enters)

FIFO Ordering:

Data flows in strict order:

write(fd[1], "Hello", 5);
write(fd[1], "World", 5);
read(fd[0], buf, 5);   // Gets "Hello" (first data in)
read(fd[0], buf, 5);   // Gets "World" (second data in)

Blocking Behavior:

Write Blocking:

// A pipe has limited capacity (64 KiB by default on Linux)
write(fd[1], large_data, 1024 * 1024);  // Kernel buffer fills up
                                        // Writer blocks
// The write resumes as the reader drains data

Read Blocking:

read(fd[0], buffer, 1024);  // Pipe is empty
                            // Read process blocks
// Unblocked when writer sends data

Related Processes Requirement:

Anonymous pipes are normally shared only between related processes (parent-child, or children of the same parent), because the descriptors are passed by inheritance:

Pipes are created before fork()
Child inherits parent’s file descriptors
Unrelated processes cannot use pipes directly (use named pipes/FIFOs instead)

Pipe Cleanup:

When a pipe is no longer needed:

Both ends must be closed
Unused descriptors must be explicitly closed in parent/child
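Once all write ends are closed and the buffer is drained, the reader gets EOF (read() returns 0)
If all read ends are closed, a writer gets SIGPIPE (or an EPIPE error if the signal is ignored)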
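
The EOF rule can be checked inside a single process. This is a small sketch, assuming a POSIX system; observe_pipe_eof is a hypothetical helper name. It records what two consecutive read() calls return: one while the write end is still open, and one after the last write end has been closed.

```c
#include <sys/types.h>
#include <unistd.h>

/* Fills results[0..1] with the return values of two reads on a fresh pipe.
 * Returns 0 on success, -1 if pipe() or write() fails. */
int observe_pipe_eof(ssize_t results[2]) {
    int fd[2];
    char buf[16];

    if (pipe(fd) < 0)
        return -1;
    if (write(fd[1], "Hello\n", 6) != 6) {
        close(fd[0]);
        close(fd[1]);
        return -1;
    }

    /* Write end still open: read() returns the buffered bytes at once. */
    results[0] = read(fd[0], buf, sizeof buf);

    /* The pipe is now empty. Closing the last write end makes the next
     * read() report EOF (0) instead of blocking. */
    close(fd[1]);
    results[1] = read(fd[0], buf, sizeof buf);

    close(fd[0]);
    return 0;
}
```

If the close(fd[1]) line were removed, the second read() would block forever, since the process itself would still hold an open write end.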

Pipe vs. Memory Performance:

Shared Memory: Direct memory access (fast, no system call per access)
Pipe:          Kernel copy (slower, one system call per read or write)
               - Write side must copy to kernel buffer
               - Read side must copy from kernel buffer

Pipes trade performance for automatic synchronization and buffering.

Problem 3: Message Queues vs. Pipes

What’s a message queue? How does it differ from a pipe?

Instructor Solution

A message queue is a linked list of messages stored within the kernel. It allows processes to exchange data in the form of discrete, structured “packets” or messages rather than a continuous byte stream. It differs from a pipe in three key ways:

Structure: Pipes are byte streams with no internal boundaries; message queues handle formatted blocks of data with specific types or priorities.
Persistence: Pipes are transient and vanish when processes close them; message queues can be persistent, remaining in the kernel even after the creating process exits.
Access: Pipes are strictly First-In, First-Out (FIFO); message queues allow processes to retrieve messages out of order based on a message type or priority.

Elaboration (by LLM)

Message queues provide a more sophisticated communication mechanism than pipes:

Structural Differences:

Pipes (Byte Stream):

Writer: "Hello" "World" "Test"
         ↓ (merged into single stream)
Pipe buffer: H e l l o W o r l d T e s t
              ↓
Reader: Must know where boundaries are
        First read(5) → "Hello"
        Second read(5) → "World"
        Third read(4) → "Test"

Message Queues (Structured Messages):

Writer: {type: 1, data: "Hello"}
        {type: 2, data: "World"}
        {type: 1, data: "Test"}
         ↓ (stored as discrete messages)
Kernel Queue:
  ┌────────────────────────┐
  │ type: 1, data: "Hello" │
  ├────────────────────────┤
  │ type: 2, data: "World" │
  ├────────────────────────┤
  │ type: 1, data: "Test"  │
  └────────────────────────┘
         ↓
Reader: Can extract by type
        msgrcv(..., type=1) → "Hello"
        msgrcv(..., type=1) → "Test"

Key Structural Benefit:

Message boundaries are preserved automatically. No need to encode length or delimiters.
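
For comparison, here is roughly what a program has to do by hand to get message boundaries over a pipe. This is a sketch of one common convention (a length prefix), not something the original text prescribes; the names send_frame, recv_frame, read_full and write_full are hypothetical, and the 4-byte host-order length assumes both ends run on the same machine.

```c
#include <stdint.h>
#include <sys/types.h>
#include <unistd.h>

/* Loop until exactly n bytes are read; -1 on error or early EOF. */
static int read_full(int fd, void *buf, size_t n) {
    char *p = buf;
    while (n > 0) {
        ssize_t r = read(fd, p, n);
        if (r <= 0)
            return -1;
        p += r;
        n -= (size_t)r;
    }
    return 0;
}

/* Loop until exactly n bytes are written; -1 on error. */
static int write_full(int fd, const void *buf, size_t n) {
    const char *p = buf;
    while (n > 0) {
        ssize_t w = write(fd, p, n);
        if (w <= 0)
            return -1;
        p += w;
        n -= (size_t)w;
    }
    return 0;
}

/* A frame is a 4-byte length followed by that many payload bytes. */
int send_frame(int fd, const void *data, uint32_t len) {
    if (write_full(fd, &len, sizeof len) < 0)
        return -1;
    return write_full(fd, data, len);
}

/* Returns the payload length, or -1 on error or if the frame exceeds cap
 * (in which case the payload is left unread). */
ssize_t recv_frame(int fd, void *data, uint32_t cap) {
    uint32_t len;
    if (read_full(fd, &len, sizeof len) < 0 || len > cap)
        return -1;
    if (read_full(fd, data, len) < 0)
        return -1;
    return (ssize_t)len;
}
```

With these helpers, two send_frame() calls followed by two recv_frame() calls return the two payloads separately, which is the behavior a message queue provides without any extra code. Retrying on EINTR is omitted to keep the sketch short.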

Persistence Differences:

Pipes:

int fd[2];
pipe(fd);
fork();
// ... parent and child use the pipe
close(fd[0]);
close(fd[1]);
// Once every process has closed both ends (or exited),
// the kernel destroys the pipe and discards any unread data

Message Queues:

int msqid = msgget(IPC_PRIVATE, IPC_CREAT | 0666);
fork();
// ... parent and child use message queue
exit(0);  // Parent exits
// Message queue remains in kernel!
// Another process that knows msqid can still access it:
msgrcv(msqid, ...);  // Works, data still there

Message queues persist until explicitly removed (msgctl(..., IPC_RMID)) or system reboot.

Access Pattern Differences:

Pipes (Strict FIFO):

write(fd[1], message1, len1);   // len1, len2: byte counts of the two messages
write(fd[1], message2, len2);
read(fd[0], buf, len1);   // Always gets message1's bytes
read(fd[0], buf, len2);   // Always gets message2's bytes
// No way to skip message1 and get message2

Message Queues (Priority/Type-Based):

struct Message {
    long mtype;  // Message type (must be > 0)
    char data[256];
};

struct Message msg1, msg2, buf;

msg1.mtype = 1;
strcpy(msg1.data, "Hello");                   // arrays cannot be assigned with =
msgsnd(msqid, &msg1, sizeof(msg1.data), 0);   // size excludes mtype

msg2.mtype = 2;
strcpy(msg2.data, "Urgent");
msgsnd(msqid, &msg2, sizeof(msg2.data), 0);

// Retrieve the type-2 message first, even though it was sent second
msgrcv(msqid, &buf, sizeof(buf.data), 2, 0);  // Gets msg2 (type 2)
msgrcv(msqid, &buf, sizeof(buf.data), 1, 0);  // Gets msg1 (type 1)
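
Type-based retrieval can be tried in a single process. The following is a hedged sketch using the System V calls msgget(), msgsnd(), msgrcv() and msgctl(), assuming System V IPC is available; the struct name demo_msg, its 32-byte payload, and the function name receive_type2_first are made up for this example.

```c
#include <string.h>
#include <sys/ipc.h>
#include <sys/msg.h>
#include <sys/types.h>

struct demo_msg {            /* hypothetical layout; mtype comes first and is > 0 */
    long mtype;
    char text[32];
};

/* Sends a type-1 and then a type-2 message, then asks for type 2 first.
 * Copies the first received text into out (outlen > 0).
 * Returns 0 on success, -1 on error. */
int receive_type2_first(char *out, size_t outlen) {
    int msqid = msgget(IPC_PRIVATE, IPC_CREAT | 0600);
    if (msqid < 0)
        return -1;

    struct demo_msg a = { .mtype = 1 }, b = { .mtype = 2 }, got;
    strcpy(a.text, "Hello");
    strcpy(b.text, "Urgent");

    int rc = -1;
    /* The size argument counts only the bytes after mtype. */
    if (msgsnd(msqid, &a, sizeof a.text, 0) == 0 &&
        msgsnd(msqid, &b, sizeof b.text, 0) == 0 &&
        msgrcv(msqid, &got, sizeof got.text, 2, 0) >= 0) {
        strncpy(out, got.text, outlen - 1);
        out[outlen - 1] = '\0';
        rc = 0;
    }

    msgctl(msqid, IPC_RMID, NULL);   /* otherwise the queue would persist */
    return rc;
}
```

After the call, out holds the text of the type-2 message even though it was queued second. The msgctl(..., IPC_RMID, ...) call matters: without it the queue stays in the kernel after the program exits.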

Comparison Table:

Feature	Pipes	Message Queues
Data Format	Byte stream	Structured messages
Boundaries	Must encode manually	Automatic
Access Order	Strict FIFO	By type/priority
Persistence	Transient	Persistent
Kernel ID	File descriptors	System V IPC ID
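Related Procs	Normally required (descriptors are inherited)	Any process with the queue ID or key
Unread Data	Discarded once all descriptors are closed	Preserved in kernel until the queue is removed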

When to Use Each:

Pipes: Simple producer-consumer, shell pipelines, related processes
Message Queues: Complex multi-message handling, persistent storage, priority-based processing

Problem 4: Connecting Two Child Processes

Assume you want to create 2 child processes, C1 and C2. You want the standard output of C1 to go to the standard input of C2. Write a very simple code to create child processes C1, C2 and tie them up using a pipe. C1 will then send the message “Hello\n” to its standard output, which C2 will read from its standard input and print it on the screen. After both children terminate, the parent prints “Parent done…\n” on the screen and terminates.

You may assume the existence of a system call pipe(int fd[2]) that creates a pipe and returns 2 descriptors. fd[0] will be the read end of the pipe, fd[1] will be the write end of the pipe. Also assume the existence of another system call dup2(int fd1, int fd2), which makes fd2 same as fd1, i.e., copies the contents of fd1 over fd2. Use fork() system call to create a child process.

Instructor Solution

#include <unistd.h>
#include <stdio.h>
#include <stdlib.h>
#include <sys/wait.h>

int main(int argc, char *argv[]) {
    int fd[2];
    pipe(fd); // From C1 to C2

    if (fork() == 0) {
        // C1
        dup2(fd[1], 1);
        close(fd[0]);
        close(fd[1]);
        printf("Hello\n");
        exit(0);
    } // end-if

    close(fd[1]);
    pid_t pid;

    if ((pid = fork()) == 0) {
        // C2
        dup2(fd[0], 0);
        close(fd[0]);
        char message[6];
        int len = read(0, message, 6); // Read hello
        write(1, message, len);
        exit(0);
    } // end-if

    close(fd[0]);
    waitpid(pid, NULL, 0); // Wait for C2
    wait(NULL);            // Reap C1 too, so both children have terminated
    printf("Parent done…\n");
    return 0;
} /* end-main */

Elaboration (by LLM)

This program demonstrates the power of pipes combined with file descriptor redirection:

Understanding dup2() Function:

dup2(int oldfd, int newfd) makes file descriptor newfd refer to the same open file as oldfd:

Before dup2(fd[1], 1):
File Descriptor Table:
  0 → stdin
  1 → stdout (console)
  fd[1] → pipe write end

After dup2(fd[1], 1):
File Descriptor Table:
  0 → stdin
  1 → pipe write end  (stdout now points to pipe!)
  fd[1] → pipe write end (same as 1 now)
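
The table change sketched above can be exercised without fork(). This is a minimal illustration, assuming a POSIX system; capture_stdout_write is a hypothetical helper name. It points descriptor 1 at a pipe, writes to descriptor 1, restores the original stdout, and then reads back what landed in the pipe.

```c
#include <sys/types.h>
#include <unistd.h>

/* Temporarily points fd 1 at a pipe, writes "Hello\n" to fd 1, restores
 * fd 1, and returns how many bytes landed in buf (or -1 on error). */
ssize_t capture_stdout_write(char *buf, size_t cap) {
    int fd[2];
    if (pipe(fd) < 0)
        return -1;

    int saved = dup(STDOUT_FILENO);          /* remember the real stdout */
    if (saved < 0 || dup2(fd[1], STDOUT_FILENO) < 0) {
        if (saved >= 0)
            close(saved);
        close(fd[0]);
        close(fd[1]);
        return -1;
    }
    close(fd[1]);                            /* fd 1 is now the only write end */

    ssize_t w = write(STDOUT_FILENO, "Hello\n", 6);   /* goes into the pipe */

    dup2(saved, STDOUT_FILENO);              /* put the original stdout back */
    close(saved);

    ssize_t n = (w == 6) ? read(fd[0], buf, cap) : -1;
    close(fd[0]);
    return n;
}
```

The write() call names descriptor 1 and knows nothing about the pipe, which is the property that lets an unmodified program be redirected.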

Process Execution Flow:

Parent Creates Infrastructure:

1. pipe(fd) creates pipe
   fd[0] = read end
   fd[1] = write end

2. First fork() for C1
   Child gets copy of all file descriptors

3. Second fork() for C2 (after the parent has closed fd[1])
   C2 inherits only the read end of the pipe

C1 (First Child) Execution:

dup2(fd[1], 1);      // Redirect stdout to pipe
close(fd[0]);         // Don't need to read
close(fd[1]);         // Close duplicate descriptor
printf("Hello\n");   // Goes to pipe, not console!
exit(0);             // Process ends

After dup2(), any output to stdout (fd=1) actually goes to the pipe.

C2 (Second Child) Execution:

dup2(fd[0], 0);      // Redirect stdin to pipe
close(fd[0]);         // Close original descriptor
char message[6];
int len = read(0, message, 6);  // Read from pipe (fd=0)
write(1, message, len);          // Write to stdout
exit(0);

Now stdin (fd=0) comes from the pipe, so read(0, ...) reads from C1’s output.

Parent Cleanup:

close(fd[1]);    // Parent won't write
close(fd[0]);    // Parent won't read
waitpid(pid, NULL, 0);  // Wait for C2 to finish
printf("Parent done…\n");

Why Close Unused Descriptors?

This is critical:

Pipe Lifecycle:
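  - Once every copy of fd[1] (write end) is closed, a read on the empty pipe returns EOF
  - If every copy of fd[0] (read end) is closed, a writer gets SIGPIPE
  - If the parent kept fd[1] open, a reader that loops until EOF would wait forever
    (C2 here issues a single read, so it returns as soon as "Hello\n" arrives)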

Data Flow Timeline:

Time 0:   Parent creates pipe, forks C1 and C2

Time 1:   C1 redirects stdout → pipe
          C1: printf("Hello\n")
          → data written to pipe buffer

Time 2:   C2 redirects stdin ← pipe
          C2: read(0, message, 6)
          → blocks until data arrives
          → data is available from C1
          → reads "Hello\n"

Time 3:   C2 writes to stdout
          Console shows: "Hello"

Time 4:   C1 exits, which closes its copies of the pipe descriptors
          C2 exits after its single read and write

Time 5:   Parent's waitpid() returns
          Parent: printf("Parent done…\n")

Output Guarantee:

Hello
Parent done…

The “Hello” will always appear before “Parent done” because of the waitpid() synchronization point.

Key Lessons:

dup2() redirects I/O: Elegant way to redirect stdin/stdout
Close unused descriptors: Essential for proper pipe EOF handling
Process coordination: Parent ensures C2 finishes before continuing
FIFO behavior: Data flows in order through the pipe
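
Problem 4 asks the parent to continue only after both children have terminated. One way to express that without tracking each PID is to call wait() until no children remain. The helper below is a sketch under that assumption; the name reap_all_children is hypothetical.

```c
#include <errno.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Waits until no children remain. Returns how many were reaped,
 * or -1 on an unexpected error. */
int reap_all_children(void) {
    int reaped = 0;
    for (;;) {
        pid_t pid = wait(NULL);
        if (pid > 0) {
            reaped++;
        } else if (errno == EINTR) {
            continue;                 /* interrupted by a signal: retry */
        } else if (errno == ECHILD) {
            return reaped;            /* nothing left to wait for */
        } else {
            return -1;
        }
    }
}
```

In a program shaped like the one above, the parent would call this after closing its pipe descriptors and before printing its final message.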

Problem 5: Sending Data Between Processes

Implement a program that creates a child process and sends the message “Hello\n” to the child process over a pipe. The child must read the message coming from the pipe, print it out and then terminate. Make sure parent P waits for the child to terminate before printing “Parent done…\n” and terminating itself.

You may assume the existence of a system call pipe(int fd[2]) that creates a pipe and returns 2 descriptors. fd[0] will be the read end of the pipe, fd[1] will be the write end of the pipe. You can use waitpid(int pid) system call to wait for the termination of a process with “pid”. Use fork() system call to create a child process.

Instructor Solution

#include <unistd.h>
#include <stdio.h>
#include <stdlib.h>
#include <sys/wait.h>

int main(int argc, char *argv[]) {
    int fd[2];
    pipe(fd); // From Parent to C1
    write(fd[1], "Hello\n", 6);
    close(fd[1]);

    pid_t pid;
    if ((pid = fork()) == 0) {
        // C1
        char message[6];
        int len = read(fd[0], message, 6); // Read hello
        write(1, message, len); // Send hello to the screen
        exit(0);
    } // end-if

    close(fd[0]);
    waitpid(pid, NULL, 0);
    printf("Parent done…\n");
    return 0;
} /* end-main */

Elaboration (by LLM)

This program demonstrates a key communication pattern: parent writes first, then child reads.

Execution Order Differences:

Unlike Problem 4, the parent writes before creating the child:

pipe(fd);                    // Create pipe
write(fd[1], "Hello\n", 6); // Parent writes immediately
close(fd[1]);                // Parent closes write end
fork();                      // Now create child

Why Write Before Fork?

The data is already in the kernel’s pipe buffer:

After write():
Pipe buffer: [H][e][l][l][o][\n]
             ^data is here

After fork():
Child inherits fd[0] (read end)
Child can immediately read buffered data

Timing Guarantee:

Child doesn't need to wait for parent to write
(data is already there)

vs. Problem 4:
C2 must wait for C1's printf() to complete
(data arrives asynchronously)

Execution Timeline:

Time 0:  Parent creates pipe

Time 1:  Parent writes "Hello\n" to pipe buffer
         Pipe buffer now contains: "Hello\n"

Time 2:  Parent closes pipe write end (fd[1])
         This signals "no more data coming"

Time 3:  Parent creates child via fork()
         Child inherits fd[0] (pipe read end)
         Child immediately has access to buffered data

Time 4:  Parent closes fd[0] (local copy)
         Parent doesn't need to read

Time 5:  Child reads 6 bytes from pipe
         read(fd[0], message, 6) → gets "Hello\n"
         Data was already buffered, no blocking

Time 6:  Child writes to stdout
         Console: "Hello"

Time 7:  Child exits

Time 8:  Parent's waitpid() returns
         Parent: printf("Parent done…\n")

Key Differences from Problem 4:

Aspect	Problem 4 (Pipe Between Siblings)	Problem 5 (Parent→Child)
Who writes?	C1 (child process)	Parent (before fork)
Who reads?	C2 (child process)	C1 (child process)
Data availability	Written by C1 while C2 may already be waiting	Written before the child exists
Timing guarantee	Child may block waiting for data	Data always available
Pipe buffer use	Continuous flow	Accumulated then read

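Why Close Write End Before Fork?

This is good practice, although this particular program would still work without it:

close(fd[1]);  // Close before fork so the child never inherits the write end
fork();

Reason:

If the parent doesn't close fd[1] before fork(), the child inherits a copy of it
In this program the child's single read() still returns the 6 buffered bytes immediately
A child that kept reading until EOF, however, would block
EOF only arrives when ALL write ends are closed
The child's own inherited copy of fd[1] would count as an open write end
Such a child would wait forever unless it closed that copy itself

With proper close:

After parent closes fd[1]:
- Open write end references: ZERO
- Child's read() returns the buffered data; any later read() returns EOF (0)
- Child never has to close a write end it does not use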

Memory Diagram:

Parent:
  Pipe creation:
  fd[2] = {read_end, write_end}

  After fork():
  ┌─ Parent ────────────────┐
  │ fd[0] → (closed)        │
  │ fd[1] → (closed)        │
  │ Other descriptors: 0,1,2│
  └─────────────────────────┘

  ┌─ Child ─────────────────┐
  │ fd[0] → read pipe       │
  │ fd[1] → (closed pre-fork)│
  │ Other descriptors: 0,1,2│
  └─────────────────────────┘

Output:

Hello
Parent done…

The order is guaranteed because parent waits for child to finish via waitpid().

Problem 6: Multi-Stage Process Communication

Consider implementing a program P that creates two child processes, C1 and C2 with the following constraints: P creates 2 children and sends “Hello\n” to C1, which receives this message and sends it over to C2, which simply prints it on the screen. To achieve this, you must create two pipes, one between P and C1 and another between C1 and C2.

You may assume the existence of a system call pipe(int fd[2]) that creates a pipe and returns 2 descriptors. fd[0] will be the read end of the pipe, fd[1] will be the write end of the pipe. Make sure parent P waits for all children to terminate before printing “Parent done…\n” and terminating itself. You can use waitpid(int pid) system call to wait for the termination of a process with “pid”. Use fork() system call to create a child process.

Instructor Solution

#include <unistd.h>
#include <stdio.h>
#include <stdlib.h>
#include <sys/wait.h>

int main(int argc, char *argv[]) {
    int fd1[2];
    pipe(fd1); // From Parent to C1
    write(fd1[1], "Hello\n", 6);
    close(fd1[1]);

    int fd2[2];
    pipe(fd2); // From C1 to C2

    if (fork() == 0) {
        // C1
        close(fd2[0]);
        char message[6];
        int len = read(fd1[0], message, 6); // Read hello
        write(fd2[1], message, len); // Send hello to C2
        exit(0);
    } // end-if

    close(fd1[0]);
    close(fd2[1]);

    pid_t pid;
    if ((pid = fork()) == 0) {
        // C2
        char message[6];
        int len = read(fd2[0], message, 6); // Read hello
        write(1, message, len);
        exit(0);
    } // end-if

    close(fd2[0]);
    waitpid(pid, NULL, 0); // Wait for C2
    wait(NULL);            // Reap C1 too, so all children have terminated
    printf("Parent done…\n");
    return 0;
} /* end-main */

Elaboration (by LLM)

This is a multi-stage pipeline where data flows through two pipes: Parent → C1 → C2 → stdout.

Pipeline Architecture:

Parent                C1                  C2                 Console
  |                   |                   |                    |
  |-- Write "Hello"----|→ (fd1 pipe) →|   |                    |
  |                   |   |→ Read     |   |                    |
  |                   |   |→ Process  |   |                    |
  |                   |   |→ Write ---|→ (fd2 pipe) →|        |
  |                   |   |           |   |→ Read    |        |
  |                   |   |           |   |→ Write --|→ Hello |
  |--- Wait C2 done --|---|-----------|---|---------|        |
  |                   |   |           |   |           |        |
  |-- Done message    |   |           |   |           |        |
  |                   |   |           |   |           |        |

Resource Management: Critical Detail

Proper file descriptor cleanup is essential:

int fd1[2], fd2[2];
pipe(fd1);  // Create parent→C1 pipe
pipe(fd2);  // Create C1→C2 pipe

// In C1 (after the first fork()):
close(fd2[0]);  // C1 won't read from fd2 (only writes)

// In the parent, after the first fork() and before the second:
close(fd1[0]);  // Parent won't read from fd1
close(fd2[1]);  // Parent won't write to fd2, and C2 must not inherit it

// In the parent, after the second fork() creates C2:
close(fd2[0]);  // Parent won't read fd2

Why These Closes Matter:

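If the parent kept fd2[1] open:
- C2 would inherit a copy of the write end at the second fork()
- If C1 exited without writing, C2's read(fd2[0], ...) would never see EOF
- EOF arrives only when every copy of fd2[1] is closed
- The parent only waits, so C2 would block while the parent waits for C2
- DEADLOCK!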

Proper closing ensures EOF signals propagate correctly

Execution Flow (one possible interleaving; the parent and its children run concurrently):

Time 0: Parent creates fd1, writes "Hello\n", closes fd1[1]
        Parent creates fd2

Time 1: Parent forks C1
        C1 inherits: fd1[0] (read from parent)
                    fd2[1] (write to C2)
        Parent closes fd1[0] (won't read)
        Parent closes fd2[1] (won't write)

Time 2: C1 reads from fd1[0]
        → Gets "Hello\n" (buffered from parent)

Time 3: C1 writes to fd2[1]
        → Data goes to C2 via pipe

Time 4: C1 exits
        Closes fd1[0], fd2[1]

Time 5: Parent forks C2
        C2 inherits fd2[0] (read from C1)
        Parent closes fd2[0] (won't read)

Time 6: C2 reads from fd2[0]
        → Gets "Hello\n" (written by C1)

Time 7: C2 writes to stdout (fd=1)
        Console shows: "Hello"

Time 8: C2 exits

Time 9: Parent's waitpid() returns
        Parent prints "Parent done…\n"

Data Flow Sequencing:

┌─ Parent ──────────────────────┐
│ Writes "Hello" to fd1[1]      │
│ Closes fd1[1]                 │
│ (now EOF will be signaled)     │
└───────────────────┬────────────┘
                    │
                    ↓ (fd1 pipe)
        ┌─ C1 ──────────────────┐
        │ Reads from fd1[0]     │
        │ (gets "Hello")        │
        │ Writes to fd2[1]      │
        │ (sends "Hello")       │
        └────────┬──────────────┘
                 │
                 ↓ (fd2 pipe)
             ┌─ C2 ──────────────────┐
             │ Reads from fd2[0]     │
             │ (gets "Hello")        │
             │ Writes to stdout      │
             │ Displays "Hello"      │
             └────────────────────────┘

File Descriptor Table Evolution:

Initially (Parent):
  fd1[0], fd1[1], fd2[0], fd2[1], + standard 0,1,2

After the first fork() and the parent's closes:
  C1 holds fd1[0] (read) and fd2[1] (write)
  Parent holds only fd2[0], which C2 inherits at the second fork()

After C1 closes fd2[0]:
  C1 can't read fd2 (which is fine, C1 only writes)
  Ensures C1 doesn't accidentally block on reading

After C2 starts:
  C2 has fd2[0] (read)
  Parent has nothing (closed all pipe ends)

Synchronization Points:

1. Parent writes to fd1, closes write end
   → C1 can read immediately (data buffered)

2. C1 reads, writes to fd2
   → C2 can read C1's output

3. Parent waits for C2 (waitpid)
   → Ensures C2 finishes before "Parent done" prints
   → Guarantees output order

Output Order Guarantee:

Hello
Parent done…

The “Hello” always comes first because:

C2 must complete before waitpid() returns
Only after waitpid() does parent print “Parent done”

Key Learning:

Multi-stage pipelines require careful descriptor management. Each process must:

Close descriptors it won’t use
Ensure proper EOF signaling
Coordinate through parent’s synchronization
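
A middle stage such as C1 is essentially a relay. The solution above forwards a single fixed-size message; a more general stage would copy until EOF. The following is a hedged sketch of such a loop, assuming a POSIX system; relay_until_eof is a hypothetical name, and the 4096-byte buffer is an arbitrary choice.

```c
#include <errno.h>
#include <sys/types.h>
#include <unistd.h>

/* Copies bytes from in_fd to out_fd until EOF on in_fd.
 * Returns the total number of bytes copied, or -1 on error. */
ssize_t relay_until_eof(int in_fd, int out_fd) {
    char buf[4096];
    ssize_t total = 0;

    for (;;) {
        ssize_t n = read(in_fd, buf, sizeof buf);
        if (n == 0)
            return total;             /* every upstream write end is closed */
        if (n < 0) {
            if (errno == EINTR)
                continue;             /* interrupted: retry the read */
            return -1;
        }
        for (ssize_t off = 0; off < n; ) {     /* write() may be partial */
            ssize_t w = write(out_fd, buf + off, (size_t)(n - off));
            if (w < 0) {
                if (errno == EINTR)
                    continue;
                return -1;
            }
            off += w;
        }
        total += n;
    }
}
```

This loop terminates only if every copy of the upstream write end is eventually closed, which is exactly why the descriptor hygiene listed above matters once a stage reads until EOF.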

Problem 7: Executing Multiple Commands in Sequence

Write a simple code that forks two child processes “/bin/ls” and “/bin/wc” and connects the standard output of “/bin/ls” to the standard input of “/bin/wc” using a pipe. Your program then waits until both processes terminate and finally prints “My program exiting…” on the screen.

You may assume the existence of a system call pipe(int fd[2]) that creates a pipe and returns 2 descriptors. fd[0] will be the read end of the pipe, fd[1] will be the write end of the pipe. Also assume the existence of another system call dup2(int fd1, int fd2), which makes fd2 same as fd1, i.e., copies the contents of fd1 over fd2. You can use waitpid(int pid) system call to wait for the termination of a process with “pid”. Use fork() system call to create a child process.

Instructor Solution

#include <unistd.h>
#include <stdio.h>
#include <sys/wait.h>

/* /bin/ls | /bin/wc */
int main(int argc, char *argv[]) {
    int fd[2];
    pipe(fd);

    if (fork() == 0) {
        // fork a child and load /bin/ls
        dup2(fd[1], 1);
        close(fd[0]);
        close(fd[1]);
        execl("/bin/ls", "ls", (char *)NULL);
        _exit(1); // Reached only if execl() fails
    }

    close(fd[1]);
    pid_t pid;

    if ((pid = fork()) == 0) {
        // fork a child and load /bin/wc
        dup2(fd[0], 0);
        close(fd[0]);
        execl("/bin/wc", "wc", (char *)NULL);
        _exit(1); // Reached only if execl() fails
    } // end-if

    close(fd[0]);
    waitpid(pid, NULL, 0); // Wait for wc
    wait(NULL);            // Reap ls too, so both children have terminated
    printf("My program exiting...\n");
    return 0;
} /* end-main */

Elaboration (by LLM)

This program demonstrates the classic shell pipeline implementation: ls | wc.

How Unix Shells Implement Pipelines:

When you type ls | wc, the shell does roughly the following:

$ ls | wc

# Shell internally does (roughly):
pipe(fds)
if fork() == 0:
    # Child 1: ls
    dup2(pipe_write_end, stdout)
    exec("/bin/ls")
if fork() == 0:
    # Child 2: wc
    dup2(pipe_read_end, stdin)
    exec("/bin/wc")
# Shell itself: close both pipe ends, then wait for the children

The C program above follows the same pattern.

Program Architecture:

Parent (Shell Simulator)
  |
  |→ fork() → Child 1: /bin/ls (stdout redirected to pipe)
  |→ fork() → Child 2: /bin/wc (stdin redirected from pipe)
  |→ wait for Child 2
  |→ exit

Step-by-Step Execution:

Step 1: Create Pipe

int fd[2];
pipe(fd);
// fd[0] = read end
// fd[1] = write end

Step 2: Fork Child 1 (ls)

if (fork() == 0) {
    // In child
    dup2(fd[1], 1);    // stdout → pipe write end
    close(fd[0]);      // Don't need to read from pipe
    close(fd[1]);      // Close original descriptor
    execl("/bin/ls", "ls", NULL);  // Replace with /bin/ls
}

After dup2() and execl():

/bin/ls runs
Its stdout (fd=1) points to the pipe
Directory listing goes into pipe buffer

Step 3: Parent Closes Write End

close(fd[1]);  // Parent won't write

Critical: If the parent doesn't close fd[1], the pipe always has an open write end, so wc never sees EOF and never finishes.

Step 4: Fork Child 2 (wc)

if ((pid = fork()) == 0) {
    dup2(fd[0], 0);    // stdin ← pipe read end
    close(fd[0]);      // Close original descriptor
    execl("/bin/wc", "wc", NULL);  // Replace with /bin/wc
}

After dup2() and execl():

/bin/wc runs
Its stdin (fd=0) points to the pipe
Reads directory listing from Child 1
Counts lines, words, characters
Outputs result to stdout (console)

Step 5: Parent Waits and Exits

close(fd[0]);          // Parent also won't read
waitpid(pid, NULL, 0); // Wait for wc to finish
printf("My program exiting...\n");

Data Flow Timeline:

Time 0:    Parent creates pipe
           Parent forks Child 1 (ls)

Time 1:    Child 1 redirects stdout to pipe
           Parent closes write end of pipe
           Parent forks Child 2 (wc)

Time 2:    Child 1 executes /bin/ls
           /bin/ls queries directory
           /bin/ls writes to stdout (actually pipe)
           Directory entries flow into pipe buffer

Time 3:    Child 2 executes /bin/wc
           /bin/wc reads from stdin (actually pipe)
           /bin/wc counts lines, words, chars
           /bin/wc writes results to stdout (console)

Time 4:    Both children complete
           Parent's waitpid() returns
           Parent prints "My program exiting..."

Sample Output:

$ ./pipeline
     12      24     256
My program exiting...

The first line is from wc counting ls output (the numbers shown are illustrative and depend on the directory contents). The second line is from the parent process.

Key Differences from Manual Data Passing:

Aspect	Problem 6 (Manual Messages)	Problem 7 (Real Commands)
Data Format	Fixed message sizes	Stream of data
Processing	Custom C code	Existing Unix utilities
Flexibility	Limited to program logic	Can chain any commands
Real-World	Uncommon in practice	Used constantly in shell

Buffering Behavior:

The pipe buffers data between ls and wc:

/bin/ls writes:        /bin/wc reads:
              ┌───┐
file1\n ────→ │ F │
file2\n ────→ │ I │ ──→ Counts
file3\n ────→ │ L │
              │ E │
              │ S │
              └───┘

Pipe acts as a FIFO buffer, allowing ls and wc to run at different speeds without coordination.

Real-World Usage:

This is how shell commands work:

$ cat huge_file.txt | grep pattern | sort | uniq | wc

# Internally:
fork() → /bin/cat
  pipe() → dup2 to /bin/grep
    pipe() → dup2 to /bin/sort
      pipe() → dup2 to /bin/uniq
        pipe() → dup2 to /bin/wc
          wc outputs to console

Each program is independent, but they’re seamlessly connected via pipes.

Critical Implementation Details:

dup2() before execl(): Must redirect before execution
Close unused ends: Essential for EOF signaling
Parent’s role: Create infrastructure, then get out of the way
Synchronization: Only parent needs to know when wc finishes

Problem 8: File Input to Command Pipeline

Assume you are asked to implement a program to count the number of characters, words and lines in a text file named “file.txt”. You know that there is a Unix system utility program called wordcount located at “/bin/wc” and you would like to make use of its services as follows: You want to create a child process that will execute “/bin/wc” and have the child receive the input data from the parent over a pipe. The parent process (your program) will read in the file “file.txt” one character at a time and send the character to “/bin/wc” over the pipe.

Implement the above-described program. Recall that by default “/bin/wc” reads its data from the standard input. So after your program creates a pipe and a child process, it must manipulate the child’s standard input descriptor to point to the read-end of the pipe using dup2(int fd1, int fd2) system call, which makes fd2 same as fd1, i.e., copies the contents of fd1 over fd2. Recall that you create a new child using fork(), load a new executable using execv(char *execFilename, char *args[]), and create a pipe using pipe(int fd[2]), which creates a pipe where fd[0] will be the read end of the pipe, fd[1] will be the write end of the pipe.

Instructor Solution

#include <unistd.h>
#include <stdio.h>
#include <fcntl.h> // For open()

int main(int argc, char *argv[]) {
    int fd[2];
    pipe(fd);

    if (fork() == 0) {
        dup2(fd[0], 0);
        close(fd[0]);
        close(fd[1]);
        // Child. Execl "/bin/wc < pipe"
        execl("/bin/wc", "wc", NULL);
    } // end-if

    close(fd[0]);
    int fd2 = open("file.txt", O_RDONLY);
    char ch;

    while (read(fd2, &ch, 1) > 0)
        write(fd[1], &ch, 1);

    close(fd2);
    close(fd[1]);
    return 0;
} /* end-main */

Elaboration (by LLM)

This program demonstrates feeding file contents to a command via a pipe, implementing wc < file.txt without shell redirection.

Program Architecture:

file.txt
  ↓ (read by parent)
  ↓
Parent process
  ├─ Reads file.txt byte-by-byte
  └─ Writes each byte to pipe
              ↓
              ↓ (pipe)
              ↓
          /bin/wc
            ↓
          stdin
            ↓
        Count lines, words, chars
            ↓
          stdout (console)

Key Difference from Problem 7:

Aspect	Problem 7 (ls | wc)	Problem 8 (file → wc)
Source	Another command (ls)	A file (file.txt)
Parent	Creates infrastructure, waits	Actively feeds data
I/O Method	Pipe only	File + Pipe
Command	Implicit (ls is source)	Explicit (wc is sink)

Execution Flow:

Setup Phase:

int fd[2];
pipe(fd);  // Create pipe: fd[0]=read, fd[1]=write

if (fork() == 0) {
    // Child process: becomes /bin/wc
    dup2(fd[0], 0);   // stdin ← pipe read end
    close(fd[0]);     // Close the original; fd 0 now refers to the pipe
    close(fd[1]);     // Close write end (child won't write)
    execl("/bin/wc", "wc", NULL);
    // Reached only if execl() fails; on success /bin/wc has replaced this code
}

Parent Phase (Data Feeding):

close(fd[0]);  // Parent won't read from pipe
int fd2 = open("file.txt", O_RDONLY);  // Open input file
char ch;

while (read(fd2, &ch, 1) > 0)   // Read one byte at a time
    write(fd[1], &ch, 1);        // Write to pipe

close(fd2);
close(fd[1]);  // Close write end → signals EOF to wc

Why Read Byte-by-Byte?

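Reading one byte at a time is inefficient, but it has three things going for it:

Simplicity: Demonstrates the concept clearly
Correctness: wc counts the same bytes however they are chunked; the cost is two system calls (one read(), one write()) per byte, which becomes noticeable on large files
Educational: Shows that pipes work with any granularity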

More Efficient Version:

char buffer[4096];
int bytes;
while ((bytes = read(fd2, buffer, sizeof(buffer))) > 0)
    write(fd[1], buffer, bytes);

But the solution provided is correct and pedagogically valuable.
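The buffered loop can be taken one step further. The sketch below is one possible way to copy a descriptor to another while handling short writes and interrupted calls; the names `write_all` and `copy_fd` are hypothetical helpers introduced here for illustration, not part of the instructor's solution.

```c
#include <errno.h>
#include <unistd.h>

/* Write exactly len bytes, retrying on short writes and EINTR.
 * Returns 0 on success, -1 on error. */
static int write_all(int fd, const char *buf, size_t len) {
    while (len > 0) {
        ssize_t n = write(fd, buf, len);
        if (n < 0) {
            if (errno == EINTR) continue;   /* interrupted: retry */
            return -1;
        }
        buf += n;
        len -= (size_t)n;
    }
    return 0;
}

/* Copy everything from in_fd to out_fd until EOF.
 * Returns the number of bytes copied, or -1 on error. */
long copy_fd(int in_fd, int out_fd) {
    char buffer[4096];
    long total = 0;
    for (;;) {
        ssize_t n = read(in_fd, buffer, sizeof buffer);
        if (n == 0) break;                  /* EOF on the input */
        if (n < 0) {
            if (errno == EINTR) continue;
            return -1;
        }
        if (write_all(out_fd, buffer, (size_t)n) < 0) return -1;
        total += n;
    }
    return total;
}
```

In the program above, the parent's read/write loop could then be replaced by a call such as copy_fd(fd2, fd[1]), followed by the same two close() calls.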

Data Flow:

file.txt contents: "line1\nline2\nline3\n"

Parent reads:     'l' → writes to pipe
                  'i' → writes to pipe
                  'n' → writes to pipe
                  'e' → writes to pipe
                  '1' → writes to pipe
                  '\n' → writes to pipe
                  ... (continues)

Pipe buffer accumulates data:
  ┌──────────────────────┐
  │ line1                │
  │ line2                │  ← /bin/wc reads from here
  │ line3                │
  └──────────────────────┘

/bin/wc reads entire buffer, counts:
  3 lines
  3 words
  18 characters

Output: 3 3 18

Timing & Buffering:

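wc does not wait for the whole file before it starts reading. Its read() on stdin returns as soon as any bytes are in the pipe, and it keeps running totals as data arrives:

Parent:  write 1 byte → wc's read() can return that byte and update its counts
Parent:  write 1 byte → wc reads again
...
Parent:  close write end → wc's read() returns 0 (EOF), wc prints the totals

The pipe's kernel buffer decouples the two processes: the parent can run ahead of wc until the buffer fills, and wc blocks only while the pipe is empty.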

File Descriptor States:

Before fork() (parent):

fd 0: stdin
fd 1: stdout
fd 2: stderr
fd[0]: pipe read
fd[1]: pipe write

After fork(), once both sides have finished their setup:

Parent:          Child (becomes /bin/wc):
fd 0: stdin      fd 0: pipe read (redirected)
fd 1: stdout     fd 1: stdout
fd 2: stderr     fd 2: stderr
fd[0]: closed    fd[0]: closed (already dup2'd)
fd[1]: write     fd[1]: closed
fd2: file read   (no fd2: the parent opens file.txt after the fork)

Parent Data Transfer Loop:

while (read(fd2, &ch, 1) > 0)  // Returns 0 at EOF
    write(fd[1], &ch, 1);

// After loop exits (file EOF reached):
close(fd2);
close(fd[1]);  // CRITICAL: signals EOF to wc

Why closing fd[1] is critical:

When parent closes write end
/bin/wc’s read() sees EOF
wc knows no more data is coming
wc proceeds to output results

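Without the close:

/bin/wc reads from stdin
Stdin is redirected to the pipe's read end
The parent still holds fd[1] open after its loop finishes
read() reports EOF only when every descriptor for the write end has been closed
wc blocks waiting for more data for as long as the parent stays alive

The same rule is why the child closes its own copy of fd[1] before exec: otherwise wc itself would hold the write end open and could never see EOF.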
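The EOF rule can be observed directly with a non-blocking read end. The following is a small sketch, assuming a Linux/POSIX system; `probe_pipe`, `set_nonblocking`, and the `pipe_state` enum are hypothetical names used only for this illustration.

```c
#include <errno.h>
#include <fcntl.h>
#include <unistd.h>

enum pipe_state {
    PIPE_ERROR = -1,
    PIPE_EOF = 0,          /* read() returned 0: no writers left */
    PIPE_HAS_DATA = 1,     /* one byte was read */
    PIPE_WOULD_BLOCK = 2   /* empty, but some write end is still open */
};

/* Switch a descriptor to non-blocking mode; returns 0 on success. */
int set_nonblocking(int fd) {
    int flags = fcntl(fd, F_GETFL);
    if (flags < 0) return -1;
    return fcntl(fd, F_SETFL, flags | O_NONBLOCK) < 0 ? -1 : 0;
}

/* Try to read one byte from a non-blocking read end and classify the result.
 * Note: this consumes the byte when data is present. */
enum pipe_state probe_pipe(int rfd) {
    char c;
    ssize_t n = read(rfd, &c, 1);
    if (n == 1) return PIPE_HAS_DATA;
    if (n == 0) return PIPE_EOF;
    if (errno == EAGAIN || errno == EWOULDBLOCK) return PIPE_WOULD_BLOCK;
    return PIPE_ERROR;
}
```

With an empty pipe, probe_pipe() reports PIPE_WOULD_BLOCK while any copy of the write end is open (including one made with dup()), and PIPE_EOF only after the last copy has been closed.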

Full Program Flow:

Time 0:   Main creates pipe
          Main forks child
          Child redirects stdin ← pipe
          Child executes /bin/wc

Time 1:   Main closes pipe read end
          Main opens file.txt

Time 2:   Main reads first byte from file
          Main writes byte to pipe
          /bin/wc's stdin receives data

Time 3:   Main continues reading file
          Multiple bytes accumulate in pipe buffer
          /bin/wc can now start reading and processing

Time 4:   Main finishes reading file
          Main closes file descriptor
          Main closes pipe write end → SIGNALS EOF

Time 5:   /bin/wc's read() returns 0 (EOF)
          /bin/wc completes its count
          /bin/wc outputs: "3 3 18\n"
          /bin/wc exits

Time 6:   Main returns 0
          (Main never calls waitpid(), so it may exit before wc prints;
          times 5 and 6 can occur in either order)

Expected Output:

Assuming file.txt contains 3 lines:

3 3 18

Three lines, three words, eighteen characters total.

Key Learning Points:

Parent as Data Source: Parent can feed any data to child via pipe
File to Command: Demonstrates how < redirection works internally
Byte-by-Byte Transfer: Inefficient but correct and educational
EOF Signaling: Closing write end is essential for child to know when data ends
Pipe Buffering: Allows asynchronous communication between parent and child

Real-World Equivalent:

$ wc < file.txt

# This C program implements this command line

Problem 9: File-Based Counter with Reformatted Output

Assume that you are asked to implement a program (Counter.c) that takes 1 argument, a filename, and counts the number of characters, words and lines in the file and prints them on the screen on separate lines. A sample output of your program is given below:

bash% Counter a.txt
Characters: 10234
Words: 1353
Lines: 286
bash%

Instead of implementing the program yourself, you decide to make use of the existing system utility /bin/wc, which counts the number of characters, words and lines fed in at its standard input and prints them out on the screen all on the same line. A sample output of /bin/wc on the same input file a.txt is given below (notice the input redirection):

bash% /bin/wc < a.txt
286 1353 10234
bash%

Implement this program as follows: Your program must create a child process to run “/bin/wc”. You must manipulate the child process’s file descriptors so that “/bin/wc” reads its input from the file passed to your program as the first argument at the command line. Your program (the parent) and the child “/bin/wc” will communicate using a pipe. That is, the child process will send out its result over the pipe to your program. Your program will then read the message from the pipe and print out the result on the screen as required.

Recall that you create a new child using fork(), start a new executable using execv(char *execFilename, char *args[]), create a pipe using pipe(int fd[2]) and copy the contents of fd1 to the contents of fd2 using dup2(int fd1, int fd2) system calls.

Instructor Solution

#include <unistd.h>
#include <stdio.h>
#include <fcntl.h> // For open()

int main(int argc, char *argv[]) {
    int fd[2];
    pipe(fd);

    if (fork() == 0) {
        dup2(fd[1], 1);
        close(fd[0]);
        close(fd[1]);
        int fd = open("file.txt", O_RDONLY);
        dup2(fd, 0);
        close(fd);
        // Child. Execl "/bin/wc < file.txt > pipe"
        execl("/bin/wc", "wc", NULL);
    } // end-if

    close(fd[1]);
    dup2(fd[0], 0);
    int numLines, numWords, numChars;
    scanf("%d %d %d", &numLines, &numWords, &numChars);
    printf("Characters: %d\n", numChars);
    printf("Words: %d\n", numWords);
    printf("Lines: %d\n", numLines);
    return 0;
} /* end-main */

Elaboration (by LLM)

This is the most involved pattern so far: the child reads the file directly, and the parent captures the child's output through a pipe and reformats it. The pipe carries data in one direction only, from child to parent.

Comparison with Previous Problems:

Problem	Source	Sink	Parent Role
7	ls	wc	Passive observer
8	file (read by parent)	wc	Data feeder
9	file (read by wc)	parent	Output parser

Program Architecture:

Parent Process
  │
  ├─ Create pipe
  │
  ├─ Fork child
  │  │
  │  └─ Child:
  │     ├─ Redirect stdin ← file.txt
  │     ├─ Redirect stdout → pipe
  │     └─ Execute /bin/wc
  │        └─ wc outputs: "<lines> <words> <chars>\n"
  │
  ├─ Close write end
  │
  ├─ Redirect parent's stdin ← pipe
  │
  ├─ Parse output with scanf()
  │
  └─ Print formatted output

Key Insight: Two Redirections in Child

The child performs TWO dup2() operations:

dup2(fd[1], 1);          // stdout → pipe (output redirect)
int fd = open("file.txt", O_RDONLY);
dup2(fd, 0);             // stdin ← file (input redirect)

This effectively executes: /bin/wc < file.txt > pipe

Execution Steps:

Step 1: Child Setup

if (fork() == 0) {
    // Child process
    dup2(fd[1], 1);      // Connect stdout to pipe write end
    close(fd[0]);        // Won't read from pipe
    close(fd[1]);        // Close duplicate

    int fd = open("file.txt", O_RDONLY);
    dup2(fd, 0);         // Connect stdin to file
    close(fd);           // Close duplicate

    execl("/bin/wc", "wc", NULL);
}

File Descriptor Mapping (Child at exec time):

Before exec:     After exec (inside /bin/wc):
fd 0: file       fd 0: file (stdin for wc)
fd 1: pipe       fd 1: pipe (stdout for wc)
fd 2: stderr     fd 2: stderr

Step 2: Parent Waits for Data

close(fd[1]);         // Parent won't write
dup2(fd[0], 0);       // Parent's stdin ← pipe
int numLines, numWords, numChars;
scanf("%d %d %d", &numLines, &numWords, &numChars);

The parent redirects its own stdin to read from the pipe!

Why Redirect Parent’s stdin?

Alternatively:

// Without dup2:
char buffer[256];
ssize_t n = read(fd[0], buffer, sizeof(buffer) - 1);
if (n > 0) {
    buffer[n] = '\0';  // read() does not NUL-terminate the data
    sscanf(buffer, "%d %d %d", &numLines, &numWords, &numChars);
}

// With dup2:
dup2(fd[0], 0);
scanf("%d %d %d", &numLines, &numWords, &numChars);

Both work, but dup2() is cleaner and matches the elegance of Unix redirection.

Data Flow:

file.txt:
  line1
  line2
  line3

  ↓ (child reads via stdin)

/bin/wc processes:
  Counts: 3 lines, 3 words, 18 chars
  Outputs: "   3       3      18\n"

  ↓ (written to pipe via stdout)

Pipe buffer:
  "   3       3      18\n"

  ↓ (parent reads via stdin via dup2)

Parent scanf():
  Parses: numLines=3, numWords=3, numChars=18

  ↓ (reformats)

Parent printf():
  Characters: 18
  Words: 3
  Lines: 3

Data Format Transformation:

From wc (standard format):
  3 3 18
  ^lines ^words ^chars (order: lines, words, chars)

To our format:
  Characters: 18
  Words: 3
  Lines: 3
  (order: chars, words, lines + labels)

Timing & Synchronization:

Time 0:   Parent creates pipe, forks child

Time 1:   Child redirects stdin ← file.txt
          Child redirects stdout → pipe
          Child executes /bin/wc

Time 2:   /bin/wc reads from file
          /bin/wc counts contents
          /bin/wc outputs to pipe

Time 3:   Parent closes pipe write end
          Parent redirects stdin ← pipe
          Parent calls scanf()
          scanf() blocks until data available

Time 4:   /bin/wc finishes, outputs counts
          Pipe receives: "   3       3      18\n"

Time 5:   scanf() reads from stdin (pipe)
          Parses three integers
          scanf() returns

Time 6:   Parent printf() outputs formatted results
          Console shows:
          Characters: 18
          Words: 3
          Lines: 3

Time 7:   Parent returns 0
          Program ends

Example Run:

Assuming file.txt contains:

hello world
this is test
final line

Actual wc output:

$ wc < file.txt
  3  7 36

Program output:

Characters: 36
Words: 7
Lines: 3

Why This Pattern Matters:

This demonstrates:

Output Capture: Parent launches the child and reads its output through a pipe
Data Parsing: Converting raw command output to structured data
Formatting: Transforming one output format to another
File Descriptors: Multiple redirections in a single process

Real-World Use Cases:

# Parent might count files and display nicely:
$ find . -type f | wc -l

# Or parse system info:
$ df | awk ...

# Or monitor logs:
$ tail -f logfile | grep ERROR | wc -l

Key Differences from Simple Redirection:

Aspect	Shell |	This Program
Who parses	Shell (simple)	Application (complex)
Output use	Human readable	Programmatic
Data flow	Linear	Parent ← Child ← File
Complexity	High	Moderate

Important Details:

dup2(fd[0], 0): Parent now reads from pipe via stdin
scanf(): Parses integers from parent’s stdin (the pipe)
printf(): Parent outputs formatted results
No waitpid(): scanf() blocks until the child's output arrives, so the result is correct without it, but the child is never reaped

For robustness, save the child's pid and call waitpid() at the end:

pid_t pid = fork();  // Should save this
if (pid == 0) { ... }
// Later:
waitpid(pid, NULL, 0);  // Ensure child is reaped

Problem 10: Pipe and Shared Memory Output Analysis

What would be the output of the following code segments (a) and (b) when run in a Linux system. Explain briefly.

Code Segment (a):

main() {
    pid_t pid;
    int fds[2];
    int l = 6;
    write(1, "Main entered\n");
    pipe(fds);

    if ((pid = fork()) == 0) {
        int v = 0;
        read(fds[0], &v, sizeof(int));
        printf("v: <%d>\n", v);
    } else {
        write(fds[1], &l, sizeof(int));
        waitpid(pid);
        printf("Main done\n");
    } // end-else

} // end-main

Instructor Solution

Expected Output:

Main entered
v: <6>
Main done

Elaboration (by LLM)

This demonstrates inter-process communication via pipes, showing how a parent writes data to a child process.

Pipe Creation and Fork Timing:

The pipe is created before the fork, so both parent and child inherit file descriptors to the same pipe:

fds[0]: read end of pipe
fds[1]: write end of pipe

Both processes start with open references to both ends.

Process Communication Flow:

Parent writes integer 6 to pipe via fds[1]
Child waits (blocks) on read() from fds[0] until data arrives
When parent writes, child’s read() unblocks with the value 6
Parent calls waitpid() to ensure child completes before printing “Main done”

Execution Timeline:

Main process:        Child process:
  write(1, ...)
  pipe(fds)
  fork()
    |                  int v = 0
    |                  read(fds[0], ...) → BLOCKS
  write(fds[1], ...)
    |                  ← unblocks, v = 6
    |                  printf("v: <6>")
  waitpid(pid)       exits
    |
  printf("Main done")

Why This Works:

The pipe persists across the fork(), maintaining the connection
read() blocks the child until data is available
The parent writes the integer (4 bytes) to the pipe
Child receives the exact same 4 bytes in the same order
Parent synchronizes with waitpid() to maintain correct output order

Common Mistakes:

Forgetting to close unused file descriptors (both parent and child have both ends open)
Not calling waitpid() (could print “Main done” before child finishes)
Assuming read() returns immediately (it blocks while the pipe is empty and a write end is still open; write() blocks only when the pipe buffer is full)
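Segment (a) is exam-style pseudocode (for example, write() is missing its length argument and waitpid() takes one argument). A compilable sketch of the same idea follows; to make the result checkable it has the child report the received value through its exit status instead of printing it, which is a change from the original. The function name `send_int_to_child` is hypothetical, and the value is assumed to fit in 0..254.

```c
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Parent writes one int into a pipe; the child reads it and exits with it.
 * Returns the child's exit status (the value received), or -1 on failure. */
int send_int_to_child(int value) {
    int fds[2];
    if (pipe(fds) < 0) return -1;

    pid_t pid = fork();
    if (pid < 0) { close(fds[0]); close(fds[1]); return -1; }

    if (pid == 0) {                                   /* child */
        int v = 0;
        close(fds[1]);                                /* child only reads */
        ssize_t n = read(fds[0], &v, sizeof v);       /* blocks until data arrives */
        _exit(n == (ssize_t)sizeof v ? (v & 0xff) : 255);
    }

    close(fds[0]);                                    /* parent only writes */
    ssize_t written = write(fds[1], &value, sizeof value);
    close(fds[1]);

    int status;
    if (waitpid(pid, &status, 0) < 0) return -1;      /* wait, as in the original */
    if (written != (ssize_t)sizeof value) return -1;
    return WIFEXITED(status) ? WEXITSTATUS(status) : -1;
}
```

Called with 6, this mirrors the original flow: the child blocks in read() until the parent's write, and the parent does not continue past waitpid() until the child has finished.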

Code Segment (b):

main() {
    char *pSm = NULL;
    int *pi;
    char *pStr;
    int *pg;

    // Allocate 1024 bytes shared memory
    pSm = GetSharedMemory(1024);
    memset(pSm, 0, 1024);
    pi = (int *)pSm;
    pg = pi + 1;
    pStr = (char *)(pg + 1);

    (*pg)++;

    if (fork() == 0) {
        while (*pi == 0);
        printf("Str: <%s>, g: %d\n", pStr, *pg);
        (*pg)++;
        strcpy(pStr, "AA");
        *pi = 0;
    } else {
        strcpy(pStr, "Message");
        (*pg)++;
        *pi = 1;
        while (*pi == 1);
        printf("Str: <%s>, g: %d\n", pStr, *pg);
    } // end-else
} // end-main

Instructor Solution

Expected Output:

Str: <Message>, g: 2
Str: <AA>, g: 3

Elaboration (by LLM)

This demonstrates synchronization and coordination between parent and child processes using shared memory and busy-waiting on a shared flag.

Shared Memory Layout:

pSm (base):     [  pi (int)  ][  pg (int)  ][  pStr (string)  ]...
Offset:         0             4             8

Variables in shared memory (assuming a 4-byte int):
  pi   → pointer to integer at offset 0   (synchronization flag)
  pg   → pointer to integer at offset 4   (counter, 1 after the pre-fork increment)
  pStr → pointer to string at offset 8    (message buffer)

Pointer Arithmetic:

pi = (int *)pSm;           // Points to offset 0
pg = pi + 1;               // Points to offset 4 (one int past pi)
pStr = (char *)(pg + 1);   // Points to offset 8 (one int past pg, then cast)
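The offsets can be checked mechanically rather than by hand. The sketch below repeats the same pointer arithmetic and reports each pointer's byte distance from the base; `struct layout` and `compute_layout` are hypothetical names, and the base pointer is assumed to be suitably aligned for int.

```c
#include <stddef.h>

struct layout {
    size_t pi_off;    /* byte offset of the flag */
    size_t pg_off;    /* byte offset of the counter */
    size_t pstr_off;  /* byte offset of the string buffer */
};

/* Apply the same arithmetic as the problem's code and measure the results. */
struct layout compute_layout(char *pSm) {
    int *pi = (int *)pSm;
    int *pg = pi + 1;               /* advances by sizeof(int) bytes */
    char *pStr = (char *)(pg + 1);  /* advances by another sizeof(int) bytes */

    struct layout l = {
        (size_t)((char *)pi - pSm),
        (size_t)((char *)pg - pSm),
        (size_t)(pStr - pSm)
    };
    return l;
}
```

On a platform where sizeof(int) is 4, the three offsets come out as 0, 4, and 8; in general they are 0, sizeof(int), and 2 * sizeof(int).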

Execution Sequence:

Before fork(): the single process runs (*pg)++ → g = 1
- memset() zeroed the region, so g goes from 0 to 1
- The memory is shared, so after fork() both processes see g = 1
Parent:
- Writes “Message” to shared string
- Increments g → g = 2
- Sets *pi = 1 (signal to child: ready)
- Busy-waits until child sets *pi = 0
Child:
- Busy-waits until *pi == 1 (parent’s signal)
- Prints: “Str: <Message>, g: 2” (sees parent’s message)
- Increments g → g = 3
- Overwrites string with “AA”
- Sets *pi = 0 (signal to parent: done)
Parent resumes: Wakes from busy-wait
- Prints: “Str: <AA>, g: 3” (sees child’s modifications)

Critical Synchronization Points:

1
// Child waits for parent
2
while (*pi == 0);         // Busy-wait for pi = 1
3

4
// Parent waits for child
5
while (*pi == 1);         // Busy-wait for pi = 0

These spin-waits on the shared flag enforce a strict order: the child doesn’t read until the parent has written, and the parent doesn’t proceed until the child has finished.

Why Shared Memory Instead of Pipes?

Allows bidirectional communication without separate pipe structures
Permits sharing complex data structures (strings, arrays, records)
Synchronization is explicit (via flag variables), not implicit
More flexible than pipes for general-purpose IPC

Potential Issues:

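Busy-waiting wastes CPU: Could use semaphores or condition variables instead
No race on g, but only because of the flag: (*pg)++ is a read-modify-write and is not atomic; it is safe here because the handshake on *pi ensures the two processes never touch g at the same time
Plain int flag: *pi is neither volatile nor atomic, so an optimizing compiler may read it once and spin forever; real code should use C11 atomics or a semaphore
Non-portable: GetSharedMemory() is not standard; typically uses shmget(), shmat(), or memory-mapped files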

Problem 11: File Redirection with fork and execv

Implement a program that forks a child which will execute “/bin/wc” as follows:

bash% /bin/wc < file.txt > out.txt

As you can see, “/bin/wc” takes its standard input from file.txt & sends its standard output to out.txt. To implement this program, you can use the following system calls: fork() creates a child process, execv(char *execFilename, char *args[]) loads an executable on top of an existing process, dup2(int fd1, int fd2) copies the contents of fd1 to the contents of fd2, open(char *filename, int mode) opens a file, where mode can be O_RDONLY, O_WRONLY. Your program must wait until “/bin/wc” terminates and then prints the message “DONE\n” before terminating. You can use waitpid(int pid) system call to wait for the termination of a process with “pid”.

Instructor Solution

#include <unistd.h>
#include <stdio.h>
#include <fcntl.h> // For open()
#include <sys/wait.h>

/* /bin/wc < file.txt > out.txt */
int main(int argc, char *argv[]) {
    pid_t pid;

    if ((pid = fork()) == 0) {
        int fd = open("file.txt", O_RDONLY);
        dup2(fd, 0);
        close(fd);
        fd = open("out.txt", O_WRONLY);
        dup2(fd, 1);
        close(fd);
        // /bin/wc < file.txt > out.txt
        execl("/bin/wc", "wc", NULL);
    } // end-if

    waitpid(pid, NULL, 0);
    printf("DONE\n");
    return 0;
} /* end-main */

Elaboration (by LLM)

This program demonstrates dual input/output redirection: simultaneously redirecting stdin from a file and stdout to a file.

Implementing Shell Redirection:

When you type wc < file.txt > out.txt, the shell:

Creates a new process for wc
Redirects stdin (fd=0) to read from file.txt
Redirects stdout (fd=1) to write to out.txt
Executes /bin/wc

This C program directly implements that behavior.

File Descriptor Redirection Sequence:

int fd = open("file.txt", O_RDONLY);
dup2(fd, 0);  // stdin ← file.txt
close(fd);

fd = open("out.txt", O_WRONLY);
dup2(fd, 1);  // stdout → out.txt
close(fd);

execl("/bin/wc", "wc", NULL);

File Descriptor State at Each Step:

Before open("file.txt"):
  fd 0: stdin (console)
  fd 1: stdout (console)
  fd 2: stderr (console)

After open("file.txt") [fd=3]:
  fd 0: stdin
  fd 1: stdout
  fd 2: stderr
  fd 3: file.txt (read)

After dup2(fd, 0) [fd=3 → 0]:
  fd 0: file.txt (now stdin reads from file)
  fd 1: stdout
  fd 2: stderr
  fd 3: file.txt (duplicate reference)

After close(fd) [close 3]:
  fd 0: file.txt
  fd 1: stdout
  fd 2: stderr

After open("out.txt", O_WRONLY) [fd=3]:
  fd 0: file.txt
  fd 1: stdout
  fd 2: stderr
  fd 3: out.txt (write)

After dup2(fd, 1) [fd=3 → 1]:
  fd 0: file.txt (stdin)
  fd 1: out.txt (now stdout writes to file)
  fd 2: stderr
  fd 3: out.txt (duplicate reference)

After close(fd) [close 3]:
  fd 0: file.txt
  fd 1: out.txt
  fd 2: stderr

Now when /bin/wc runs:

wc reads from stdin (fd=0) → reads from file.txt
wc writes to stdout (fd=1) → writes to out.txt
wc writes errors to stderr (fd=2) → still console

Data Flow:

file.txt
  ↓
/bin/wc (reads from stdin/fd=0)
  ↓
Processes content (counts lines, words, chars)
  ↓
Writes to stdout (fd=1)
  ↓
out.txt

Key Difference: Dual Redirection

Compare with input-only redirection, the file-based equivalent of what Problem 8 did through a pipe:

// Input redirection only
int fd = open("file.txt", O_RDONLY);
dup2(fd, 0);
close(fd);
execl("/bin/wc", "wc", NULL);
// Output goes to console

With Problem 11 (dual redirection):

// Problem 11: Input AND output redirection
int fd = open("file.txt", O_RDONLY);
dup2(fd, 0);
close(fd);
fd = open("out.txt", O_WRONLY);  // Added!
dup2(fd, 1);                      // Redirect stdout too
close(fd);
execl("/bin/wc", "wc", NULL);
// Input from file, output to file

Note: The fd Variable Is Reused

The code uses the same variable fd for both files:

int fd = open("file.txt", O_RDONLY);
dup2(fd, 0);
close(fd);  // Close this descriptor

fd = open("out.txt", O_WRONLY);  // Reuse variable; usually gets fd=3 again
dup2(fd, 1);
close(fd);

Reusing the variable is a convenience, not a requirement: once dup2() has copied the descriptor onto 0 and the original is closed, its number is no longer needed. The second open() usually returns 3 again because the kernel always hands out the lowest unused descriptor number.
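The lowest-unused-number rule is easy to confirm. A minimal sketch, assuming a single-threaded process so that nothing else opens or closes descriptors in between; the function name `descriptor_number_is_reused` is hypothetical.

```c
#include <fcntl.h>
#include <unistd.h>

/* Opens path, closes it, and opens it again.
 * Returns 1 if both opens returned the same descriptor number,
 * 0 if they differed, and -1 if an open() failed. */
int descriptor_number_is_reused(const char *path) {
    int first = open(path, O_RDONLY);
    if (first < 0) return -1;
    close(first);                       /* number becomes available again */

    int second = open(path, O_RDONLY);  /* lowest unused number is handed out */
    if (second < 0) return -1;
    close(second);

    return first == second;
}
```

Calling it with a path such as "/dev/null" is expected to return 1 on a POSIX system, since open() is specified to return the lowest descriptor not currently open in the process.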

Process Execution Timeline:

Time 0:   Parent forks child

Time 1:   Child opens file.txt
          Child redirects stdin to file.txt
          Child opens out.txt (for writing)
          Child redirects stdout to out.txt

Time 2:   Child executes /bin/wc
          /bin/wc starts running

Time 3:   /bin/wc reads from stdin (actually file.txt)
          Processes lines, words, characters
          /bin/wc writes to stdout (actually out.txt)

Time 4:   /bin/wc finishes
          Child process terminates

Time 5:   Parent's waitpid() returns
          Parent prints "DONE\n"
          Parent exits

Comparison: Manual vs. Shell Syntax

# Shell command
$ wc < file.txt > out.txt

# What happens internally (this C program does it manually)
fork()
  stdin ← file.txt
  stdout → out.txt
  exec(/bin/wc)

File Mode Consideration:

The code opens out.txt with O_WRONLY but not O_CREAT | O_TRUNC:

fd = open("out.txt", O_WRONLY);  // File must already exist!

Better practice:

fd = open("out.txt", O_WRONLY | O_CREAT | O_TRUNC, 0644);

This creates the file if it doesn’t exist and truncates it if it does.
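A sketch of the whole redirection with that open() mode and basic error checks is given below. It is one possible arrangement, not the instructor's code: file names are passed as parameters, execlp("wc", ...) is used instead of the fixed /bin/wc path, and the exit codes 126 and 127 are a convention borrowed from shells. The function name `run_wc_redirected` is hypothetical.

```c
#include <fcntl.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Runs "wc < in_path > out_path" in a child process.
 * Returns the child's exit status, or -1 if fork/wait failed. */
int run_wc_redirected(const char *in_path, const char *out_path) {
    pid_t pid = fork();
    if (pid < 0) return -1;

    if (pid == 0) {
        int in = open(in_path, O_RDONLY);
        if (in < 0) _exit(126);
        /* Create out_path if missing, empty it if present */
        int out = open(out_path, O_WRONLY | O_CREAT | O_TRUNC, 0644);
        if (out < 0) _exit(126);
        if (dup2(in, STDIN_FILENO) < 0 || dup2(out, STDOUT_FILENO) < 0)
            _exit(126);
        close(in);
        close(out);
        execlp("wc", "wc", (char *)NULL);
        _exit(127);                     /* reached only if exec failed */
    }

    int status;
    if (waitpid(pid, &status, 0) < 0) return -1;
    return WIFEXITED(status) ? WEXITSTATUS(status) : -1;
}
```

A main() matching the problem statement would call run_wc_redirected("file.txt", "out.txt") and then print "DONE\n".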

Real-World Output:

Assuming file.txt contains:

line one
line two
line three

The out.txt will contain:

3 6 29

(3 lines, 6 words, 29 characters)

Key Lesson:

This program shows how a shell implements I/O redirection using ordinary system calls:

The shell redirects file descriptors before exec()
The executed program inherits them and does not know a redirection happened
All of its I/O goes through the redirected descriptors
It never sees the file names, only descriptors 0 and 1

Problem 12: Pipeline of Two Commands (cat | wc)

Implement a program that implements the following command by forking two child processes and tying the standard output of the first child with the standard input of the second with a pipe. Your program must wait until the second process terminates and then print “ALL DONE\n” before terminating itself.

bash% /bin/cat file.txt | /bin/wc

To implement this program, you can use the following system calls: fork() creates a child process, execv(char *execFilename, char *args[]) loads an executable on top of an existing process, pipe(int fd[2]), which creates a pipe where fd[0] will be the read end of the pipe, fd[1] will be the write end of the pipe. Your program must wait until “/bin/wc” terminates and prints “DONE\n” before terminating. You can use waitpid(int pid) system call to wait for the termination of a process with “pid”.

Instructor Solution

#include <unistd.h>
#include <stdio.h>
#include <fcntl.h> // For open()
#include <sys/wait.h>

/* /bin/cat file.txt | /bin/wc */
int main(int argc, char *argv[]) {
    int fd[2];
    pipe(fd);

    if (fork() == 0) {
        // fork a child and load /bin/cat
        dup2(fd[1], 1);
        close(fd[0]);
        close(fd[1]);
        execl("/bin/cat", "cat", "file.txt", NULL);
    }

    close(fd[1]);
    pid_t pid;

    if ((pid = fork()) == 0) {
        dup2(fd[0], 0);
        close(fd[0]);
        execl("/bin/wc", "wc", NULL);
    } // end-if

    close(fd[0]);
    waitpid(pid, NULL, 0);
    printf("DONE\n");
    return 0;
} /* end-main */

Elaboration (by LLM)

This is the final form of the command pipeline pattern: connecting two independent external commands via a pipe. This is exactly how Unix shells implement pipelines.

The Complete Pipeline Pattern:

Parent
  ├─ Child 1 (/bin/cat)
  │   ├─ Input: file.txt
  │   ├─ Output: pipe (fd[1])
  │   └─ Process: read and output file contents
  │
  ├─ Pipe (buffer)
  │
  └─ Child 2 (/bin/wc)
      ├─ Input: pipe (fd[0])
      ├─ Output: stdout (console)
      └─ Process: count lines, words, chars

Key Pattern: Parent Closes Write End

if (fork() == 0) {
    dup2(fd[1], 1);  // Child 1 writes to pipe
    close(fd[0]);    // Child 1 won't read
    close(fd[1]);    // Close duplicate
    execl("/bin/cat", "cat", "file.txt", NULL);
}

close(fd[1]);  // PARENT MUST CLOSE WRITE END!
               // Otherwise Child 2 never sees EOF

Why Parent Must Close fd[1]:

If parent doesn’t close the write end:

Child 1 (/bin/cat):
  ├─ Has write end (fd[1])
  ├─ Writes file contents
  └─ Eventually exits (closes fd[1])

Child 2 (/bin/wc):
  ├─ Reads from pipe
  ├─ Blocks waiting for more data
  ├─ Checks: "Is write end closed?"
  ├─ Sees: Parent still has fd[1] open
  ├─ Thinks: "More data coming, keep waiting..."
  └─ DEADLOCK!

With parent closing fd[1]:

Child 1 (/bin/cat):
  ├─ Has write end (fd[1])
  ├─ Writes file contents
  └─ Eventually exits (closes fd[1])

Child 2 (/bin/wc):
  ├─ Reads from pipe
  ├─ Checks: "Is write end closed?"
  ├─ Sees: All write ends are closed
  ├─ Receives EOF signal
  ├─ Stops reading
  ├─ Processes buffered data
  ├─ Outputs results
  └─ Exits normally

File Descriptor Management:

After pipe() and before any fork():

Parent: fd[0] (read), fd[1] (write)

After first fork() (Child 1 - cat):

Parent: fd[0] (read), fd[1] (write)

Child 1:
  ├─ Inherits fd[0], fd[1]
  ├─ dup2(fd[1], 1): stdout → pipe
  ├─ close(fd[0]): won't read
  ├─ close(fd[1]): close original
  ├─ Now: only fd 0, 1, 2 are open; fd 1 is the pipe's write end
  └─ exec(/bin/cat)

Parent closes write end:

Parent: close(fd[1])
  Now: fd[0] (read only), fd[1] closed

After second fork() (Child 2 - wc):

Parent: fd[0] (read), fd[1] (closed)

Child 2:
  ├─ Inherits fd[0] only (parent closed fd[1] before this fork)
  ├─ dup2(fd[0], 0): stdin ← pipe
  ├─ close(fd[0]): close original
  ├─ Now: the only write end left is Child 1's stdout
  ├─ EOF arrives once cat exits and the pipe is drained
  └─ exec(/bin/wc)

Parent closes read end:

Parent: close(fd[0])
  Now: nothing to do, just wait

Data Flow Timeline:

Time 0:   Parent creates pipe
          Parent forks Child 1 (cat)

Time 1:   Child 1 redirects stdout to pipe
          Parent closes pipe write end
          Parent forks Child 2 (wc)

Time 2:   Child 1 executes /bin/cat
          /bin/cat opens file.txt
          /bin/cat reads file contents
          /bin/cat outputs to stdout (pipe)
          Data flows: file.txt → pipe buffer

Time 3:   Child 2 executes /bin/wc
          /bin/wc reads from stdin (pipe)
          /bin/wc starts counting as data arrives
          CPU-I/O overlap: cat reading while wc processing

Time 4:   Child 1 finishes reading file
          /bin/cat exits
          Write end of pipe closed
          Pipe sends EOF to Child 2

Time 5:   Child 2 sees EOF
          /bin/wc finishes counting
          /bin/wc outputs results to console
          /bin/wc exits

Time 6:   Parent's waitpid() returns
          Parent prints "DONE\n"
          Parent exits

Concurrency:

Notice that /bin/cat and /bin/wc run concurrently (the timings below are illustrative, not measured):

Time 0-5ms:   cat reading file, wc waiting for data
Time 5-50ms:  cat outputs lines 1-10 to pipe
              wc reads and processes lines 1-5
              cat continues outputting lines 11-20
              wc continues processing lines 6-10
Time 50ms:    cat finishes, closes write end
Time 51ms:    wc finishes processing all data, outputs result

If cat instead wrote to a temporary file and wc ran only afterwards, the two stages would run back to back:

Time 0-10ms:  cat reads entire file
Time 10ms:    cat finishes
Time 10-15ms: wc counts (cat has already exited)

Pipes let the two stages overlap their I/O and processing.

Comparison: Problem 7 vs Problem 12

Both implement a two-stage pipeline that ends in wc, but:

Aspect	Problem 7	Problem 12
First cmd	/bin/ls (implicit)	/bin/cat file.txt
Second cmd	/bin/wc	/bin/wc
Focus	Basic pipeline	File reading
Pattern	Shell-like	File + pipe combo

Real-World Shell Equivalent:

$ cat file.txt | wc

# What the shell does internally:
# (Exactly what this C program does)

Error Handling Not Shown:

The code omits error checking for clarity:

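// More robust version of the first child:
if (fork() == 0) {
    if (dup2(fd[1], 1) < 0) {
        perror("dup2");
        _exit(1);
    }
    close(fd[0]);
    close(fd[1]);
    execl("/bin/cat", "cat", "file.txt", NULL);
    perror("execl");  // execl() returns only on failure
    _exit(127);       // never fall through into the parent's code
}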

Always check return values in production code.

Key Lesson:

This demonstrates the complete pipeline implementation:

Create pipe before forking
First child writes to pipe, executes first command
Parent closes write end (critical!)
Second child reads from pipe, executes second command
Parent closes read end, waits for second child
Both commands run concurrently, sharing data via pipe

Problem 13: Process Proxy with Bidirectional Pipes

Consider implementing a program P that creates two child processes, C1 and C2 with the following constraints: P creates 2 children C1 and C2 and acts as a proxy between the two: That is, C1 sends a message “Hello\n” to P, which receives this message and sends it over to C2, which simply prints it on the screen. To achieve this, you must create two pipes, one between P and C1 and another between P and C2.

You may assume the existence of a system call pipe(int fd[2]) that creates a pipe and returns 2 descriptors. fd[0] will be the read end of the pipe, fd[1] will be the write end of the pipe. Make sure parent P waits for all children to terminate before printing “Parent done…\n” and terminating itself. You can use waitpid(int pid) system call to wait for the termination of a process with “pid”. Use fork() system call to create a child process.

Instructor Solution

#include <unistd.h>
#include <stdio.h>
#include <stdlib.h>
#include <sys/wait.h>

int main(int argc, char *argv[]) {
    int fd1[2];
    pipe(fd1); // From C1 to Parent

    if (fork() == 0) {
        // C1
        write(fd1[1], "Hello\n", 6);
        exit(0);
    } // end-if

    int fd2[2];
    pipe(fd2); // From Parent to C2
    char message[6];
    int len = read(fd1[0], message, 6); // Message from C1
    write(fd2[1], message, len); // Message to C2
    close(fd1[0]);
    close(fd1[1]);
    close(fd2[1]);

    pid_t pid;
    if ((pid = fork()) == 0) {
        // C2
        char message[6];
        int len = read(fd2[0], message, 6); // Read hello
        write(1, message, len);
        close(fd2[0]);
        exit(0);
    } // end-if

    close(fd2[0]);
    waitpid(pid, NULL, 0);
    printf("Parent done…\n");
    return 0;
} /* end-main */

Elaboration (by LLM)

This program demonstrates a proxy pattern where the parent acts as a middleman between two child processes. Compare with Problem 6:

Pattern	Problem 6 (Chain)	Problem 13 (Proxy)
Data path	C1 → C2	C1 → Parent → C2
Architecture	Linear pipeline	Hub-and-spoke
Use case	Direct data flow	Parent processes data
Complexity	Simpler	More powerful

Proxy Architecture:

C1 (Producer)         Parent (Proxy)         C2 (Consumer)
     |                     |                      |
     |─── write(fd1) ──────┤                      |
     |                     |                      |
     |              read(fd1[0])                  |
     |                     |                      |
     |              [could process here]         |
     |                     |                      |
     |              write(fd2[1]) ────────────────┤
     |                     |              read(fd2)
     |                     |                      |
    EXIT                   |              [display]
                     waitpid() ←─────────────── EXIT
                           |
                      printf("done")

Execution Flow:

// Main creates C1
if (fork() == 0) {
    write(fd1[1], "Hello\n", 6);  // C1 sends data
    exit(0);                        // C1 exits
}

// Main creates C2 AFTER getting C1's data
int len = read(fd1[0], message, 6);  // Parent receives from C1
write(fd2[1], message, len);          // Parent forwards to C2

if ((pid = fork()) == 0) {
    int len = read(fd2[0], message, 6);  // C2 receives from parent
    write(1, message, len);               // C2 displays
    exit(0);
}

Sequential Execution:

Unlike Problem 6 (concurrent), this is sequential:

C1 writes → Parent blocked in read(fd1[0]) until the data arrives
         ↓
Parent reads, forwards to fd2, then creates C2
         ↓
C2 reads → Parent blocked in waitpid()
         ↓
C2 writes output and exits
         ↓
Parent proceeds

The read() calls act as synchronization points.

Data Flow with Synchronization:

Time 0:   Parent forks C1
          C1: write(fd1[1], "Hello\n", 6)
          Parent: creates fd2, then blocks on read(fd1[0], ...)

Time 1:   C1 finishes write
          C1: exit(0)
          Parent: read() completes, gets "Hello\n"

Time 2:   Parent: write(fd2[1], message, len) (data waits in the pipe buffer)
          Parent closes fd1[0], fd1[1], fd2[1]
          Parent forks C2

Time 3:   C2: read(fd2[0], ...) finds "Hello\n" already buffered
          C2: write(1, message, len)
          Console: "Hello"
          C2: exit(0)
          Parent: closes fd2[0], blocks in waitpid()

Time 4:   Parent: waitpid() returns
          Parent: printf("Parent done…\n")

Pipe Closure Pattern:

// After reading from fd1
close(fd1[0]);   // Won't read fd1 again
close(fd1[1]);   // Parent never writes to fd1; release its copy of the write end

// After writing to fd2, before second fork
close(fd2[1]);   // Parent won't use write end anymore
// Parent still has fd2[0] open, so C2 inherits it at fork()
// and can read the buffered message

// After second fork
close(fd2[0]);   // Parent won't read fd2

Why This Pattern?

Proxy pattern is useful when:

Parent must process/validate data before forwarding
Parent needs to log intermediate values
Parent implements business logic between I/O
Parent coordinates multiple producers/consumers

Example Enhancement:

// Parent could transform the data:
int len = read(fd1[0], message, 6);

// Process/validate/transform
char transformed[6];
transform(message, transformed, len);

// Forward modified data
write(fd2[1], transformed, len);

Comparison: Sequential vs. Concurrent

Problem 6 (Concurrent):

C1 and C2 run simultaneously
Parent acts as pipe connector
Both read/write independently
Potential race conditions

Problem 13 (Sequential):

C1 runs, parent waits for its data
Parent processes/forwards to C2
Parent controls timing
Easy to synchronize

File Descriptor Lifecycle:

After pipe(fd1):
  Parent: 0(stdin), 1(stdout), 2(stderr), fd1[0], fd1[1]

After first fork() → C1:
  C1 inherits fd1[0] and fd1[1] (fd2 does not exist yet)
  C1: write(fd1[1], ...)
  C1: exit(0) → closes all descriptors

After pipe(fd2) and read(fd1[0], ...):
  Data transferred from C1
  Parent writes to fd2[1]
  Parent closes fd1[0], fd1[1], and fd2[1]

After second fork() → C2:
  C2 inherits 0, 1, 2, and fd2[0]
  C2: read(fd2[0], ...)
  C2: write(1, ...) → outputs to console
  C2: exit(0) → closes all descriptors

Parent cleanup:
  close(fd2[0]) after the second fork; fd2[1] was already closed before it

Key Difference: read() Blocking

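int len = read(fd1[0], message, 6);

This call blocks the parent until:

C1 writes data to fd1[1], OR
Every write end of fd1 is closed (EOF, read() returns 0)

In this solution the parent still holds its own copy of fd1[1] during the read, so EOF cannot occur: if C1 exited without writing, the parent would block forever. Closing fd1[1] in the parent before the read avoids that.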

This provides natural synchronization without explicit locks.

Real-World Applications:

Request-Response: Child sends request, parent responds
Filter: Parent filters/transforms data between children
Aggregator: Parent collects from multiple sources, sends to destination
Router: Parent routes data based on content

Output:

Hello
Parent done…

Lesson:

The proxy pattern shows how pipes enable complex process coordination:

Pipes provide automatic blocking for synchronization
Parent can implement logic between data transfers
Decouples producer (C1) from consumer (C2)
Parent becomes the control point for the system

Problem 14: Two-Way Pipe Communication with External Program

Assume that you are asked to implement a program (sum.c) that computes the sum of an array of integers stored in the following array: int nums[]. Your program needs to compute the sum and print it on the screen. Assuming that int nums[] = {1, 2, 3}, here is a sample run of your program:

bash% sum
The sum of the numbers in the array: 6
bash%

Instead of implementing the program yourself, you decide to make use of a system utility /bin/add, which computes the sum of a stream of integers fed in at its standard input and prints the sum on the screen. A sample output of /bin/add is given below, where the integers 1 2 3 4 are input by the user from the keyboard, and /bin/add prints their sum 10 on the standard output on the next line.

bash% /bin/add
1 2 3 4
10
bash%

Implement this program as follows: Create a child process to run “/bin/add” and create two pipes P1 and P2. P1 will be used by your program to send the numbers in “nums” array to /bin/add and P2 will be used by /bin/add to send your program the sum of these numbers. Make sure that you connect standard input of /bin/add to the “read end” of P1 and connect standard output of /bin/add to the “write end” of P2.

Instructor Solution

#include <unistd.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <sys/wait.h>

int main(int argc, char *argv[]) {
    int fd1[2];
    pipe(fd1); // From Parent to child
    int fd2[2];
    pipe(fd2); // From child to parent

    if (fork() == 0) {
        // Child
        dup2(fd1[0], 0);
        dup2(fd2[1], 1);
        close(fd1[0]);
        close(fd1[1]);
        close(fd2[0]);
        close(fd2[1]);
        execl("./add", "add", NULL);
        perror("execl"); // Reached only if exec fails
        exit(1);
    } // end-if

    close(fd1[0]);
    close(fd2[1]);
    int nums[] = {1, 2, 3, 4};

    for (int i = 0; i < sizeof(nums) / sizeof(int); i++) {
        char buffer[20];
        sprintf(buffer, "%d ", nums[i]);
        write(fd1[1], buffer, strlen(buffer));
    }

    close(fd1[1]); // Child sees EOF on its stdin
    char answer[20] = {'\0'};
    read(fd2[0], answer, sizeof(answer) - 1); // Leave room for the terminating '\0'
    printf("The sum of the numbers in the array: %s", answer);
    close(fd2[0]);
    wait(NULL); // Reap the child
    return 0;
} /* end-main */

Elaboration (by LLM)

This is the bidirectional pipe pattern: parent and child communicate in both directions. The external program (./add) remains unaware of pipes—it simply reads from stdin and writes to stdout.

Two-Pipe Architecture:

Parent                                Child (/bin/add)
  |                                       |
  fd1[1] (write) ──────pipe1──────→ stdin (0)
                                       |
                                    (process)
                                       |
  fd2[0] (read) ←──────pipe2←────── stdout (1)
  |

File Descriptor Setup in Child:

if (fork() == 0) {
    // Child inherits: fd1[0], fd1[1], fd2[0], fd2[1]

    dup2(fd1[0], 0);    // stdin ← parent's pipe1 (read end)
    dup2(fd2[1], 1);    // stdout → parent's pipe2 (write end)

    close(fd1[0]);      // Close originals (now duplicated as 0,1)
    close(fd1[1]);      // Don't need to write pipe1
    close(fd2[0]);      // Don't need to read pipe2
    close(fd2[1]);      // Close original (now duplicated as 1)

    execl("./add", "add", NULL);
}

File Descriptor State:

Child at exec time:

fd 0: stdin ← pipe1[0] (reads from parent)
fd 1: stdout → pipe2[1] (writes to parent)
fd 2: stderr (console)

When /bin/add reads from stdin, it reads parent’s data. When /bin/add writes to stdout, it writes to parent’s pipe.

Parent File Descriptor Management:

close(fd1[0]);  // Parent won't read from fd1
close(fd2[1]);  // Parent won't write to fd2

Parent only:

Writes to fd1[1] (send data to child)
Reads from fd2[0] (receive data from child)

Data Flow:

Parent creates the array:
  int nums[] = {1, 2, 3, 4}

Parent sends data:
  write(fd1[1], "1 ", 2)
  write(fd1[1], "2 ", 2)
  write(fd1[1], "3 ", 2)
  write(fd1[1], "4 ", 2)
           ↓
      pipe1 buffer
           ↓
Child (/bin/add) receives:
  Reads from stdin (fd=0)
  Accumulates: "1 2 3 4 "
  Parses integers: 1, 2, 3, 4
  Computes sum: 10
  Writes to stdout (fd=1)
  Output: "10\n"
           ↓
      pipe2 buffer
           ↓
Parent receives:
  read(fd2[0], answer, 20)
  Gets: "10\n"
  Displays: "The sum of the numbers in the array: 10"

String Formatting:

Notice parent formats numbers as strings:

for (int i = 0; i < sizeof(nums) / sizeof(int); i++) {
    char buffer[20];
    sprintf(buffer, "%d ", nums[i]);       // Convert int to string
    write(fd1[1], buffer, strlen(buffer)); // Send as text
}

The child program (./add) expects text input (like keyboard input), so parent must convert:

1 (int) → "1 " (string) → pipe → child reads as string

Close and EOF Signaling:

close(fd1[1]);  // CRITICAL!
                // After writing all numbers, close write end
                // Child sees EOF on stdin
                // Child knows no more data coming
                // Child can process and output result

Without closing fd1[1]:

Child: reads from stdin
Child: gets "1 2 3 4 "
Child: waits for more data
Parent: still has fd1[1] open
Child: doesn't see EOF
DEADLOCK!

Process Execution Timeline:

Time 0:   Parent forks child

Time 1:   Child redirects stdin ← pipe1, stdout → pipe2
          Parent closes unused ends: fd1[0], fd2[1]

Time 2:   Child executes /bin/add
          /bin/add starts, reads from stdin

Time 3:   Parent writes integers to fd1[1]
          1 2 3 4

Time 4:   /bin/add reads from stdin (pipe1)
          Receives: "1 2 3 4 "
          Parses and sums: 10

Time 5:   Parent closes fd1[1]
          /bin/add sees EOF on stdin
          Knows no more input coming

Time 6:   /bin/add outputs: "10\n" to stdout (pipe2)

Time 7:   Parent reads from fd2[0]
          Gets: "10\n"

Time 8:   Parent printf()
          Output: "The sum of the numbers in the array: 10"

Time 9:   /bin/add exits
          Parent closes fd2[0]
          Parent returns 0

Contrast: Unidirectional vs. Bidirectional

Problem 8 (Parent → Child):

Parent sends file contents to child
Child receives and processes
Child outputs to console
Parent doesn't read from child

Problem 14 (Bidirectional):

Parent sends data to child
Child processes and returns result
Parent reads child's output
Parent displays result

Why Two Pipes?

One pipe is unidirectional:

Can’t send data in both directions
Problem: Child output goes to console, not parent

Two pipes enable:

Parent → Child: send input
Child → Parent: receive output
Parent can process child’s result

Real-World Applications:

// Calculator:
Parent: "2 + 3"
Child: "5"

// Translator:
Parent: "hello"
Child: "hola"

// Encryption:
Parent: plaintext
Child: ciphertext

// Query:
Parent: SQL query
Child: result set

Edge Cases:

Buffer over-read:

char answer[20] = {'\0'};
read(fd2[0], answer, 20);  // Never stores more than 20 bytes, but a full 20-byte reply leaves no '\0' for printf("%s")

Better:

char answer[20];
int n = read(fd2[0], answer, sizeof(answer)-1);
answer[n > 0 ? n : 0] = '\0';  // Always terminated, even on EOF or error

Blocking reads:

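If the child hangs without writing or exiting:

read(fd2[0], answer, 20);  // Blocks forever!

A child that crashes is less of a problem: its descriptors are closed when it dies, so read() returns 0 (EOF) as long as the parent has closed its own fd2[1]. Guarding against a hung child would need a timeout or signal handling.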

Key Lesson:

Bidirectional pipe communication enables:

Request-Response pattern: Parent asks, child answers
Service abstraction: External programs as services
Data transformation: Parent controls workflow
Decoupling: Child doesn’t know about parent

This is the foundation of many Unix tools and server architectures.

Problem 15: Reading Output from External Program

Assume that there is a system utility program “/bin/rand_nums” that prints 5 random integers on the screen. Here is a sample run of this program:

bash% /bin/rand_nums
3 7 1 6 2
bash%

You are asked to implement a program called “sum.c” that will run “/bin/rand_nums”, get the 5 numbers generated, compute their sum and print the sum on the screen. To make this possible, you will need to create a child process to run “/bin/rand_nums”, create a pipe for the child process to send the generated numbers to your program over the pipe. Make sure that you connect descriptor 1 of “/bin/rand_nums” to the “write end” of the pipe so that the numbers are sent to your program over the pipe. Also connect your descriptor 0 to the “read end” of the pipe so that you can use “scanf” to read the numbers as if they are coming from the keyboard.

Here is a sample run of your program:

bash% ./sum
The sum of the 5 random numbers is 19
bash%

Recall that you create a new child using fork(), start a new executable using execl(char *execFilename, char *arg1, ...), create a pipe using pipe(int fd[2]), and make descriptor fd2 refer to the same open file as descriptor fd1 using the dup2(int fd1, int fd2) system call.

Instructor Solution

#include <stdio.h>
#include <stdlib.h>
#include <unistd.h>
#include <sys/wait.h>

int main(int argc, char *argv[]) {
    int fd[2];
    pipe(fd); // From child to parent

    if (fork() == 0) {
        // Child
        dup2(fd[1], 1);
        close(fd[1]);
        close(fd[0]);
        execl("./rand_nums", "rand_nums", NULL);
        perror("execl"); // Reached only if exec fails
        exit(1);
    } // end-if

    close(fd[1]);
    dup2(fd[0], 0);
    close(fd[0]);
    int sum = 0;

    for (int i = 0; i < 5; i++) {
        int num;
        if (scanf("%d", &num) != 1) break; // Stop on EOF or malformed input
        sum += num;
    }

    wait(NULL); // Reap the child
    printf("The sum of the 5 random numbers is %d\n", sum);
    return 0;
} /* end-main */