

  • (a) The available bandwidth on transcontinental links is increasing every year, yet the round-trip latency is starting to approach the speed of light limits. What are the implications of this for applications such as HTTP?

    Solution

    In short: The increasing bandwidth-delay product means applications become more latency-limited than bandwidth-limited. HTTP performance is increasingly constrained by round-trip time rather than download speed, making latency optimization (connection reuse, pipelining, multiplexing) more critical than ever.

    Elaboration:

    The Bandwidth-Delay Product Problem:

    Bandwidth Delay Product = Bandwidth × RTT
    Example:
    - 1 Gbps link, 100 ms RTT = 100 Mb = 12.5 MB
    - 10 Gbps link, 100 ms RTT = 1 Gb = 125 MB
    This represents "data in flight" on the link at any moment.
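    The arithmetic above can be checked with a short Python helper (link speeds and RTTs are the example values from the text):

    ```python
    def bdp_bytes(bandwidth_bps: float, rtt_s: float) -> float:
        """Bandwidth-delay product: bits in flight on the link, converted to bytes."""
        return bandwidth_bps * rtt_s / 8

    # 1 Gbps link, 100 ms RTT -> 12.5 MB in flight
    print(bdp_bytes(1e9, 0.100) / 1e6)    # 12.5 (MB)
    # 10 Gbps link, 100 ms RTT -> 125 MB in flight
    print(bdp_bytes(10e9, 0.100) / 1e6)   # 125.0 (MB)
    ```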

    What This Means for HTTP:

    1. Latency becomes the bottleneck

      Scenario: Download 1 MB file
      Slow link (10 Mbps, 50 ms RTT):
      Time = TCP handshake (50 ms) + HTTP request (50 ms) +
      transmission (800 ms) = ~900 ms
      Dominated by transmission time
      Fast link (1 Gbps, 100 ms RTT):
      Time = TCP handshake (100 ms) + HTTP request (100 ms) +
      transmission (8 ms) = ~208 ms
      Dominated by latency/roundtrips!
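    The same comparison can be sketched with a simple model (2 RTTs for connection setup plus the request, then transmission), using the numbers above:

    ```python
    def download_time(size_bytes: float, bandwidth_bps: float, rtt_s: float,
                      setup_rtts: int = 2) -> float:
        """Rough fetch model: handshake + request RTTs, then transmission time."""
        transmission = size_bytes * 8 / bandwidth_bps
        return setup_rtts * rtt_s + transmission

    # 1 MB file on each link
    slow = download_time(1e6, 10e6, 0.050)   # 10 Mbps, 50 ms RTT
    fast = download_time(1e6, 1e9, 0.100)    # 1 Gbps, 100 ms RTT
    print(f"slow link: {slow*1000:.0f} ms")  # ~900 ms, transmission-dominated
    print(f"fast link: {fast*1000:.0f} ms")  # ~208 ms, latency-dominated
    ```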
    2. Connection overhead matters more

      With HTTP/1.0 (new connection per object):
      For 10 objects: 10 × (handshake ≈ 1 RTT + request/response ≈ 1 RTT) = 20 RTTs
      With fast link, 100 ms RTT:
      = 2 seconds just for connection setup and requests!
      Transmission time for small objects becomes negligible
    3. Why bandwidth improvements help less

      Going from 100 Mbps to 1 Gbps (10× improvement):
      Saves ~72 ms per 1 MB (transmission drops from 80 ms to 8 ms)
      But RTT is still 100 ms
      RTT is bounded by propagation delay, which is already near the
      speed-of-light-in-fiber limit on long paths
      Latency is the real constraint

    Implications for HTTP Applications:

    • Connection Reuse Critical: Persistent connections (HTTP/1.1) become essential
    • Pipelining/Multiplexing Needed: HTTP/2, HTTP/3 multiplexing over single connections
    • DNS Caching Essential: DNS lookups add 50-200 ms per domain
    • Geographic Distribution: Content delivery networks (CDNs) needed to reduce latency
    • Protocol Overhead Matters: TCP/TLS handshakes become dominant cost
    • Parallel Connections Less Useful: Each extra connection pays its own handshake and slow-start, so added parallelism gives diminishing returns

    Real World Example:

    Downloading webpage with 50 objects from one server:
    Old approach (HTTP/1.0, 6 parallel connections):
    ⌈50/6⌉ ≈ 9 rounds × 2 RTTs (setup + request) × 100 ms ≈ 1.8 seconds
    Bandwidth barely used
    New approach (HTTP/2 multiplexing, single connection):
    ~200-300 ms (1-2 RTTs for connection setup, then all requests multiplexed)
    Same bandwidth, much faster

    Conclusion:

    As bandwidth increases, applications become latency-bound rather than bandwidth-bound. HTTP design must minimize RTTs through connection multiplexing, protocol efficiency, and geographic proximity rather than expecting bandwidth improvements to help.

  • (b) What’s an authoritative name server? What part of the name space hierarchy is the City University of New York (CUNY) name server responsible for? Briefly explain.

    Solution

    In short: An authoritative name server is the official source for DNS records in a particular zone of the namespace hierarchy. CUNY’s name server (cuny.edu) is responsible for the cuny.edu domain zone, containing all hosts within CUNY (qc.cuny.edu, hunter.cuny.edu, etc.).

    Elaboration:

    What is an Authoritative Name Server?

    An authoritative name server:
    - Maintains the official DNS records for a specific zone
    - Responds to queries about hosts in that zone
    - Is the "source of truth" for that zone
    - Does NOT perform recursive queries (typically)

    DNS Hierarchy:

    Root zone (.)
    ├── .edu zone (root delegates)
    │ ├── .cuny.edu zone (edu delegates)
    │ ├── .mit.edu zone
    │ └── .stanford.edu zone
    └── .com zone
    ├── .google.com zone
    └── .amazon.com zone

    CUNY’s Responsibility:

    CUNY's authoritative nameserver (ns.cuny.edu or ns1.cuny.edu):
    Responsible zone: cuny.edu
    Contains records for:
    - qc.cuny.edu (Queens College)
    - hunter.cuny.edu (Hunter College)
    - baruch.cuny.edu (Baruch College)
    - host1.cuny.edu
    - host2.cuny.edu
    - ... any host *.cuny.edu
    Does NOT contain:
    - mit.edu records (MIT's server handles these)
    - google.com records (Google's server handles these)
    - Any hosts outside cuny.edu

    How it Works:

    Query: What is the IP of qc.cuny.edu?
    1. Client → Root nameserver: "Who handles .edu?"
    Root: "Ask the .edu TLD servers (e.g., a.edu-servers.net)"
    2. Client → .edu nameserver: "Who handles .cuny.edu?"
    .edu: "Ask ns.cuny.edu (CUNY's nameserver)"
    3. Client → ns.cuny.edu (authoritative): "IP of qc.cuny.edu?"
    CUNY: "It's 136.48.100.1" (authoritative answer)
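    The referral walk above can be sketched as a toy iterative resolver over hardcoded delegation tables (the zone names mirror the example; server names and the IP are the hypothetical values from the text):

    ```python
    # Toy delegation data: each zone either refers you to a child zone (None)
    # or answers authoritatively with an address record.
    ZONES = {
        ".":         {"edu.": None},                    # root refers you to .edu
        "edu.":      {"cuny.edu.": None},               # .edu refers you to cuny.edu
        "cuny.edu.": {"qc.cuny.edu.": "136.48.100.1"},  # authoritative A record
    }

    def iterative_resolve(name: str) -> str:
        zone = "."                          # every lookup starts at the root
        while True:
            for key, ip in ZONES[zone].items():
                if not name.endswith(key):
                    continue
                if ip is not None:          # authoritative answer
                    return ip
                zone = key                  # referral: ask the child zone's server
                break
            else:
                raise KeyError(name)

    print(iterative_resolve("qc.cuny.edu."))  # 136.48.100.1
    ```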

    Key Characteristics:

    | Aspect | Authoritative | Non-Authoritative (Resolver) |
    | --- | --- | --- |
    | Source | Official zone records | Cached records |
    | Updates | Maintained by zone admin | Cached with TTL |
    | Responsibility | One zone only | Multiple zones (via caching) |
    | Record Type | Complete for zone | Partial (what was requested before) |

    Conclusion:

    CUNY’s authoritative nameserver manages the cuny.edu zone and knows the official IP addresses for all hosts within that zone (qc.cuny.edu, hunter.cuny.edu, etc.). It is the authoritative source for these records in the DNS hierarchy.

  • (c) Suppose a user requests a Web page that consists of some text and two images. Will the client send one request and receive 3 response messages? Explain.

    Solution

    In short: It depends on the HTTP version and connection type. With HTTP/1.0 or non-persistent connections, the answer is NO—the client sends 3 requests (one for HTML, one for each image) and receives 3 responses. With HTTP/1.1 persistent connections or HTTP/2, it’s more complex.

    Elaboration:

    HTTP/1.0 with Non-Persistent Connections:

    Web page structure:
    - HTML document (text)
    - Image 1
    - Image 2
    Client behavior:
    1. Send HTTP GET request for HTML
    Receive response #1 (HTML)
    2. Parse HTML, find <img> references
    Send HTTP GET request for Image 1
    Receive response #2 (Image 1)
    3. Send HTTP GET request for Image 2
    Receive response #3 (Image 2)
    Result: 3 requests, 3 responses ✅

    HTTP/1.1 with Persistent Connections:

    Client behavior:
    1. Send GET for HTML
    Receive response #1
    2. Same TCP connection still open!
    Send GET for Image 1
    Receive response #2
    3. Same TCP connection still open!
    Send GET for Image 2
    Receive response #3
    Result: 3 requests, 3 responses
    But: All over the SAME TCP connection (more efficient)

    HTTP/1.1 with Pipelining:

    Client behavior (if pipelining enabled):
    1. Send GET request for HTML
    2. Send GET request for Image 1 (without waiting for response)
    3. Send GET request for Image 2 (without waiting for response)
    Then receive:
    - Response #1 (HTML)
    - Response #2 (Image 1)
    - Response #3 (Image 2)
    Result: 3 requests, 3 responses
    But: Requests pipelined, responses arrive in order

    HTTP/2 with Multiplexing:

    Client behavior:
    1. Single TCP connection
    2. Send frame for HTML
    3. Send frame for Image 1
    4. Send frame for Image 2
    Server can interleave responses:
    - Send HTML chunks
    - Send Image 1 chunks
    - Send Image 2 chunks
    - All on same connection, simultaneously
    Result: 3 responses, but not necessarily discrete "messages"

    The Key Point:

    | Aspect | HTTP/1.0 | HTTP/1.1 Persistent | HTTP/2 |
    | --- | --- | --- | --- |
    | Requests | 3 (separate connections) | 3 (same connection) | 3 (same connection) |
    | Responses | 3 (separate) | 3 (same connection) | 3 (multiplexed) |
    | Sequential? | Yes | Yes (default) | No (interleaved) |
    | Efficiency | Poor | Good | Excellent |

    Conclusion:

    The answer to the question as posed is NO: the client does not send one request and receive 3 responses; it sends 3 requests and receives 3 responses. How those exchanges are carried depends on the HTTP version:

    • HTTP/1.0: Three separate TCP connections, three distinct responses
    • HTTP/1.1+: One persistent connection, three responses in order
    • HTTP/2: One connection, three responses multiplexed together
  • (d) Can two distinct Web pages from the same origin server, e.g., www.mit.edu/research.html and www.mit.edu/students.html, be sent over the same persistent connection? Why or why not?

    Solution

    In short: YES. HTTP/1.1 persistent connections allow multiple requests and responses to be exchanged over the same TCP connection. Two distinct web pages from the same origin server can absolutely be sent over the same persistent connection.

    Elaboration:

    How Persistent Connections Work:

    Traditional (HTTP/1.0, non-persistent):
    1. TCP connection established
    2. GET /research.html
    3. Receive response (research.html)
    4. TCP connection closed
    5. NEW TCP connection established
    6. GET /students.html
    7. Receive response (students.html)
    8. TCP connection closed

    With HTTP/1.1 Persistent Connection:

    1. TCP connection established (SYN, SYN-ACK, ACK)
    2. GET /research.html
    Receive response (research.html)
    Connection stays OPEN
    3. GET /students.html (same TCP connection!)
    Receive response (students.html)
    Connection stays OPEN
    4. TCP connection closed (when idle timeout or explicit close)

    Benefits:

    | Benefit | Impact |
    | --- | --- |
    | No TCP handshake overhead | Saves ~1 RTT per page |
    | No SSL/TLS renegotiation | Saves 1-2 RTTs if HTTPS (TLS handshake) |
    | Connection warm-up | Congestion window stays large |
    | Network efficiency | Better link utilization |

    Example Timeline:

    Time 0 ms:
    Send: GET /research.html
    Time 50 ms:
    Receive: 200 OK + research.html
    Time 60 ms:
    Send: GET /students.html (same connection)
    Time 110 ms:
    Receive: 200 OK + students.html
    Total time: 110 ms
    If separate connections:
    Connection 1: TCP handshake (50 ms) + request/response (50 ms) = 100 ms
    Connection 2: TCP handshake (50 ms) + request/response (50 ms) = 100 ms
    Total: 200 ms (90 ms extra!)

    HTTP Request Format (same connection):

    GET /research.html HTTP/1.1
    Host: www.mit.edu
    Connection: keep-alive
    [Server sends response, connection remains open]
    GET /students.html HTTP/1.1
    Host: www.mit.edu
    Connection: keep-alive
    [Server sends response, connection remains open]
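    The same two-requests-one-connection exchange can be demonstrated with Python's standard library; a throwaway local HTTP/1.1 server stands in for www.mit.edu (the address, port, and page bodies are local stand-ins):

    ```python
    import http.client
    import http.server
    import threading

    class Handler(http.server.BaseHTTPRequestHandler):
        protocol_version = "HTTP/1.1"        # enables persistent connections
        def do_GET(self):
            body = f"page {self.path}".encode()
            self.send_response(200)
            self.send_header("Content-Length", str(len(body)))  # framing for keep-alive
            self.end_headers()
            self.wfile.write(body)
        def log_message(self, *args):        # silence per-request logging
            pass

    server = http.server.ThreadingHTTPServer(("127.0.0.1", 0), Handler)
    threading.Thread(target=server.serve_forever, daemon=True).start()

    conn = http.client.HTTPConnection("127.0.0.1", server.server_address[1])
    conn.request("GET", "/research.html")    # first page
    first = conn.getresponse().read()
    conn.request("GET", "/students.html")    # second page, SAME TCP connection
    second = conn.getresponse().read()
    conn.close()
    server.shutdown()
    print(first, second)   # b'page /research.html' b'page /students.html'
    ```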

    Conditions:

    Persistent connections work when:
    1. Both pages from SAME server (www.mit.edu)
    2. HTTP/1.1 is used (default in modern browsers)
    3. Connection header not set to "close"
    4. Content-Length or chunked encoding provided
    5. No HTTP errors that close connection (500, 503, etc.)

    Conclusion:

    YES, two distinct web pages from the same origin server can be sent over the same persistent connection. This is the default behavior in HTTP/1.1, and it significantly improves performance by eliminating TCP handshake overhead.

  • (e) Can two distinct Web pages from different origin servers, e.g., www.mit.edu/research.html and www.cuny.edu/students.html, be sent over the same persistent connection? Why or why not?

    Solution

    In short: NO. Persistent connections are specific to a single server. A connection to www.mit.edu cannot be reused for requests to www.cuny.edu. The client must establish a separate TCP connection to each origin server.

    Elaboration:

    Why Not?

    HTTP/1.1 persistent connections are tied to:
    1. Host (e.g., www.mit.edu)
    2. Port (e.g., 80 for HTTP)
    3. Protocol (HTTP vs HTTPS)
    Connection to www.mit.edu:80 is separate from www.cuny.edu:80
    Cannot be reused across different servers

    TCP Connection Mechanics:

    TCP connection identified by 5-tuple:
    - Source IP
    - Source port
    - Destination IP (www.mit.edu = 128.30.2.36)
    - Destination port (80)
    - Protocol (TCP)
    Connection to www.cuny.edu (136.48.0.1) would be:
    - Source IP (same)
    - Source port (different)
    - Destination IP (different!) ← DIFFERENT SERVER
    - Destination port (80)
    - Protocol (TCP)
    Completely different connection
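    A minimal illustration of why the two connections cannot be the same (the server IPs are the hypothetical example addresses from the text):

    ```python
    from collections import namedtuple

    # A TCP connection is identified by this tuple; change any field
    # and it is a different connection.
    Conn = namedtuple("Conn", "proto src_ip src_port dst_ip dst_port")

    to_mit  = Conn("tcp", "10.0.0.5", 51000, "128.30.2.36", 80)  # www.mit.edu (example IP)
    to_cuny = Conn("tcp", "10.0.0.5", 51001, "136.48.0.1", 80)   # www.cuny.edu (example IP)

    print(to_mit == to_cuny)  # False: different destination, different connection
    ```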

    HTTP Request Format (different servers):

    ← Connection to www.mit.edu
    GET /research.html HTTP/1.1
    Host: www.mit.edu
    [Response received]
    [Connection closed or kept open for more mit.edu requests]
    ← NEW Connection to www.cuny.edu
    GET /students.html HTTP/1.1
    Host: www.cuny.edu
    [Response received]

    Timeline Comparison:

    Same server (www.mit.edu):
    Time 0: GET /research.html
    Time 50: Receive response
    Time 60: GET /students.html (same TCP connection)
    Time 110: Receive response
    Total: 110 ms
    Different servers (www.mit.edu vs www.cuny.edu):
    Time 0: TCP handshake to mit.edu
    Time 50: GET /research.html
    Time 100: Receive response
    Time 101: TCP handshake to cuny.edu (NEW connection)
    Time 151: GET /students.html
    Time 201: Receive response
    Total: 201 ms (extra TCP handshake!)

    Exception: HTTP Proxies

    A proxy can maintain persistent connections to multiple servers:
    Browser → Proxy: GET www.mit.edu/research.html
    Proxy ← → www.mit.edu (connection 1)
    Browser → Proxy: GET www.cuny.edu/students.html
    Proxy ← → www.cuny.edu (connection 2)
    But the browser itself still only connects to ONE proxy
    Proxy manages connections to multiple servers

    Modern Workaround: CDNs

    Instead of different servers:
    - Both pages served from CDN edge server
    - Same origin server (CDN node)
    - Persistent connection works
    Browser → CDN node for mit.edu content
    Browser → CDN node for cuny.edu content (same CDN server)
    Can reuse connection within CDN

    Conclusion:

    NO, two web pages from different origin servers cannot use the same persistent connection. Each server requires a separate TCP connection. This is a fundamental limitation of TCP (which is server-specific) and HTTP (which respects TCP connection boundaries).

  • (f) With nonpersistent connections between the browser and the origin server, is it possible for a single TCP segment to carry two distinct HTTP request messages. Explain.

    Solution

    In short: NO. With nonpersistent connections, each HTTP request requires its own TCP connection (3-way handshake, request, response, close). A TCP segment carries data from one connection only, so two HTTP requests would require two separate TCP connections and thus two separate segments.

    Elaboration:

    Understanding TCP Segments:

    A TCP segment is the unit of data at the transport layer
    - Contains TCP header + payload (HTTP data)
    - Belongs to ONE TCP connection (identified by source/dest IP:port)
    One segment = One TCP connection
    Cannot carry data from two different connections

    Nonpersistent Connection Model:

    Request 1:
    1. SYN (TCP handshake)
    2. SYN-ACK
    3. ACK
    4. [TCP segment with HTTP GET request]
    Carries: GET /page1.html HTTP/1.0\r\n...
    5. [Response received, connection closes]
    Request 2:
    6. NEW SYN (new TCP connection)
    7. SYN-ACK
    8. ACK
    9. [NEW TCP segment with HTTP GET request]
    Carries: GET /page2.html HTTP/1.0\r\n...
    10. [Response received, connection closes]

    Why Not in One Segment?

    Hypothesis: Send both in one segment?
    GET /page1.html HTTP/1.0\r\n...
    GET /page2.html HTTP/1.0\r\n...
    Problem 1: Which connection?
    - With nonpersistent HTTP, each request gets its OWN TCP connection,
      even when both requests go to the same server
    - A TCP segment belongs to exactly one of those connections
    - So the two requests necessarily travel in different segments
    Problem 2: HTTP protocol expectation
    - Server receives segment on established connection
    - Reads first request: GET /page1.html
    - Sends response for page1
    - Connection closes (nonpersistent)
    - Second request is lost!
    Problem 3: Multiple requests aren't delimited
    - How would server know where one HTTP message ends?
    - Without Content-Length or keep-alive, message ends with connection close

    What WOULD Work (but breaks nonpersistent model):

    If we COULD send two requests in one segment:
    GET /page1.html HTTP/1.1\r\n
    Host: server.com\r\n
    Connection: keep-alive\r\n
    Content-Length: 0\r\n
    \r\n
    GET /page2.html HTTP/1.1\r\n
    Host: server.com\r\n
    Content-Length: 0\r\n
    \r\n
    But this REQUIRES:
    - Persistent connection (HTTP/1.1)
    - Keep-alive header
    - Proper message framing
    - This is NOT nonpersistent!
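    This can be demonstrated against a throwaway local HTTP/1.1 server: two complete GET requests are handed to TCP in a single write, and both get answered, but only because the connection is persistent. (The server, host name, and paths are local stand-ins; note that one `sendall` usually becomes one segment on loopback, though TCP makes no such guarantee.)

    ```python
    import http.server
    import socket
    import threading

    class Handler(http.server.BaseHTTPRequestHandler):
        protocol_version = "HTTP/1.1"        # persistent: required for this to work
        def do_GET(self):
            body = f"page {self.path}".encode()
            self.send_response(200)
            self.send_header("Content-Length", str(len(body)))
            self.end_headers()
            self.wfile.write(body)
        def log_message(self, *args):
            pass

    server = http.server.ThreadingHTTPServer(("127.0.0.1", 0), Handler)
    threading.Thread(target=server.serve_forever, daemon=True).start()

    # Two complete, properly framed GET requests in one write.
    requests = (b"GET /page1.html HTTP/1.1\r\nHost: test\r\n\r\n"
                b"GET /page2.html HTTP/1.1\r\nHost: test\r\nConnection: close\r\n\r\n")

    sock = socket.create_connection(("127.0.0.1", server.server_address[1]))
    sock.sendall(requests)
    data = b""
    while chunk := sock.recv(4096):          # read until the server closes
        data += chunk
    sock.close()
    server.shutdown()
    print(data.count(b"HTTP/1.1 200"))       # 2: both pipelined requests answered
    ```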

    TCP and HTTP Constraints:

    | Constraint | Implication |
    | --- | --- |
    | Nonpersistent = new TCP connection per request | Each request needs own 3-way handshake |
    | TCP segment belongs to one connection | Can’t mix requests from different connections |
    | Connection close ends message | Second request lost when connection closes |
    | HTTP/1.0 (nonpersistent) has no framing | Can’t delimit multiple requests |

    Example Timeline:

    Time 0: SYN ——————→
    Time 10: ←———— SYN-ACK
    Time 20: ACK ——————→
    Time 30: GET request ——→ [One segment carries one HTTP request]
    Time 80: ←———— Response
    Time 90: FIN ——————→ [Connection closes]
    Time 91: NEW SYN ———→ [New connection for second request]
    Time 101: ←——— SYN-ACK
    Time 111: ACK ——————→
    Time 121: GET request ——→ [Different segment, different connection]
    Time 171: ←———— Response

    Conclusion:

    NO. A TCP segment cannot carry two distinct HTTP request messages in a nonpersistent connection model because:

    1. Each request requires its own TCP connection
    2. A TCP segment belongs to exactly one connection
    3. Nonpersistent connections close after response, losing any additional data
    4. HTTP/1.0 has no framing mechanism to delimit multiple messages

    This is why persistent connections (HTTP/1.1) were invented—to allow multiple requests over one connection and better utilize bandwidth.

  • (g) We know that a separate TCP connection is established for data transfer in FTP. Briefly describe the client and server communication to make this possible.

    Solution

    In short: FTP uses two TCP connections: a control connection for commands and a data connection for file transfer. The client sends commands (USER, PASS, RETR, STOR) over the control connection, and the server establishes a data connection when needed, either in active mode (server initiates) or passive mode (client initiates).

    Elaboration:

    Two Connection Model:

    FTP Client                            FTP Server
    Control connection  ←———————————→  control port 21
      (client commands, server responses)
    Data connection     ←———————————→  data port 20 (active)
                                          or random port (passive)
      (file data transfer)

    Active Mode (Server-Initiated Data Connection):

    Step 1: Control Connection Established
    Client → Server: TCP connection to port 21
    This persists for entire FTP session
    Step 2: User Authentication
    Client → Server (control): USER username\r\n
    Server → Client (control): 331 Password required\r\n
    Client → Server (control): PASS password\r\n
    Server → Client (control): 230 Login successful\r\n
    Step 3: Announce Data Port and Issue Retrieve Command
    Client → Server (control): PORT h1,h2,h3,h4,p1,p2\r\n
    (the client's IP address and the port it is listening on)
    Server → Client (control): 200 PORT command successful\r\n
    Client → Server (control): RETR filename\r\n
    Server → Client (control): 150 Opening data connection\r\n
    Step 4: Server Initiates Data Connection
    Server → Client: TCP connection from server port 20 to the client's announced data port
    [Data transfer begins on this connection]
    [Transfer completes]
    Server → Client (data): Connection closes
    Step 5: Server Notifies Completion
    Server → Client (control): 226 Transfer complete\r\n

    Passive Mode (Client-Initiated Data Connection):

    Motivation: Firewalls/NAT often block incoming connections
    Step 1-2: Control Connection & Authentication (same as active)
    Step 3: Request Passive Mode
    Client → Server (control): PASV\r\n
    Server → Client (control): 227 Entering Passive Mode (h1,h2,h3,h4,p1,p2)\r\n
    [Response contains server's IP and random port number]
    Step 4: Client Initiates Data Connection
    Client → Server: TCP connection to provided IP:port
    (Server was listening on this port)
    Step 5: Issue Retrieve Command
    Client → Server (control): RETR filename\r\n
    Server → Client (control): 150 Opening data connection\r\n
    [Data transfer on already-established data connection]
    [Transfer completes]
    Data connection closes
    Step 6: Completion Notification
    Server → Client (control): 226 Transfer complete\r\n
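    The passive-mode reply in step 3 encodes the data port in its last two numbers as p1 × 256 + p2; a small parser makes the arithmetic concrete (the sample reply values are hypothetical):

    ```python
    import re

    def parse_pasv(reply: str):
        """Decode a '227 Entering Passive Mode (h1,h2,h3,h4,p1,p2)' reply."""
        h1, h2, h3, h4, p1, p2 = map(
            int, re.search(r"\((.*?)\)", reply).group(1).split(","))
        return f"{h1}.{h2}.{h3}.{h4}", p1 * 256 + p2

    ip, port = parse_pasv("227 Entering Passive Mode (192,168,1,2,4,210)")
    print(ip, port)  # 192.168.1.2 1234 -- the client now connects here for data
    ```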

    Active Mode Timeline:

    Time 0: Control connection (client port → server:21)
    Time 10: USER command sent
    Time 20: PASS command sent
    Time 30: RETR filename command sent
    Time 40: ← Server initiates data connection (server:20 → client port X)
    Time 50: Data transfer begins
    Time 500: Data transfer completes
    Time 510: ← 226 Transfer complete (control connection)

    Passive Mode Timeline:

    Time 0: Control connection (client port → server:21)
    Time 10: USER command sent
    Time 20: PASS command sent
    Time 30: PASV command sent
    Time 40: ← Server responds with port number (e.g., 1234)
    Time 50: Data connection initiated (client → server:1234)
    Time 60: RETR filename command sent
    Time 70: Data transfer begins
    Time 500: Data transfer completes
    Time 510: ← 226 Transfer complete (control connection)

    Command Examples on Control Connection:

    | Command | Purpose | Response |
    | --- | --- | --- |
    | USER | Provide username | 331 (need password) |
    | PASS | Provide password | 230 (success) or 530 (fail) |
    | RETR | Retrieve file | 150 (opening data), 226 (done) |
    | STOR | Store file | 150 (opening data), 226 (done) |
    | LIST | List directory | 150 (opening data), 226 (done) |
    | QUIT | End session | 221 (goodbye) |

    Why Separate Data Connection?

    1. Protocol separation
    - Control: Command/response (ASCII text, small)
    - Data: File transfer (binary, large)
    2. Flexibility
    - Can transfer multiple files without re-authenticating
    - Can use different data rates
    - Can resume interrupted transfers
    3. Network efficiency
    - Control connection lightweight
    - Data connection optimized for throughput
    4. Compatibility
    - Works with firewall rules
    - Can use active or passive depending on network

    Key Points:

    - Control connection: Always client → server:21 (persists)
    - Data connection: Separate, established per transfer
    - Active: Server initiates data connection from port 20
    - Passive: Client initiates to server's random port
    - Responses on control connection indicate data connection status

    Conclusion:

    FTP uses a control connection (to port 21) for commands and responses, and a separate data connection (port 20 for active, random port for passive) for actual file transfer. The client sends FTP commands over the control connection, and the server initiates (active mode) or accepts (passive mode) the data connection as needed. This separation allows efficient file transfer while maintaining session control and enabling features like authentication and error reporting.

  • (h) Does a user e-mail agent upload an outgoing e-mail to a mail server using POP3/IMAP, or SMTP? Briefly explain.

    Solution

    In short: SMTP is used to upload outgoing email. POP3 and IMAP are used only for downloading/retrieving received email. The mail client uses SMTP to send messages to the mail server, which then routes them to recipients.

    Elaboration:

    Three Separate Protocols:

    User's Mail Client
    SMTP (port 25, 465, 587): SEND outgoing email
    Mail Server (SMTP server)
    Routes to recipient's mail server via SMTP
    Recipient's Mail Server (POP3/IMAP server)
    POP3 or IMAP (ports 110/995, 143/993): RETRIEVE email
    Recipient's Mail Client

    SMTP (Simple Mail Transfer Protocol):

    Purpose: SENDING email
    Client workflow:
    1. User composes email in mail client (Outlook, Gmail, etc.)
    2. User clicks "Send"
    3. Mail client connects to SMTP server (port 587 with TLS)
    4. Client authenticates: AUTH LOGIN
    5. Client sends message:
    - MAIL FROM: sender@domain.com
    - RCPT TO: recipient@domain.com
    - DATA (message body)
    6. Server accepts and routes message
    7. Connection closes
    Server then:
    - Looks up recipient's mail server via DNS
    - Connects to recipient's SMTP server
    - Delivers message
    - (May queue if recipient offline)

    POP3 (Post Office Protocol 3):

    Purpose: RETRIEVING email (download-and-delete model)
    Client workflow:
    1. Mail client connects to POP3 server (port 110 or 995)
    2. User authenticates with username/password
    3. Server returns list of messages
    4. Client downloads messages
    5. Messages deleted from server (typically)
    6. Connection closes
    Characteristic:
    - Intended for single client access
    - After download, email usually removed from server
    - Not ideal for multiple devices

    IMAP (Internet Message Access Protocol):

    Purpose: RETRIEVING email (keep-on-server model)
    Client workflow:
    1. Mail client connects to IMAP server (port 143 or 993)
    2. User authenticates
    3. Server provides folder structure (Inbox, Drafts, Sent, etc.)
    4. Client can:
    - Preview messages without downloading
    - Download specific messages
    - Delete, flag, organize messages
    - Synchronize across devices
    5. Messages stay on server
    6. Connection can remain open
    Characteristic:
    - Designed for multiple client access
    - Emails remain on server until explicitly deleted
    - Great for accessing from multiple devices
    - More bandwidth-efficient (selective download)

    Complete Email Flow:

    Alice sends email to Bob:
    Step 1 (SMTP - Send):
    Alice's client → SMTP server (mail.alice.com:587)
    Sends: alice@alice.com → bob@bob.com
    Step 2 (SMTP - Route):
    mail.alice.com → mail.bob.com (SMTP)
    Message transferred between servers
    Step 3 (IMAP/POP3 - Receive):
    Bob's client → mail.bob.com (IMAP port 993)
    Bob downloads/reads message
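    The division of labor, and the POP3 download-and-delete vs IMAP keep-on-server distinction, can be sketched with a toy in-memory mail store (the addresses are illustrative):

    ```python
    # Toy mail store: one list of messages per mailbox.
    mailboxes = {"bob@bob.com": []}

    def smtp_deliver(rcpt: str, message: str) -> None:
        """SMTP's job: get the message INTO the recipient's server."""
        mailboxes[rcpt].append(message)

    def pop3_fetch(user: str) -> list:
        """POP3 model: download, then (typically) delete from the server."""
        messages, mailboxes[user] = mailboxes[user], []
        return messages

    def imap_fetch(user: str) -> list:
        """IMAP model: read, but leave messages on the server."""
        return list(mailboxes[user])

    smtp_deliver("bob@bob.com", "Hi Bob!")
    print(imap_fetch("bob@bob.com"))   # ['Hi Bob!']  (still on server)
    print(imap_fetch("bob@bob.com"))   # ['Hi Bob!']  (a second device sees it too)
    print(pop3_fetch("bob@bob.com"))   # ['Hi Bob!']  (downloaded...)
    print(mailboxes["bob@bob.com"])    # []           (...and gone from the server)
    ```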

    Key Distinction:

    | Protocol | Direction | Purpose | Port |
    | --- | --- | --- | --- |
    | SMTP | Client → Server | SEND email | 25, 465, 587 |
    | POP3 | Client ← Server | RETRIEVE email | 110, 995 |
    | IMAP | Client ← Server | RETRIEVE email | 143, 993 |

    Conclusion:

    SMTP is used for uploading/sending outgoing email to the mail server. POP3 and IMAP are used for downloading received email from the mail server. These are distinct protocols with different purposes in the email infrastructure.

  • (i) What’s a MIME type? What is it used for? Briefly explain.

    Solution

    In short: A MIME type (Multipurpose Internet Mail Extensions) is a standard label that identifies the format of data (e.g., text/plain, image/jpeg, application/pdf). It tells systems how to interpret and display the content, enabling proper handling of diverse file types across networks.

    Elaboration:

    What is MIME?

    MIME Type Syntax:
    type/subtype
    Examples:
    - text/plain (plain text)
    - text/html (HTML document)
    - image/jpeg (JPEG image)
    - image/png (PNG image)
    - application/pdf (PDF document)
    - application/json (JSON data)
    - audio/mpeg (MP3 audio)
    - video/mp4 (MP4 video)
    - application/zip (ZIP archive)
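    Python's standard `mimetypes` module implements exactly this filename extension → type/subtype mapping:

    ```python
    import mimetypes

    # guess_type maps a filename extension to its MIME type.
    for filename in ["notes.txt", "index.html", "photo.jpeg", "report.pdf"]:
        print(filename, "->", mimetypes.guess_type(filename)[0])
    # notes.txt -> text/plain
    # index.html -> text/html
    # photo.jpeg -> image/jpeg
    # report.pdf -> application/pdf
    ```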

    Purpose:

    Without MIME types:
    - System receives file called "document"
    - Is it text? Binary? Image? Archive?
    - How should it be displayed?
    - What program should open it?
    - Ambiguous and error-prone
    With MIME types:
    - Server sends "Content-Type: application/pdf"
    - Client knows it's a PDF
    - Client launches PDF reader
    - Content displayed correctly

    Common MIME Types:

    | Type | Subtypes | Purpose |
    | --- | --- | --- |
    | text | plain, html, css, javascript | Text-based files |
    | image | jpeg, png, gif, svg+xml | Image files |
    | audio | mpeg, wav, ogg | Audio files |
    | video | mp4, webm, ogg | Video files |
    | application | pdf, json, xml, zip, octet-stream | Data/binary files |
    | multipart | form-data, mixed, related | Multiple parts in one message |

    How MIME Works in HTTP:

    HTTP Response:
    HTTP/1.1 200 OK
    Content-Type: text/html; charset=utf-8
    Content-Length: 1234
    <!DOCTYPE html>
    <html>
    ...

    Browser sees “text/html” → Renders as HTML webpage

    HTTP Response:
    HTTP/1.1 200 OK
    Content-Type: application/pdf
    Content-Length: 50000
    [binary PDF data]

    Browser sees “application/pdf” → Launches PDF viewer

    MIME in Email:

    Email with attachment:
    From: alice@example.com
    To: bob@example.com
    Subject: Photos
    MIME-Version: 1.0
    Content-Type: multipart/mixed; boundary="boundary123"
    --boundary123
    Content-Type: text/plain
    Here are the photos you requested.
    --boundary123
    Content-Type: image/jpeg
    Content-Transfer-Encoding: base64
    [binary image data encoded as base64]
    --boundary123--

    Mail client:

    • Reads “multipart/mixed”
    • Recognizes multiple parts
    • Displays text portion
    • Saves image/jpeg as attachment with .jpg extension
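    Python's `email` package builds exactly this kind of message; adding an attachment turns it into multipart/mixed automatically (the addresses and the "image" bytes are placeholders):

    ```python
    from email.message import EmailMessage

    msg = EmailMessage()
    msg["From"] = "alice@example.com"
    msg["To"] = "bob@example.com"
    msg["Subject"] = "Photos"
    msg.set_content("Here are the photos you requested.")  # text/plain part
    msg.add_attachment(b"\x89not-a-real-jpeg",             # placeholder bytes
                       maintype="image", subtype="jpeg",
                       filename="photo.jpg")               # image/jpeg part

    print(msg.get_content_type())                    # multipart/mixed
    print([p.get_content_type() for p in msg.iter_parts()])
    # ['text/plain', 'image/jpeg']
    ```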

    MIME with Parameters:

    Content-Type: text/plain; charset=utf-8
    ├─ Type: text
    ├─ Subtype: plain
    └─ Parameter: charset=utf-8 (UTF-8 encoding)
    Content-Type: image/jpeg; name="photo.jpg"
    ├─ Type: image
    ├─ Subtype: jpeg
    └─ Parameter: filename for download
    Content-Type: multipart/form-data; boundary=----WebKitFormBoundary
    ├─ Type: multipart
    ├─ Subtype: form-data
    └─ Parameter: boundary delimiter for parts

    Why MIME Matters:

    1. Content Negotiation

      • Server can offer multiple formats
      • Client requests preferred format
      • “Accept: text/html, application/json”
    2. Charset Handling

      • “text/html; charset=utf-8”
      • Ensures proper character encoding
      • Prevents garbled text
    3. Plugin/Handler Selection

      • OS looks at MIME type
      • Launches appropriate application
      • User doesn’t need to specify
    4. Interoperability

      • Standard way to describe content
      • Works across all platforms
      • Enables automation

    Conclusion:

    A MIME type is a standard label (e.g., text/html, image/jpeg) that identifies the format of data. It’s used to tell systems how to interpret, display, and handle content, enabling proper routing and processing of diverse file types across email systems, web servers, and applications.

  • (j) Why might a domain address (e.g., www.cnn.com) have several IP addresses?

    Solution

    In short: A domain can have multiple IP addresses for load balancing (distributing traffic across servers), geographic redundancy (serving from multiple locations), fault tolerance (if one server fails, others handle traffic), and scalability (handling large traffic volumes).

    Elaboration:

    Load Balancing:

    Single IP address (poor):
    All requests → Single server
    Server capacity: 1000 requests/sec
    If 2000 requests arrive: 50% get dropped
    Multiple IP addresses (good):
    DNS returns multiple IPs in rotation
    Requests distributed across servers
    Total capacity: 4000 requests/sec (4 servers × 1000 each)
    www.cnn.com might have:
    - 93.184.216.34
    - 93.184.216.35
    - 93.184.216.36
    - 93.184.216.37

    How DNS Round-Robin Works:

    Client 1 → DNS: What's the IP for www.cnn.com?
    ← DNS: [93.184.216.34, 93.184.216.35, 93.184.216.36, ...]
    Client gets first IP: 93.184.216.34
    Client 2 → DNS: What's the IP for www.cnn.com?
    ← DNS: [93.184.216.35, 93.184.216.36, ..., 93.184.216.34]
    (rotated list)
    Client gets: 93.184.216.35
    Client 3 → DNS: What's the IP for www.cnn.com?
    ← DNS: [93.184.216.36, ..., 93.184.216.34, 93.184.216.35]
    Client gets: 93.184.216.36
    Result: Traffic spread across all servers
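    The rotation above can be sketched with a `deque` standing in for the DNS server's address pool (the IPs are the example addresses from the text):

    ```python
    from collections import deque

    # Address pool the authoritative server hands out (example IPs).
    pool = deque(["93.184.216.34", "93.184.216.35",
                  "93.184.216.36", "93.184.216.37"])

    def resolve():
        """Return the full list, then rotate so the next client sees a new first IP."""
        answer = list(pool)
        pool.rotate(-1)        # move the front address to the back
        return answer

    for client in range(3):
        print(f"client {client + 1} uses", resolve()[0])
    # client 1 uses 93.184.216.34
    # client 2 uses 93.184.216.35
    # client 3 uses 93.184.216.36
    ```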

    Geographic Redundancy:

    Single server (bad):
    Server in New York
    Los Angeles users: ~70 ms RTT
    Europe users: ~90-100 ms RTT
    User experience: Poor for latency-sensitive content
    Multiple geographic locations (good):
    Server 1: New York (1.2.3.4)
    Server 2: Los Angeles (1.2.3.5)
    Server 3: London (1.2.3.6)
    Server 4: Tokyo (1.2.3.7)
    User in LA → Connects to 1.2.3.5 (local server)
    Latency: ~5-10 ms (much better)
    GeoDNS can be used:
    Clients in US get US servers
    Clients in Europe get EU servers
    Clients in Asia get Asia servers

    Fault Tolerance:

    Single server (risky):
    www.cnn.com → 1.2.3.4
    Server 1.2.3.4 crashes
    Website is DOWN
    All users affected
    No redundancy
    Multiple servers (safe):
    www.cnn.com → [1.2.3.4, 1.2.3.5, 1.2.3.6, 1.2.3.7]
    Server 1.2.3.4 crashes:
    Clients keep connecting to other IPs
    Users redirected automatically
    Website stays UP
    Minimal disruption
    Health checks:
    Monitoring service checks each server
    If server unhealthy: Remove from DNS responses
    Clients automatically avoid failed server
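The health-check filtering described above can be sketched as follows (a toy model: `is_healthy` stands in for a real monitoring probe, and the addresses are illustrative):

```python
# DNS answers include only servers that pass their health check;
# clients then never see the failed server's address.
servers = {"1.2.3.4": True, "1.2.3.5": False, "1.2.3.6": True}

def dns_answer(pool, is_healthy):
    return [ip for ip, status in pool.items() if is_healthy(status)]

print(dns_answer(servers, lambda status: status))  # ['1.2.3.4', '1.2.3.6']
```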

    Traffic Scalability:

    Peak traffic analysis:
    - Typical traffic: 10,000 requests/sec per server
    - Peak traffic (election day, breaking news): 100,000 requests/sec
    Provisioning statically for peak:
    Would need 10 servers that sit mostly idle outside peaks
    Expensive and wasteful
    Multiple permanent servers:
    Run 4-5 servers normally
    During peak: Some requests queued briefly
    Costs less than scaling to 10
    Handles most spikes gracefully

    Real Example: CNN

    www.cnn.com DNS lookup returns:
    ; <<>> dig www.cnn.com
    www.cnn.com. 300 IN A 151.101.1.67
    www.cnn.com. 300 IN A 151.101.65.67
    www.cnn.com. 300 IN A 151.101.129.67
    www.cnn.com. 300 IN A 151.101.193.67
    Note: These are Fastly CDN IPs in different geographic regions

    Other Reasons:

    1. CDN (Content Delivery Network)

      • Multiple edge servers worldwide
      • Each has own IP
      • Users served from nearest edge
      • Faster content delivery
    2. A/B Testing

      • Version A served from IP 1.2.3.4
      • Version B served from IP 1.2.3.5
      • Different users test different versions
    3. Graceful Degradation

      • During maintenance: Reduce IPs in DNS response
      • Gradually drain traffic from server being updated
      • Zero downtime deploys
    4. DDoS Mitigation

      • Multiple IPs spread attack traffic
      • Easier to filter/block attack sources
      • Continues serving through attack

    Conclusion:

    A domain has multiple IP addresses primarily for:

    • Load balancing (distribute traffic)
    • Geographic redundancy (serve from multiple locations)
    • Fault tolerance (survive server failures)
    • Scalability (handle traffic spikes)
    • Performance (users connect to nearest server)

    This is achieved through DNS round-robin, GeoDNS, health checks, and CDN architecture.

  • (k) Why might a Web server have several IP addresses for a single interface?

    Solution

    In short: A web server might have multiple IP addresses on a single network interface to host multiple domains/websites, serve different services on different IPs, implement virtual hosting, isolate traffic for different customers, or handle SSL/TLS certificates for multiple domains.

    Elaboration:

    Virtual Hosting:

    Single server, multiple websites:
    IP Configuration:
    eth0: 1.2.3.4 (www.site1.com)
    eth0: 1.2.3.5 (www.site2.com)
    eth0: 1.2.3.6 (www.site3.com)
    All on same physical network interface (eth0)
    All running on same server machine
    When client connects:
    Client → 1.2.3.4 (gets site1)
    Client → 1.2.3.5 (gets site2)
    Client → 1.2.3.6 (gets site3)

    Before SNI (Server Name Indication):

    SSL/TLS required different IP per domain:
    Problem: How does server know which cert to use?
    - HTTPS handshake happens before HTTP Host header
    - Server doesn't know which domain client wants
    - Can't select correct certificate
    Solution: One IP per SSL domain
    - www.site1.com → IP 1.2.3.4 with Site1 cert
    - www.site2.com → IP 1.2.3.5 with Site2 cert
    - www.site3.com → IP 1.2.3.6 with Site3 cert
    Now server can identify domain from incoming IP

    Modern Era (with SNI):

    SNI (Server Name Indication) - TLS extension:
    - Client sends hostname during TLS handshake
    - Server knows which cert to use
    - Multiple domains can share one IP!
    But legacy support may still require:
    - Multiple IPs for older clients
    - Fallback for incompatible browsers

    Practical Scenarios:

    Scenario 1: Shared Hosting Provider

    Company: "WebHost.com" shared hosting
    One physical server (internal address 192.168.1.100) hosts 100 customer websites:
    - IP address 203.0.113.1 → customer1.com
    - IP address 203.0.113.2 → customer2.com
    - IP address 203.0.113.3 → customer3.com
    - ... up to 203.0.113.100 → customer100.com
    Benefits:
    - Single server, multiple paying customers
    - Each customer has own IP (feels exclusive)
    - Different SSL certs for each

    Scenario 2: Service Isolation

    Large enterprise server configuration:
    eth0 (single physical NIC):
    - 10.0.1.10: Public-facing web server (www.company.com)
    - 10.0.1.11: Admin dashboard (secure, restricted access)
    - 10.0.1.12: API server (api.company.com)
    - 10.0.1.13: Backup/Health check IP
    Benefits:
    - Can apply different firewall rules per IP
    - Different QoS (Quality of Service) per IP
    - Easier to limit access (block 10.0.1.11 from outside)

    Scenario 3: Multi-Tenant Application

    SaaS platform: Multiple customers on same server
    eth0:
    - 1.2.3.100: Company A instance
    - 1.2.3.101: Company B instance
    - 1.2.3.102: Company C instance
    Each customer accesses their own IP:
    Company A employees → https://app.companyA.com (→ 1.2.3.100)
    Company B employees → https://app.companyB.com (→ 1.2.3.101)
    Company C employees → https://app.companyC.com (→ 1.2.3.102)
    Benefits:
    - Logical separation (feels like dedicated server)
    - Different SLA/performance tiers per customer
    - Can restart one customer's instance without affecting others

    Linux Configuration Example:

    Terminal window
    # Configure multiple IPs on single interface (eth0)
    ip addr add 1.2.3.4/24 dev eth0
    ip addr add 1.2.3.5/24 dev eth0
    ip addr add 1.2.3.6/24 dev eth0
    # Verify:
    ip addr show eth0
    1: eth0: <BROADCAST,MULTICAST,UP,LOWER_UP>
    inet 1.2.3.4/24 scope global eth0
    inet 1.2.3.5/24 scope global secondary eth0
    inet 1.2.3.6/24 scope global secondary eth0

    Web Server Configuration (Apache):

    # Virtual host on different IPs
    <VirtualHost 1.2.3.4:443>
    ServerName www.site1.com
    DocumentRoot /var/www/site1
    SSLCertificateFile /path/to/site1.crt
    </VirtualHost>
    <VirtualHost 1.2.3.5:443>
    ServerName www.site2.com
    DocumentRoot /var/www/site2
    SSLCertificateFile /path/to/site2.crt
    </VirtualHost>
    <VirtualHost 1.2.3.6:443>
    ServerName www.site3.com
    DocumentRoot /var/www/site3
    SSLCertificateFile /path/to/site3.crt
    </VirtualHost>

    Benefits Summary:

    Benefit | Use Case
    Virtual Hosting | Host multiple websites on one server
    SSL/TLS per domain | Each domain with own certificate (pre-SNI)
    Service Isolation | Different firewall rules per service
    Tenant Separation | Multi-tenant SaaS platforms
    Performance Control | Different rate limits per IP
    Billing/Accounting | Track usage per customer IP

    Modern Alternative: Name-Based Hosting

    With SNI support, can now use:
    eth0: 1.2.3.4 (only one IP)
    VirtualHosts:
    <VirtualHost 1.2.3.4:443>
    ServerName www.site1.com
    SSLEngine on
    SSLCertificateFile /path/to/site1.crt
    </VirtualHost>
    <VirtualHost 1.2.3.4:443>
    ServerName www.site2.com
    SSLEngine on
    SSLCertificateFile /path/to/site2.crt
    </VirtualHost>
    Client sends "ServerName" in TLS handshake
    Server picks correct cert based on name
    Multiple sites on single IP!
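The server-side selection step can be sketched as a lookup keyed by the SNI hostname (a toy model; real servers hook this into the TLS stack, e.g. via Python's `ssl.SSLContext.sni_callback`, and the certificate paths are illustrative):

```python
# Pick a certificate based on the hostname the client sent in its
# TLS ClientHello (the SNI extension); fall back to a default vhost.
certs = {
    "www.site1.com": "/path/to/site1.crt",
    "www.site2.com": "/path/to/site2.crt",
    "www.site3.com": "/path/to/site3.crt",
}

def select_cert(sni_hostname):
    return certs.get(sni_hostname, certs["www.site1.com"])  # default vhost

print(select_cert("www.site2.com"))    # /path/to/site2.crt
print(select_cert("unknown.example"))  # /path/to/site1.crt (default)
```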

    Conclusion:

    A web server may have multiple IP addresses on a single interface for virtual hosting (multiple domains), SSL/TLS per domain (pre-SNI era), service isolation (different traffic types), tenant separation (multi-tenant platforms), or performance/billing management. While modern SNI enables multiple sites on one IP, multiple IPs may still be used for security, isolation, legacy compatibility, or administrative control.

  • (l) Briefly describe what HEAD, GET, POST, PUT, PATCH, DELETE HTTP requests are used for.

    Solution

    In short: GET retrieves data; POST sends data for server processing; HEAD is like GET but without response body; PUT replaces an entire resource; PATCH partially updates a resource; DELETE removes a resource. These form the foundation of RESTful APIs.

    Elaboration:

    GET - Retrieve Data

    Purpose: Request data without modifying server state
    Example:
    GET /api/users/123 HTTP/1.1
    Host: api.example.com
    Response:
    200 OK
    {
    "id": 123,
    "name": "John Doe",
    "email": "john@example.com"
    }
    Characteristics:
    - Data in URL query string: GET /users?id=123&sort=name
    - Idempotent (multiple identical requests = same result)
    - Safe (doesn't modify server state)
    - Cacheable (browsers cache GET responses)
    - Bookmarkable
    - Should NOT have request body

    POST - Create or Process Data

    Purpose: Submit data to server for processing (create, process, etc.)
    Example 1: Create new user
    POST /api/users HTTP/1.1
    Host: api.example.com
    Content-Type: application/json
    {
    "name": "Jane Doe",
    "email": "jane@example.com"
    }
    Response:
    201 Created
    Location: /api/users/124
    {
    "id": 124,
    "name": "Jane Doe",
    "email": "jane@example.com"
    }
    Example 2: Form submission
    POST /login HTTP/1.1
    Host: example.com
    Content-Type: application/x-www-form-urlencoded
    username=alice&password=secret123
    Characteristics:
    - Data in request body (hidden from URL)
    - NOT idempotent (repeated requests create multiple resources)
    - NOT safe (modifies server state)
    - Not cached (usually)
    - Not bookmarkable

    HEAD - Retrieve Headers Only

    Purpose: Like GET but without response body (just headers)
    Example:
    HEAD /document.pdf HTTP/1.1
    Host: example.com
    Response:
    200 OK
    Content-Type: application/pdf
    Content-Length: 50000
    Last-Modified: Mon, 01 Jan 2024 10:00:00 GMT
    (no body, but headers tell us file info)
    Use Cases:
    1. Check if resource exists without downloading
    2. Check file size before downloading
    3. Check last modification date
    4. Verify URL validity
    5. Bandwidth-efficient checks
    Characteristics:
    - Same as GET but no response body
    - Faster (no data transfer)
    - Useful for large files
    - Idempotent and safe
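The GET-versus-HEAD contrast can be demonstrated end to end with only the Python standard library: a throwaway local server answers both methods, and the HEAD response carries the same Content-Length header with an empty body (the handler and URL are made up for the demo):

```python
# GET returns headers + body; HEAD returns identical headers, no body.
import http.client
import http.server
import threading

BODY = b"hello world"

class Handler(http.server.BaseHTTPRequestHandler):
    def _send_headers(self):
        self.send_response(200)
        self.send_header("Content-Type", "text/plain")
        self.send_header("Content-Length", str(len(BODY)))
        self.end_headers()

    def do_GET(self):
        self._send_headers()
        self.wfile.write(BODY)       # GET: headers + body

    def do_HEAD(self):
        self._send_headers()         # HEAD: headers only

    def log_message(self, *args):    # keep the demo output quiet
        pass

server = http.server.HTTPServer(("127.0.0.1", 0), Handler)
threading.Thread(target=server.serve_forever, daemon=True).start()
port = server.server_address[1]

get_conn = http.client.HTTPConnection("127.0.0.1", port)
get_conn.request("GET", "/doc")
get_resp = get_conn.getresponse()
get_body = get_resp.read()

head_conn = http.client.HTTPConnection("127.0.0.1", port)
head_conn.request("HEAD", "/doc")
head_resp = head_conn.getresponse()
head_body = head_resp.read()

print(len(get_body))                          # 11
print(head_resp.getheader("Content-Length"))  # 11 — size known without download
print(len(head_body))                         # 0
server.shutdown()
```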

    PUT - Replace Entire Resource

    Purpose: Replace a resource entirely with new data
    Example: Update user 123 completely
    PUT /api/users/123 HTTP/1.1
    Host: api.example.com
    Content-Type: application/json
    {
    "name": "John Smith",
    "email": "john.smith@example.com",
    "phone": "555-1234"
    }
    Response:
    200 OK
    {
    "id": 123,
    "name": "John Smith",
    "email": "john.smith@example.com",
    "phone": "555-1234"
    }
    Key Difference (PUT vs POST):
    - PUT: Client specifies resource ID (PUT /users/123)
    - POST: Server generates resource ID (POST /users)
    Characteristics:
    - Replaces entire resource
    - Client specifies ID in URL
    - Idempotent (PUT twice = same result)
    - If resource doesn't exist: May create it (201) or error (404)

    PATCH - Partial Update

    Purpose: Partially update a resource (only changed fields)
    Example: Update only name field
    PATCH /api/users/123 HTTP/1.1
    Host: api.example.com
    Content-Type: application/json
    {
    "name": "John Smith"
    }
    Current state before PATCH:
    {
    "id": 123,
    "name": "John Doe",
    "email": "john@example.com",
    "phone": "555-0000"
    }
    Response (after PATCH):
    200 OK
    {
    "id": 123,
    "name": "John Smith", ← Changed
    "email": "john@example.com", ← Unchanged
    "phone": "555-0000" ← Unchanged
    }
    PUT (for comparison - replaces all):
    PUT /api/users/123
    { "name": "John Smith" }
    Result with PUT:
    {
    "id": 123,
    "name": "John Smith",
    "email": null, ← Lost!
    "phone": null ← Lost!
    }
    (All fields not specified are removed/nulled)
    Characteristics:
    - Only changed fields required
    - More efficient than PUT
    - Idempotent (usually)
    - Not all servers support PATCH
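The replace-versus-merge semantics can be illustrated with a toy in-memory resource store (a sketch of the convention described above, not any particular framework's behavior):

```python
# PUT replaces the whole resource (id comes from the URL); PATCH merges
# only the supplied fields into the existing resource.
resource = {"id": 123, "name": "John Doe",
            "email": "john@example.com", "phone": "555-0000"}

def put(current, body):
    return {"id": current["id"], **body}   # unspecified fields are dropped

def patch(current, body):
    return {**current, **body}             # unspecified fields survive

print(patch(resource, {"name": "John Smith"}))
# {'id': 123, 'name': 'John Smith', 'email': 'john@example.com', 'phone': '555-0000'}
print(put(resource, {"name": "John Smith"}))
# {'id': 123, 'name': 'John Smith'}
```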

    DELETE - Remove Resource

    Purpose: Delete a resource from server
    Example: Delete user 123
    DELETE /api/users/123 HTTP/1.1
    Host: api.example.com
    Response:
    204 No Content
    (Resource deleted, no body needed)
    Or:
    Response:
    200 OK
    { "message": "User 123 deleted successfully" }
    Characteristics:
    - Removes resource
    - Idempotent (deleting twice has same effect)
    - Safe to call multiple times
    - No request body (usually)
    - May return 404 if already deleted

    Summary Comparison:

    Method | Purpose | Idempotent | Safe | Body
    GET | Retrieve data | Yes | Yes | No
    HEAD | Retrieve headers | Yes | Yes | No
    POST | Create/process | No | No | Yes
    PUT | Replace resource | Yes | No | Yes
    PATCH | Partial update | Yes* | No | Yes
    DELETE | Remove resource | Yes | No | No
    *PATCH is idempotent only if the patch is applied as an absolute update; the HTTP specification does not guarantee it.

    REST API Example:

    Resource: /api/articles/42
    GET /api/articles/42
    → Retrieve article 42
    POST /api/articles
    → Create new article
    PUT /api/articles/42
    → Replace entire article 42
    PATCH /api/articles/42
    → Update some fields of article 42
    DELETE /api/articles/42
    → Delete article 42
    GET /api/articles
    → List all articles

    Conclusion:

    • GET: Read data (safe, idempotent)
    • HEAD: Check headers without body (efficient read)
    • POST: Create new resource or trigger action (not idempotent)
    • PUT: Replace entire resource (idempotent, client-specified ID)
    • PATCH: Partially update resource (idempotent, efficient update)
    • DELETE: Remove resource (idempotent)

    These form the CRUD operations (Create, Read, Update, Delete) for RESTful APIs.

  • (m) Briefly describe the motivation for HTTP/2.0.

    Solution

    In short: HTTP/2.0 was motivated by the need to reduce latency and overhead in HTTP/1.1, which suffered from head-of-line blocking, multiple connection limitations, and inefficient header transmission. HTTP/2 introduced multiplexing, binary framing, and header compression to dramatically improve performance.

    Elaboration:

    Problems with HTTP/1.1:

    Problem 1: Head-of-Line Blocking

    HTTP/1.1 limitation:
    Request 1 →
    Response 1 (takes 500 ms) ←
    Request 2 →
    Response 2 (takes 500 ms) ←
    Request 3 →
    Response 3 (takes 500 ms) ←
    Total: 1500 ms (sequential; pipelining doesn't help much)
    If Request 1 delayed (500 ms wait):
    Requests 2 and 3 stuck waiting
    "Head of line" (first in queue) blocks rest

    Problem 2: Limited Parallelism

    HTTP/1.1: 6-8 parallel connections (browser limit)
    Modern website: 100+ resources
    - 100 JavaScript files
    - 50 images
    - 20 CSS files
    - Fonts, videos, etc.
    Only 6 can download simultaneously
    Rest wait in queue
    Creating more connections wastes resources:
    - TCP handshake overhead
    - TLS negotiation overhead
    - Congestion window reset

    Problem 3: Header Compression Missing

    HTTP/1.1 headers: Plain text, often repeated
    Example request:
    GET /image1.jpg HTTP/1.1
    Host: www.example.com
    User-Agent: Mozilla/5.0...
    Accept-Language: en-US,en;q=0.9
    Accept-Encoding: gzip, deflate, br
    Cookie: session=abc123; user=john; ...
    Size: ~500 bytes
    Next request (same domain):
    GET /image2.jpg HTTP/1.1
    Host: www.example.com
    User-Agent: Mozilla/5.0...
    Accept-Language: en-US,en;q=0.9
    Accept-Encoding: gzip, deflate, br
    Cookie: session=abc123; user=john; ...
    Same 500 bytes again!
    90% of headers are identical
    Massive waste sending headers repeatedly

    Problem 4: Text-Based Overhead

    HTTP/1.1: Text-based parsing
    GET /index.html HTTP/1.1\r\n
    Host: example.com\r\n
    Connection: keep-alive\r\n
    \r\n
    Parser must:
    - Read character by character
    - Find line breaks (\r\n)
    - Parse key: value pairs
    - Handle whitespace variations
    - Error-prone
    Also: Humans can read it (debug), but wasteful

    HTTP/2 Solutions:

    Solution 1: Multiplexing

    HTTP/2: Multiple streams on single connection
    Request 1 → Stream 1 (frames S1:F1, S1:F2, ...)
    Request 2 → Stream 3 (frames S3:F1, ...)
    Request 3 → Stream 5 (frames S5:F1, ...)
    On the wire, frames from all streams interleave:
    [S1:F1][S3:F1][S5:F1][S1:F2][S3:F2]...
    ← Response 1 (Stream 1)
    ← Response 2 (Stream 3)
    ← Response 3 (Stream 5)
    Single connection, all streams active simultaneously
    No head-of-line blocking
    Timeline:
    Time 0: Send req1, req2, req3 (all at once)
    Time 100: Response 1 arrives (quick)
    Time 200: Response 3 arrives (quick)
    Time 300: Response 2 arrives (slow but others not blocked)
    Total: 300 ms (instead of 1500 ms with HTTP/1.1)
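The frame interleaving and per-stream reassembly can be sketched as follows (a toy model: tuples of (stream ID, payload chunk) stand in for real binary frames):

```python
# Frames from three streams arrive interleaved on one connection;
# the receiver reassembles each message by stream ID, so a slow
# stream never blocks delivery of the others.
frames = [(1, "GET /a"), (3, "GET"), (5, "GET /c"), (1, " done"), (3, " /b")]

streams = {}
for stream_id, chunk in frames:
    streams[stream_id] = streams.get(stream_id, "") + chunk

print(streams)  # {1: 'GET /a done', 3: 'GET /b', 5: 'GET /c'}
```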

    Solution 2: Binary Framing

    HTTP/1.1: Text-based
    HTTP/2: Binary framing layer
    Each message split into frames:
    - Headers frame
    - Data frame(s)
    - Trailers frame
    Frame format:
    [Length: 3 bytes][Type: 1 byte][Flags: 1 byte]
    [Stream ID: 4 bytes][Payload: variable]
    Advantages:
    - Efficient parsing (binary, not text)
    - Fixed format, easier to implement
    - Multiplexable (each frame tagged with stream ID)
    - Can prioritize frames

    Solution 3: Header Compression (HPACK)

    HTTP/2: HPACK header compression
    First request:
    Host: www.example.com
    User-Agent: Mozilla/5.0...
    Accept: text/html
    Size: 500 bytes
    Second request (same domain):
    Only changed field: User-Agent
    Instead of resending all 500 bytes:
    Send: "Request 2 uses previous headers except User-Agent"
    Size: 50 bytes (10% of original!)
    How it works:
    - Maintain header table at client and server
    - Reference previous headers by index
    - Only transmit differences
    - Typical compression: 85-90% reduction
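The dynamic-table idea can be sketched as a diff against previously sent headers (a deliberately simplified model: real HPACK also uses a static table, indexed entries, and Huffman coding):

```python
# Sender and receiver keep a synchronized table of header fields;
# each request transmits only the fields that changed.
def encode(headers, table):
    delta = {k: v for k, v in headers.items() if table.get(k) != v}
    table.update(headers)
    return delta  # this is all that goes on the wire

table = {}
req1 = {"host": "www.example.com", "user-agent": "Mozilla/5.0",
        "accept": "text/html"}
req2 = {"host": "www.example.com", "user-agent": "Mozilla/5.0",
        "accept": "image/jpeg"}

print(encode(req1, table))  # full header set on the first request
print(encode(req2, table))  # {'accept': 'image/jpeg'} — only the change
```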

    Solution 4: Server Push

    HTTP/1.1:
    1. Browser requests index.html
    2. Server responds with index.html
    3. Browser parses, sees <link rel="stylesheet" href="style.css">
    4. Browser requests style.css
    5. Server responds
    Latency: 2 round-trips for html + css
    HTTP/2 Server Push:
    1. Browser requests index.html
    2. Server responds with index.html
    3. Server predicts client needs style.css
    4. Server proactively PUSH style.css
    5. Browser receives both in parallel
    Latency: 1 round-trip for html + css
    Browser doesn't need to wait for HTML before knowing about CSS

    Real-World Performance Improvement:

    Benchmark: Loading website with 100 resources
    HTTP/1.1 (6 parallel connections):
    - TCP handshakes: 6 × 50 ms = 300 ms
    - TLS handshakes: 6 × 100 ms = 600 ms
    - Header transmission: 100 × 500 bytes = 50 KB
    - Head-of-line blocking: Significant
    Total: ~3-5 seconds
    HTTP/2 (1 connection, multiplexing):
    - TCP handshake: 1 × 50 ms = 50 ms
    - TLS handshake: 1 × 100 ms = 100 ms
    - Header transmission: Compressed 85% = 7.5 KB
    - No head-of-line blocking
    - Binary efficient parsing
    Total: ~1-2 seconds
    Improvement: 2-3x faster

    Conclusion:

    HTTP/2.0 was motivated by HTTP/1.1’s inefficiencies: head-of-line blocking, limited parallelism (6-8 connections), repeated headers, and text-based overhead. HTTP/2 addressed these through multiplexing (many streams on one connection), binary framing (efficient parsing), header compression (HPACK), and server push (proactive delivery), resulting in 2-3x faster load times while reducing bandwidth usage.

  • (n) Briefly describe the motivation for HTTP/3.0.

    Solution

    In short: HTTP/3.0 was motivated by latency problems with TCP and TLS handshakes on high-latency or lossy networks. HTTP/3 replaces TCP with QUIC (UDP-based), which supports 0-RTT resumption, faster handshakes, connection migration, and independent streams (no TCP-level head-of-line blocking) to improve performance on mobile and unreliable networks.

    Elaboration:

    Problems with HTTP/2 (and TCP):

    Problem 1: TCP Handshake Overhead

    Every new HTTPS connection requires:
    1. TCP handshake (3-way):
    Client → SYN
    Server ← SYN-ACK
    Client → ACK
    Latency: 1 RTT
    2. TLS 1.2 handshake:
    Client → ClientHello
    Server ← ServerHello, Certificate, ...
    Client → ClientKeyExchange, ...
    Server ← Finished
    Client → Finished
    Latency: 2 RTT (minimum)
    Total: 3 RTT before data transmission
    On high-latency networks (100 ms RTT):
    3 × 100 ms = 300 ms just for handshakes!
    Mobile: Even worse (high latency, variable)

    Problem 2: Head-of-Line Blocking at TCP Layer

    HTTP/2 multiplexes over TCP, but TCP itself has HoL blocking:
    TCP guarantees ordered delivery:
    Packet 1 (Request A) sent at time T
    Packet 2 (Request B) sent at time T+10ms
    Network: Packet 1 dropped, Packet 2 arrives
    TCP must:
    1. Wait for Packet 1 retransmission
    2. Before delivering Packet 2 to application
    Even though Packet 2 arrived, application must wait
    HTTP/2 streams are blocked by TCP HoL blocking
    QUIC: Streams are independent at the transport layer
    A dropped packet only delays its own stream
    Other streams unaffected

    Problem 3: Connection Establishment Too Expensive

    HTTP/2 over TCP:
    - Users expect instant page load
    - Establish connection on first request
    - 3 RTT overhead unacceptable
    Mobile example:
    - RTT: 100 ms (common on 4G)
    - 3 RTT = 300 ms just for connection setup
    - Adds to total load time
    Reusing connection helps, but:
    - Network change (WiFi to cellular): Connection drops
    - Roaming between networks: Connection dies
    - Mobile users move frequently
    - Each new connection: 300 ms penalty

    Problem 4: Congestion Control Per Connection

    HTTP/2 scenario:
    Single TCP connection for multiple streams
    One packet loss → affects ALL streams
    Example:
    - Stream 1 (video): Can tolerate loss
    - Stream 2 (API call): Needs low latency
    Packet loss detected:
    TCP backs off (exponential backoff)
    Both streams slow down equally
    Inefficient: Video stream can wait, API can't

    HTTP/3 Solutions:

    Solution 1: QUIC Protocol (UDP-based)

    HTTP/3 uses QUIC instead of TCP
    QUIC = Quick UDP Internet Connections (the original expansion of the name)
    Runs on UDP (connectionless, fast)
    Implements reliability, ordering, and encryption at the QUIC layer (not TCP)
    Advantages:
    - No TCP handshake overhead
    - Faster connection setup
    - Connection migration support
    - Per-stream flow control
    - Independent per-stream loss recovery (no cross-stream blocking)

    Solution 2: 0-RTT Connection Establishment

    TLS 1.3 + QUIC enable 0-RTT:
    First connection:
    Client → Initial (with ClientHello data)
    Server ← Response + data
    Latency: 1 RTT (down from 3!)
    Resumed connection (within session ticket):
    Client → ClientHello from cache
    Server ← Data
    Client sends HTTP request with first packet!
    Latency: 0 RTT (literally sent with setup!)
    Example on 100 ms RTT:
    HTTP/2: 300 ms (3 RTT) setup
    HTTP/3: 0 ms (0 RTT) for resumed, 100 ms (1 RTT) for fresh

    Solution 3: Per-Stream Flow Control and Loss Recovery

    HTTP/2 over TCP problem:
    Single connection → single ordered byte stream
    One dropped packet → all streams wait for the retransmission
    HTTP/3 QUIC solution:
    Congestion control is still per connection, but loss recovery
    and flow control are per stream
    Scenario:
    - Stream 1 (video): waiting on a retransmitted packet
    - Stream 2 (API): keeps delivering data to the application
    - Streams don't block each other
    Video doesn't block API from progressing
    Better performance for mixed traffic

    Solution 4: Connection Migration

    TCP problem:
    Connection identified by IP:port pair
    IP changes → TCP connection drops
    User scenario:
    1. Download starts on WiFi
    2. User walks out of WiFi range
    3. Device switches to cellular (IP changes)
    4. TCP connection drops
    5. Download restarts (wasted time, data)
    QUIC solution:
    Connections identified by Connection ID (not IP)
    IP change → Connection continues!
    User scenario with HTTP/3:
    1. Download starts on WiFi (QUIC connection)
    2. User walks out of range
    3. Device switches to cellular (IP changes)
    4. QUIC sends: "Still me, same Connection ID"
    5. Download resumes seamlessly
    6. No latency penalty, no data loss
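The Connection-ID idea can be sketched as follows (a toy model with made-up names, not a real QUIC implementation):

```python
# The server keys session state by Connection ID rather than by the
# client's (IP, port), so an address change mid-transfer just updates
# the path while the transfer state carries on.
connections = {}

def receive(conn_id, client_addr, payload):
    conn = connections.setdefault(conn_id, {"bytes": 0})
    conn["addr"] = client_addr        # path may change freely
    conn["bytes"] += len(payload)     # session state survives the change
    return conn

receive("c1", ("10.0.0.5", 50000), b"part1")             # on WiFi
state = receive("c1", ("172.16.9.9", 50001), b"part2")   # now on cellular
print(state["bytes"])  # 10 — download continued on the same connection
```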

    Solution 5: Faster Handshakes

    QUIC handshake (1 RTT minimum):
    Client → Initial packet
    Server ← Handshake packet + encrypted data
    Client → Acknowledgment
    Then: HTTP/3 request sent immediately
    Much faster than TCP (1 RTT) + TLS 1.2 (2 RTT) = 3 RTT before data

    Real-World Performance Impact:

    Scenario: Mobile user downloads webpage
    HTTP/1.1:
    - Network change: Connection drops
    - Must reconnect: TCP (1 RTT) + TLS (2 RTT) = 300 ms
    - Then: Re-download = slow
    HTTP/2:
    - Same problem: TCP + TLS overhead on reconnect
    - Still multiplexed, but same TCP latency issue
    HTTP/3:
    - Network change: QUIC migrates automatically
    - Connection continues: 0 RTT overhead
    - Download resumes instantly
    - Perception: Seamless, fast

    Comparison:

    Aspect | HTTP/2 (TCP) | HTTP/3 (QUIC)
    Initial handshake | 1 RTT (TCP) + 2 RTT (TLS 1.2) = 3 RTT | 1 RTT fresh, 0-RTT resumed
    Head-of-line blocking | At TCP layer | Per-stream only
    Connection migration | Breaks on IP change | Seamless (Connection ID)
    Mobile friendliness | Poor (reconnects drop) | Excellent (transparent migration)
    Setup at 100 ms RTT | ~300 ms | ~100 ms (0 ms resumed)

    Deployment Status:

    HTTP/3 adoption:
    - Chrome: Full support (2020+)
    - Firefox: Full support (2021+)
    - Safari: Full support (2022+)
    - Major sites: Google, Facebook, Cloudflare, etc.
    Benefits visible on:
    - Mobile networks (high RTT)
    - Network changes (WiFi → cellular)
    - High packet loss scenarios

    Conclusion:

    HTTP/3.0 was motivated by TCP’s overhead (3 RTT of combined TCP and TLS handshakes) and TCP-level head-of-line blocking, which hurt performance on mobile and unreliable networks. QUIC (UDP-based) provides 0-RTT resumption, connection migration without dropping, independent streams that do not block one another on packet loss, and faster handshakes. This results in dramatically better performance on mobile, high-latency, and unstable networks where connection drops are common.

Problem 2: Dynamic Host Configuration Protocol

Section titled “Problem 2: Dynamic Host Configuration Protocol”

We discussed in class that a host’s IP address can either be configured manually, or by Dynamic Host Configuration Protocol (DHCP).

  • (a) Describe the advantages and disadvantages of each approach.

    Solution

    Manual IP configuration assigns fixed addresses, providing stability and predictability but suffering from poor scalability, configuration errors, and IP conflicts. DHCP automatically assigns IP addresses and network parameters, scales well, and reduces errors, but depends on a DHCP server and may result in changing IP addresses.

  • (b) Describe how a host gets an IP address using DHCP.

    Solution

    A host uses DHCP via the DORA process: it broadcasts DHCPDISCOVER, receives a DHCPOFFER, responds with DHCPREQUEST, and receives DHCPACK, after which it configures its network interface.
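The DORA exchange can be sketched as a message sequence (a toy simulation with made-up addresses; real DHCP runs over UDP ports 67/68 with broadcast messages and lease timers):

```python
# Walk through Discover → Offer → Request → Ack for one client.
def dhcp_exchange(address_pool):
    log = [("client", "DHCPDISCOVER")]                 # broadcast: any servers?
    offered = address_pool[0]                          # server picks a free address
    log.append(("server", f"DHCPOFFER {offered}"))
    log.append(("client", f"DHCPREQUEST {offered}"))   # client accepts the offer
    log.append(("server", f"DHCPACK {offered}"))       # lease confirmed
    return offered, log

ip, log = dhcp_exchange(["192.168.1.50", "192.168.1.51"])
print(ip)        # 192.168.1.50
print(len(log))  # 4 messages: Discover, Offer, Request, Ack
```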

Problem 3: Video Streaming Protocol Selection

Section titled “Problem 3: Video Streaming Protocol Selection”

Consider an application where a camera at a highway is capturing video of the passing cars at 30 frames/second and sending the video stream to a remote video viewing station over the Internet. You are hired to design an application-layer protocol to solve this problem. Which transport-layer protocol, UDP or TCP, would you use for this application and why? Justify your answer.

Solution

UDP is preferred for real-time video streaming because it is delay-sensitive and can tolerate packet loss. UDP avoids retransmissions, congestion control delays, and head-of-line blocking present in TCP, resulting in smoother playback.

Problem 4: DNS Recursive vs Iterative Queries

Section titled “Problem 4: DNS Recursive vs Iterative Queries”

Consider a host H within qc.cuny.edu domain, whose name server is ns.qc.cuny.edu. Suppose that H tries to learn the IP address of the host ringding.cs.umd.edu. Assume that ns.qc.cuny.edu does not have the IP address of ringding.cs.umd.edu in its cache. Further assume that root DNS servers only know the authoritative name server for umd.edu domain.

  • (a) Describe how the IP address of ringding.cs.umd.edu will be resolved assuming no DNS server implements recursive queries.

    Solution

    With iterative queries, the local DNS server queries the root server, then the umd.edu server, then the cs.umd.edu server, finally obtaining the IP address of ringding.cs.umd.edu and returning it to the host.

  • (b) Redo (a) assuming ALL DNS servers implement recursive queries.

    Solution

    With recursive queries, the host sends one query to its local DNS server, which recursively contacts all necessary DNS servers and returns the final IP address.

Problem 5: DNS and HTTP Web Page Retrieval

Section titled “Problem 5: DNS and HTTP Web Page Retrieval”

Suppose within your Web browser you click on a link to obtain a Web page. Suppose that the IP address for the associated URL is not cached in your local host so that a DNS look-up is necessary to obtain the IP address. Suppose that n DNS servers are visited before your host receives the IP address from DNS; the successive visits incur RTTs of RTT_1, ..., RTT_n. Let RTT_0 be the RTT between your local host and the Web server containing the Web page and let R bits/sec be the sustained bandwidth between them.

  • (a) Suppose that the Web page consists of a single object of size L bits. Further suppose that the DNS queries are sent over UDP. How much time elapses from when the client clicks on the link until the client receives the object?

    Solution

    Total time: RTT_1 + ... + RTT_n + 2·RTT_0 + L/R, where RTT_1, ..., RTT_n are the RTTs of the successive DNS server visits (over UDP, one round trip each), 2·RTT_0 covers the TCP handshake plus the HTTP request/response, and L/R is the transmission time of the object.

  • (b) Redo (a) assuming that the DNS queries are sent over TCP.

    Solution

    Total time: 2(RTT_1 + ... + RTT_n) + 2·RTT_0 + L/R. Each DNS query over TCP now costs an extra round trip for the TCP handshake, doubling the DNS portion relative to (a).

  • (c) Assume that the user clicks on a link within the just downloaded page and starts downloading a new web page of size L' bits residing at the same server. Assume this page also consists of a single object. How much time elapses from when the client clicks on the new link until the client receives the new object? Assume that the DNS uses UDP as in (a).

    Solution

    Total time: 2·RTT_0 + L'/R, where L' is the size of the new page. The server's IP address is now cached locally, so no DNS lookup is needed; the new TCP connection costs one RTT_0 for the handshake and one for the HTTP request/response.

  • (d) Now assume that the web page to be downloaded in (c) has 6 other embedded objects, each of size L'' bits. Assuming that the Web browser implements HTTP/1.0 with non-persistent connections and no parallel TCP connections, how much time elapses from when the client clicks on the new link until the client receives all objects?

    Solution

    Total time: (2·RTT_0 + L'/R) for the base page plus 6 × (2·RTT_0 + L''/R) for the embedded objects, i.e., 14·RTT_0 + L'/R + 6·L''/R. With non-persistent connections and no parallelism, each object pays a fresh TCP handshake and request/response.

  • (e) Redo (d) assuming the Web browser implements 4-parallel TCP connections with non-persistent connections.

    Solution

    Total time: (2·RTT_0 + L'/R) for the base page, then the 6 objects in two rounds (4 in parallel, then 2). Assuming the bandwidth R is shared among parallel connections, this gives (2·RTT_0 + L'/R) + (2·RTT_0 + 4·L''/R) + (2·RTT_0 + 2·L''/R) = 6·RTT_0 + L'/R + 6·L''/R.

  • (f) Redo (d) assuming the Web client uses HTTP/1.1 with persistent connections (no pipelining).

    Solution

    Total time: (2·RTT_0 + L'/R) for the base page plus 6 × (RTT_0 + L''/R) over the reused connection, i.e., 8·RTT_0 + L'/R + 6·L''/R. The handshake is paid once; without pipelining, each object still costs one request/response RTT.
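A quick numeric sanity check of parts (a)-(f), with assumed illustrative values not given in the problem (RTT_0 = 100 ms, three DNS visits at 20 ms each, all sizes 1 Mbit, R = 10 Mbps); the closed-form totals used for each part appear in the comments:

```python
# Evaluate the standard closed-form totals for each part of Problem 5.
RTT0 = 0.100                      # s, host <-> web server
dns_rtts = [0.020, 0.020, 0.020]  # s, the n = 3 DNS visits
L = Lp = Lo = 1e6                 # bits: first page, new page, embedded objects
R = 10e6                          # bits/sec

def tx(bits):                     # transmission time
    return bits / R

t_a = sum(dns_rtts) + 2 * RTT0 + tx(L)        # DNS over UDP
t_b = 2 * sum(dns_rtts) + 2 * RTT0 + tx(L)    # DNS over TCP (extra handshake)
t_c = 2 * RTT0 + tx(Lp)                       # IP cached, same server
t_d = 14 * RTT0 + tx(Lp) + 6 * tx(Lo)         # HTTP/1.0, sequential
t_e = 6 * RTT0 + tx(Lp) + 6 * tx(Lo)          # 4 parallel, shared bandwidth
t_f = 8 * RTT0 + tx(Lp) + 6 * tx(Lo)          # HTTP/1.1 persistent
print(round(t_a, 3), round(t_b, 3), round(t_c, 3),
      round(t_d, 3), round(t_e, 3), round(t_f, 3))
```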

Problem 6: Instant Messaging System Architecture

Section titled “Problem 6: Instant Messaging System Architecture”

Suppose you were to implement an instant message such as Yahoo messenger, which allows any number of users to exist in the system and establish instant messaging sessions among them.

  • (a) Describe the architecture of your system (system components, protocol messages exchanged etc.) to enable users to dynamically learn each other’s current IP addresses and port numbers so that they can seamlessly start instant messaging sessions.

    Solution

    Users register IP addresses and ports with a centralized directory server, which clients query to discover peers, while messages are exchanged peer-to-peer.

  • (b) Suppose you were to allow users to have “buddy lists” and learn about the current communication status of their buddies. How would you extend your system to enable this feature?

    Solution

Each user uploads a buddy list to the directory server. The server tracks every user's presence (online, away, offline) from registrations, deregistrations, and explicit status-update messages, and whenever a user's status changes it pushes a notification to every online user whose buddy list contains that user.
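A minimal sketch of the directory-server state and message handling for parts (a) and (b); all class, method, and message names here are hypothetical, not part of any real protocol.

```python
class DirectoryServer:
    """Centralized registry: maps each user to (ip, port, status) plus a buddy list."""

    def __init__(self):
        self.users = {}    # name -> {"addr": (ip, port), "status": str}
        self.buddies = {}  # name -> set of buddy names

    def register(self, name, ip, port):
        # REGISTER: a user logs in and publishes its contact address.
        self.users[name] = {"addr": (ip, port), "status": "online"}
        self._notify_buddies_of(name)

    def lookup(self, name):
        # LOOKUP: a client asks where a peer can be reached; it then
        # talks to that peer directly (messages stay peer-to-peer).
        entry = self.users.get(name)
        return entry["addr"] if entry else None

    def add_buddy(self, name, buddy):
        self.buddies.setdefault(name, set()).add(buddy)

    def set_status(self, name, status):
        # STATUS: a presence change, pushed out to interested buddies.
        self.users[name]["status"] = status
        self._notify_buddies_of(name)

    def buddy_statuses(self, name):
        # What the client renders in its buddy-list window.
        return {b: self.users.get(b, {}).get("status", "offline")
                for b in self.buddies.get(name, set())}

    def _notify_buddies_of(self, name):
        # A real server would push NOTIFY messages to every user whose
        # buddy list contains `name`; this sketch omits the networking.
        pass
```

The design choice worth noting is that only discovery and presence are centralized; message delivery is left to the peers, which keeps the server's load proportional to logins and status changes rather than chat volume.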

Problem 7: Web Proxy Caching

Assume that Queens College decided to use a Web proxy, i.e., a Web cache. In this model, each Web browser is set up to send its requests to the Web proxy rather than directly to the actual Web server. Recall that a Web browser also maintains a local cache. Suppose a user accesses 100 objects one after the other using HTTP/1.0. The size of each object is 10000 bits. Assume that the sustained bandwidth between the user's PC and the Web proxy is 10 Mbps with an RTT of 1 ms, and the sustained bandwidth between the Web proxy and a Web server is 1 Mbps with an RTT of 100 ms.

  • (a) What is the average object retrieval time in the absence of any cache hits?

    Solution

With no cache hits, every request traverses both hops. Per object: (RTT + transmission) on the PC–proxy hop plus (RTT + transmission) on the proxy–server hop = (1 ms + 10000 bits / 10 Mbps) + (100 ms + 10000 bits / 1 Mbps) = 2 ms + 110 ms = 112 ms. (Counting only the RTTs and ignoring transmission times gives the rougher estimate of ~101 ms.)

  • (b) Assume that of all user requests, 20 percent is found in the Browser cache, half of the remaining user requests are satisfied from the Web Proxy cache, and the remaining requests make it up to the Web server. What’s the average object retrieval time now?

    Solution

A browser-cache hit takes roughly 0 ms, a proxy-cache hit takes 1 ms + 1 ms = 2 ms (one PC–proxy RTT plus transmission), and a full miss takes 112 ms as in (a). The average is 0.2×0 + 0.4×2 + 0.4×112 = 45.6 ms.
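The arithmetic behind both parts can be checked with a short script. The model here counts one RTT plus the transmission time on each hop, and assumes a browser-cache hit takes negligible time:

```python
O = 10_000                          # object size, bits
R_PROXY, RTT_PROXY = 10e6, 0.001    # PC <-> proxy: 10 Mbps, 1 ms
R_SERVER, RTT_SERVER = 1e6, 0.100   # proxy <-> server: 1 Mbps, 100 ms

# Miss at both caches: pay RTT + transmission on both hops.
t_miss = (RTT_PROXY + O / R_PROXY) + (RTT_SERVER + O / R_SERVER)

# Hit in the proxy cache: only the fast PC <-> proxy hop.
t_proxy_hit = RTT_PROXY + O / R_PROXY

# Hit in the browser cache: assumed ~0.
t_browser_hit = 0.0

# 20% browser hits, 40% proxy hits, 40% full misses.
avg = 0.2 * t_browser_hit + 0.4 * t_proxy_hit + 0.4 * t_miss
print(t_miss * 1000, t_proxy_hit * 1000, avg * 1000)  # milliseconds
```

Note how heavily the average is dominated by the 40% of requests that cross the slow proxy–server link; improving the proxy's hit rate matters far more than speeding up the PC–proxy hop.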

Problem 8: Alert Notification Protocol Selection

Consider a network-attached burglar alarm which is programmed to notify the police when a burglar enters the house. Suppose that you are to use either HTTP or SMTP to send the notification message. How would you use each protocol to send the message? Which protocol makes more sense to use for this application?

Solution

With HTTP, the alarm would act as an HTTP client and POST the notification to a Web server at the police station; with SMTP, it would act as a mail client and send the notification as an e-mail to a police mailbox. SMTP makes more sense for this application: its store-and-forward infrastructure queues the message and retries delivery if a mail server is temporarily unreachable, whereas a plain HTTP POST fails outright if the police server is down at that moment. (HTTP's advantage, an immediate end-to-end confirmation, would matter only if the alarm must know the alert was actually received.)
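One way to make the comparison concrete is to write out what each notification would look like on the wire. The hostnames, addresses, and payloads below are all invented for illustration:

```python
def http_alert(host="police.example.org", path="/alert"):
    # HTTP option: the alarm is an HTTP client POSTing to the police's server.
    # Delivery fails immediately if that one server is unreachable.
    body = "event=burglary&location=123+Main+St"
    return (f"POST {path} HTTP/1.1\r\n"
            f"Host: {host}\r\n"
            f"Content-Type: application/x-www-form-urlencoded\r\n"
            f"Content-Length: {len(body)}\r\n"
            f"\r\n{body}")

def smtp_alert(sender="alarm@home.example", rcpt="dispatch@police.example.org"):
    # SMTP option: the alarm hands the message to its local mail server,
    # which stores and forwards it, retrying until delivery succeeds.
    return ["HELO home.example",
            f"MAIL FROM:<{sender}>",
            f"RCPT TO:<{rcpt}>",
            "DATA",
            "Subject: BURGLARY ALERT\r\n\r\nBurglar detected at 123 Main St.",
            ".",
            "QUIT"]
```

The SMTP dialogue looks heavier, but the retry behavior lives in the mail servers, not in the resource-constrained alarm device, which is exactly the point of the comparison.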

Problem 9: Sending E-mail with Attachments

Suppose you want to send an e-mail message M with 4 attachments, A1, A2, A3 and A4. Describe how your e-mail client, e.g., Outlook, would send this e-mail.

Solution

The client builds a single MIME multipart/mixed message: the text M becomes the first part, and each attachment Ai becomes its own part with a Content-Type header describing its format, Base64-encoded so the binary data survives SMTP's 7-bit text transport. The assembled message is then handed to the client's outgoing mail server over SMTP.
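A sketch of that construction using Python's standard email library (the addresses, filenames, and attachment contents are made up); a real client would then hand `wire` to its outgoing SMTP server:

```python
from email.message import EmailMessage

msg = EmailMessage()
msg["From"] = "alice@example.com"
msg["To"] = "bob@example.com"
msg["Subject"] = "Message with 4 attachments"
msg.set_content("See the four attached files.")  # the body, M

# Each attachment becomes its own MIME part; binary payloads are
# Base64-encoded automatically so they survive 7-bit SMTP transport.
for i in range(1, 5):
    msg.add_attachment(f"contents of A{i}".encode(),
                       maintype="application", subtype="octet-stream",
                       filename=f"A{i}.bin")

wire = msg.as_string()  # the multipart/mixed text SMTP actually carries
```

Printing `wire` shows the boundary strings separating the five parts and the `Content-Transfer-Encoding: base64` header on each attachment.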

Problem 10: Mail Access Protocols

What are POP3 and IMAP used for? What are the advantages of IMAP over POP3?

Solution

Both are mail access protocols: a client uses them to retrieve mail from its mail server (SMTP only pushes mail toward the server). POP3 typically downloads messages to the local machine and deletes them from the server, while IMAP keeps messages on the server, organizes them in server-side folders, and synchronizes read/unread state across clients; IMAP also lets a client fetch just message headers or individual MIME parts, which helps over slow links. This makes IMAP far more flexible for users who read mail from several devices.