Foundations of Python Network Programming 978-1-4302-3004-5
CHAPTER 3 ■ TCP

$ python tcp_deadlock.py client 1073741824
Sending 1073741824 bytes of data, in chunks of 16 bytes
734816 bytes sent
Why have both client and server been brought to a halt?

The answer is that the server's output buffer and the client's input buffer have both finally filled, and TCP has used its window adjustment protocol to signal this fact and stop the socket from sending more data that would have to be discarded and later re-sent.
Consider what happens as each block of data travels. The client sends it with sendall(). Then the server accepts it with recv(), processes it, and then transmits its capitalized version back out with another sendall() call. And then what? Well, nothing! The client is never running any recv() calls, not while it still has data to send, so more and more capitalized data backs up until the operating system is not willing to accept any more.
During the run shown previously, about 600KB was buffered by the operating system in the client's incoming queue before the network stack decided that it was full. At that point, the server blocks in its sendall() call, and is paused there by the operating system until the logjam clears and it can send more data. With the server no longer processing data or running any more recv() calls, it is now the client's turn to have data start backing up. The operating system seems to have placed a limit of around 130KB on the amount of data it would queue up in that direction, because the client got roughly another 130KB into producing the stream before finally being brought to a halt as well.
On a different system, you will probably find that different limits are reached. So the foregoing numbers are arbitrary and based on the mood of my laptop at the moment; they are not at all inherent in the way TCP works.
And the point of this example is to teach you two things, besides, of course, showing that recv(1024) indeed returns fewer bytes than 1,024 if a smaller number are immediately available!
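You can see that short-read behavior in isolation with a small sketch of my own (not one of the book's listings), using socket.socketpair() as a stand-in for a real client/server connection:

```python
import socket

# A connected pair of sockets, standing in for a real client and server.
a, b = socket.socketpair()
b.sendall(b'hi')        # only two bytes are in flight

data = a.recv(1024)     # ask for up to 1,024 bytes...
print(len(data))        # ...but receive only the 2 that were available

a.close()
b.close()
```

The call does not wait around hoping that 1,022 more bytes will arrive; it returns whatever the incoming buffer already holds.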
First, this example should make much more concrete the idea that there are buffers sitting inside the TCP stacks on each end of a network connection. These buffers can hold data temporarily so that packets do not have to be dropped and eventually re-sent if they arrive at a moment when their reader does not happen to be inside of a recv() call. But the buffers are not limitless; eventually, a TCP routine trying to write data that is never being received or processed is going to find itself no longer able to write, until some of the data is finally read and the buffer starts to empty.
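You can watch those buffers fill without writing a whole client and server. The following sketch (my own illustration, with hypothetical names) builds a TCP connection across the loopback interface, never reads from one end, and keeps writing until a one-second timeout on send() proves that the kernel's buffers are full:

```python
import socket

# Build a connected TCP pair over the loopback interface; nothing
# ever reads from `receiver`, so the buffers can only fill up.
listener = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
listener.bind(('127.0.0.1', 0))
listener.listen(1)
sender = socket.create_connection(listener.getsockname())
receiver, _ = listener.accept()
listener.close()

sender.settimeout(1.0)  # give up instead of blocking forever
total = 0
try:
    while True:
        total += sender.send(b'x' * 1024)
except socket.timeout:
    pass

print('the kernel buffered %d bytes before send() stalled' % total)
sender.close()
receiver.close()
```

The exact count you see depends, as noted above, on your operating system's buffer sizes, not on anything inherent in TCP.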
Second, this example makes clear the dangers involved in protocols that do not alternate lock-step between the client requesting and the server acknowledging. If a protocol is not strict about the server reading a complete request until the client is done sending, and then sending a complete response in the other direction, then a situation like that created here can cause both of them to freeze without any recourse other than killing the program manually, and then rewriting it to improve its design!
But how, then, are network clients and servers supposed to process large amounts of data without entering deadlock? There are, in fact, two possible answers. Either they can use socket options to turn off blocking, so that calls like send() and recv() return immediately if they find that they cannot send any data yet. We will learn more about this option in Chapter 7, where we look in earnest at the possible ways to architect network server programs.
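As a quick sketch of that first option (not Chapter 7's actual code), a socket placed in non-blocking mode with setblocking(False) raises BlockingIOError instead of pausing when no data is ready:

```python
import select
import socket

a, b = socket.socketpair()   # stand-in for a real TCP connection
a.setblocking(False)         # make a's calls return immediately

blocked_first = False
try:
    a.recv(1024)             # nothing has arrived yet...
except BlockingIOError:
    blocked_first = True     # ...so the call refuses to wait

b.sendall(b'hello')
select.select([a], [], [], 1.0)   # pause until data has actually arrived
data = a.recv(1024)               # now recv() succeeds at once
print(blocked_first, data)

a.close()
b.close()
```

The program is then free to do other work between attempts, instead of sitting inside a blocked call.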
Or, the programs can use one of several techniques to process data from several inputs at a time, either by splitting into separate threads or processes (one tasked with sending data into a socket, perhaps, and another tasked with reading data back out) or by running operating system calls like select() or poll() that let them wait on busy outgoing and incoming sockets at the same time, and respond to whichever is ready.
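Here is a tiny sketch of the select() technique, again using a socketpair as a stand-in for two connected network peers:

```python
import select
import socket

a, b = socket.socketpair()   # stand-in for two connected TCP peers
b.sendall(b'ping')           # data is now waiting at socket `a`

# Instead of blocking inside recv() on one particular socket, ask the
# operating system which of several sockets is ready, then service it.
readable, writable, _ = select.select([a, b], [a, b], [], 1.0)
message = None
for sock in readable:
    message = sock.recv(1024)
    print('ready to read:', message)

a.close()
b.close()
```

Because select() watches every socket at once, a program built this way can keep sending on one connection even while it waits for data to arrive on another, which is exactly what the deadlocked client above failed to do.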
Finally, note carefully that the foregoing scenario cannot ever happen when you are using UDP! This is because UDP does not implement flow control. If more datagrams are arriving than can be processed, then UDP can simply discard some of them, and leave it up to the application to discover that they went missing.