Copyright
前言
第一章：數(shù)據(jù)結(jié)構(gòu)和算法
1. 1.1 解壓序列賦值給多個(gè)變量
2. 1.2 解壓可迭代對象賦值給多個(gè)變量
3. 1.3 保留最后N個(gè)元素
4. 1.4 查找最大或最小的N個(gè)元素
5. 1.5 實(shí)現(xiàn)一個(gè)優(yōu)先級隊(duì)列
6. 1.6 字典中的鍵映射多個(gè)值
7. 1.7 字典排序
8. 1.8 字典的運(yùn)算
9. 1.9 查找兩字典的相同點(diǎn)
10. 1.10 刪除序列相同元素并保持順序
11. 1.11 命名切片
12. 1.12 序列中出現(xiàn)次數(shù)最多的元素
13. 1.13 通過某個(gè)關(guān)鍵字排序一個(gè)字典列表
14. 1.14 排序不支持原生比較的對象
15. 1.15 通過某個(gè)字段將記錄分組
16. 1.16 過濾序列元素
17. 1.17 從字典中提取子集
18. 1.18 映射名稱到序列元素
19. 1.19 轉(zhuǎn)換并同時(shí)計(jì)算數(shù)據(jù)
20. 1.20 合并多個(gè)字典或映射
第二章：字符串和文本
1. 2.1 使用多個(gè)界定符分割字符串
2. 2.2 字符串開頭或結(jié)尾匹配
3. 2.3 用Shell通配符匹配字符串
4. 2.4 字符串匹配和搜索
5. 2.5 字符串搜索和替換
6. 2.6 字符串忽略大小寫的搜索替換
7. 2.7 最短匹配模式
8. 2.8 多行匹配模式
9. 2.9 將Unicode文本標(biāo)準(zhǔn)化
10. 2.10 在正則式中使用Unicode
11. 2.11 刪除字符串中不需要的字符
12. 2.12 審查清理文本字符串
13. 2.13 字符串對齊
14. 2.14 合并拼接字符串
15. 2.15 字符串中插入變量
16. 2.16 以指定列寬格式化字符串
17. 2.17 在字符串中處理html和xml
18. 2.18 字符串令牌解析
19. 2.19 實(shí)現(xiàn)一個(gè)簡單的遞歸下降分析器
20. 2.20 字節(jié)字符串上的字符串操作
第三章：數(shù)字日期和時(shí)間
1. 3.1 數(shù)字的四舍五入
2. 3.2 執(zhí)行精確的浮點(diǎn)數(shù)運(yùn)算
3. 3.3 數(shù)字的格式化輸出
4. 3.4 二八十六進(jìn)制整數(shù)
5. 3.5 字節(jié)到大整數(shù)的打包與解包
6. 3.6 復(fù)數(shù)的數(shù)學(xué)運(yùn)算
7. 3.7 無窮大與NaN
8. 3.8 分?jǐn)?shù)運(yùn)算
9. 3.9 大型數(shù)組運(yùn)算
10. 3.10 矩陣與線性代數(shù)運(yùn)算
11. 3.11 隨機(jī)選擇
12. 3.12 基本的日期與時(shí)間轉(zhuǎn)換
13. 3.13 計(jì)算最后一個(gè)周五的日期
14. 3.14 計(jì)算當(dāng)前月份的日期范圍
15. 3.15 字符串轉(zhuǎn)換為日期
16. 3.16 結(jié)合時(shí)區(qū)的日期操作
第四章：迭代器與生成器
1. 4.1 手動遍歷迭代器
2. 4.2 代理迭代
3. 4.3 使用生成器創(chuàng)建新的迭代模式
4. 4.4 實(shí)現(xiàn)迭代器協(xié)議
5. 4.5 反向迭代
6. 4.6 帶有外部狀態(tài)的生成器函數(shù)
7. 4.7 迭代器切片
8. 4.8 跳過可迭代對象的開始部分
9. 4.9 排列組合的迭代
10. 4.10 序列上索引值迭代
11. 4.11 同時(shí)迭代多個(gè)序列
12. 4.12 不同集合上元素的迭代
13. 4.13 創(chuàng)建數(shù)據(jù)處理管道
14. 4.14 展開嵌套的序列
15. 4.15 順序迭代合并后的排序迭代對象
16. 4.16 迭代器代替while無限循環(huán)
第五章：文件與IO
1. 5.1 讀寫文本數(shù)據(jù)
2. 5.2 打印輸出至文件中
3. 5.3 使用其他分隔符或行終止符打印
4. 5.4 讀寫字節(jié)數(shù)據(jù)
5. 5.5 文件不存在才能寫入
6. 5.6 字符串的I/O操作
7. 5.7 讀寫壓縮文件
8. 5.8 固定大小記錄的文件迭代
9. 5.9 讀取二進(jìn)制數(shù)據(jù)到可變緩沖區(qū)中
10. 5.10 內(nèi)存映射的二進(jìn)制文件
11. 5.11 文件路徑名的操作
12. 5.12 測試文件是否存在
13. 5.13 獲取文件夾中的文件列表
14. 5.14 忽略文件名編碼
15. 5.15 打印不合法的文件名
16. 5.16 增加或改變已打開文件的編碼
17. 5.17 將字節(jié)寫入文本文件
18. 5.18 將文件描述符包裝成文件對象
19. 5.19 創(chuàng)建臨時(shí)文件和文件夾
20. 5.20 與串行端口的數(shù)據(jù)通信
21. 5.21 序列化Python對象
第六章：數(shù)據(jù)編碼和處理
1. 6.1 讀寫CSV數(shù)據(jù)
2. 6.2 讀寫JSON數(shù)據(jù)
3. 6.3 解析簡單的XML數(shù)據(jù)
4. 6.4 增量式解析大型XML文件
5. 6.5 將字典轉(zhuǎn)換為XML
6. 6.6 解析和修改XML
7. 6.7 利用命名空間解析XML文檔
8. 6.8 與關(guān)系型數(shù)據(jù)庫的交互
9. 6.9 編碼和解碼十六進(jìn)制數(shù)
10. 6.10 編碼解碼Base64數(shù)據(jù)
11. 6.11 讀寫二進(jìn)制數(shù)組數(shù)據(jù)
12. 6.12 讀取嵌套和可變長二進(jìn)制數(shù)據(jù)
13. 6.13 數(shù)據(jù)的累加與統(tǒng)計(jì)操作
第八章：類與對象
1. 8.1 改變對象的字符串顯示
2. 8.2 自定義字符串的格式化
3. 8.3 讓對象支持上下文管理協(xié)議
4. 8.4 創(chuàng)建大量對象時(shí)節(jié)省內(nèi)存方法
5. 8.5 在類中封裝屬性名
6. 8.6 創(chuàng)建可管理的屬性
7. 8.7 調(diào)用父類方法
8. 8.8 子類中擴(kuò)展property

第七章：函數(shù)

第九章：元編程

第十章：模塊與包

第十一章：網(wǎng)絡(luò)與Web編程

第十二章：并發(fā)編程

第十三章：腳本編程與系統(tǒng)管理

第十四章：測試調(diào)試和異常

第十五章：C語言擴(kuò)展

附錄A

關(guān)于譯者

Roadmap

閱讀(34.4k) 書簽贊(0) 我要糾錯

12.7 創(chuàng)建一個(gè)線程池

2018-02-24 15:27 更新

問題

You want to create a pool of worker threads for serving clients or performing other kindsof work.

解決方案

The concurrent.futures library has a ThreadPoolExecutor class that can be used forthis purpose. Here is an example of a simple TCP server that uses a thread-pool to serveclients:

from socket import AF_INET, SOCK_STREAM, socketfrom concurrent.futures import ThreadPoolExecutor

def echo_client(sock, client_addr):
‘''Handle a client connection‘''print(‘Got connection from', client_addr)while True:

msg = sock.recv(65536)if not msg:

break

sock.sendall(msg)

print(‘Client closed connection')sock.close()

def echo_server(addr):
pool = ThreadPoolExecutor(128)sock = socket(AF_INET, SOCK_STREAM)sock.bind(addr)sock.listen(5)while True:

client_sock, client_addr = sock.accept()pool.submit(echo_client, client_sock, client_addr)

echo_server((‘',15000))

If you want to manually create your own thread pool, it’s usually easy enough to do itusing a Queue. Here is a slightly different, but manual implementation of the same code:

from socket import socket, AF_INET, SOCK_STREAMfrom threading import Threadfrom queue import Queue

def echo_client(q):
‘''Handle a client connection‘''sock, client_addr = q.get()print(‘Got connection from', client_addr)while True:

msg = sock.recv(65536)if not msg:

break

sock.sendall(msg)

print(‘Client closed connection')

sock.close()

def echo_server(addr, nworkers):

Launch the client workersq = Queue()for n in range(nworkers):

t = Thread(target=echo_client, args=(q,))t.daemon = Truet.start()

Run the serversock = socket(AF_INET, SOCK_STREAM)sock.bind(addr)sock.listen(5)while True:

client_sock, client_addr = sock.accept()q.put((client_sock, client_addr))

echo_server((‘',15000), 128)

One advantage of using ThreadPoolExecutor over a manual implementation is that itmakes it easier for the submitter to receive results from the called function. For example,you could write code like this:

from concurrent.futures import ThreadPoolExecutorimport urllib.request

def fetch_url(url):u = urllib.request.urlopen(url)data = u.read()return data
pool = ThreadPoolExecutor(10)# Submit work to the poola = pool.submit(fetch_url, ‘http://www.python.org‘)b = pool.submit(fetch_url, ‘http://www.pypy.org‘)

Get the results backx = a.result()y = b.result()

The result objects in the example handle all of the blocking and coordination neededto get data back from the worker thread. Specifically, the operation a.result() blocksuntil the corresponding function has been executed by the pool and returned a value.

討論

Generally, you should avoid writing programs that allow unlimited growth in the num‐ber of threads. For example, take a look at the following server:

from threading import Threadfrom socket import socket, AF_INET, SOCK_STREAM

def echo_client(sock, client_addr):
‘''Handle a client connection‘''print(‘Got connection from', client_addr)while True:

msg = sock.recv(65536)if not msg:

break

sock.sendall(msg)

print(‘Client closed connection')sock.close()

def echo_server(addr, nworkers):

Run the serversock = socket(AF_INET, SOCK_STREAM)sock.bind(addr)sock.listen(5)while True:

client_sock, client_addr = sock.accept()t = Thread(target=echo_client, args=(client_sock, client_addr))t.daemon = Truet.start()

echo_server((‘',15000))

Although this works, it doesn’t prevent some asynchronous hipster from launching anattack on the server that makes it create so many threads that your program runs outof resources and crashes (thus further demonstrating the “evils” of using threads). Byusing a pre-initialized thread pool, you can carefully put an upper limit on the amountof supported concurrency.You might be concerned with the effect of creating a large number of threads. However,modern systems should have no trouble creating pools of a few thousand threads.Moreover, having a thousand threads just sitting around waiting for work isn’t going tohave much, if any, impact on the performance of other code (a sleeping thread does justthat—nothing at all). Of course, if all of those threads wake up at the same time andstart hammering on the CPU, that’s a different story—especially in light of the GlobalInterpreter Lock (GIL). Generally, you only want to use thread pools for I/O-boundprocessing.One possible concern with creating large thread pools might be memory use. For ex‐ample, if you create 2,000 threads on OS X, the system shows the Python process usingup more than 9 GB of virtual memory. However, this is actually somewhat misleading.When creating a thread, the operating system reserves a region of virtual memory tohold the thread’s execution stack (often as large as 8 MB). Only a small fragment of thismemory is actually mapped to real memory, though. Thus, if you look a bit closer, youmight find the Python process is using far less real memory (e.g., for 2,000 threads, only

70 MB of real memory is used, not 9 GB). If the size of the virtual memory is a concern,you can dial it down using the threading.stack_size() function. For example:

import threadingthreading.stack_size(65536)

If you add this call and repeat the experiment of creating 2,000 threads, you’ll find thatthe Python process is now only using about 210 MB of virtual memory, although theamount of real memory in use remains about the same. Note that the thread stack sizemust be at least 32,768 bytes, and is usually restricted to be a multiple of the systemmemory page size (4096, 8192, etc.).

以上內(nèi)容是否對您有幫助：

← 12.6 保存線程的狀態(tài)信息

12.8 簡單的并行編程 →

寫筆記

我要補(bǔ)充

12.7 創(chuàng)建一個(gè)線程池

問題

解決方案

Launch the client workersq = Queue()for n in range(nworkers):

Run the serversock = socket(AF_INET, SOCK_STREAM)sock.bind(addr)sock.listen(5)while True:

Get the results backx = a.result()y = b.result()

討論

Run the serversock = socket(AF_INET, SOCK_STREAM)sock.bind(addr)sock.listen(5)while True:

推薦文章

推薦教程

推薦課程