Threads Not Executing in Parallel in Python with ThreadPoolExecutor
The ThreadPoolExecutor in Python's concurrent.futures module is a powerful tool for running tasks concurrently. However, there are common situations where threads do not execute in true parallel, leading to unexpected performance. Let's explore these scenarios and their solutions.
Scenario 1: CPU-Bound Tasks
If your tasks are CPU-bound (intensive calculations), the GIL (Global Interpreter Lock) in CPython restricts true parallelism. Only one thread can execute Python bytecode at a time. This limitation often results in sequential execution, even with multiple threads.
Example
Code:

```python
import concurrent.futures
import time

def cpu_bound_task(num):
    for i in range(1000000):
        num * i
    return num

with concurrent.futures.ThreadPoolExecutor() as executor:
    start_time = time.time()
    results = [executor.submit(cpu_bound_task, i) for i in range(4)]
    for future in concurrent.futures.as_completed(results):
        print(f'Result: {future.result()} - Time: {time.time() - start_time:.2f}s')
```

Output:

```
Result: 0 - Time: 0.24s
Result: 1 - Time: 0.49s
Result: 2 - Time: 0.73s
Result: 3 - Time: 0.97s
```
Notice how the tasks execute sequentially, each taking roughly 0.25 seconds, indicating the GIL bottleneck.
Solution: Use Multiprocessing
For CPU-bound tasks, consider the multiprocessing module (or concurrent.futures.ProcessPoolExecutor), which creates separate processes rather than threads. Each process has its own interpreter, GIL, and memory space, allowing true parallelism.
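As a sketch, the CPU-bound example above can be switched to process-based parallelism simply by swapping in ProcessPoolExecutor; on a multi-core machine the four tasks then run simultaneously instead of back to back (exact timings depend on your core count):

```python
import concurrent.futures
import time

def cpu_bound_task(num):
    # Same busy loop as above; each worker process has its own interpreter and GIL.
    for i in range(1000000):
        num * i
    return num

if __name__ == '__main__':  # required on platforms that spawn worker processes
    with concurrent.futures.ProcessPoolExecutor() as executor:
        start_time = time.time()
        results = [executor.submit(cpu_bound_task, i) for i in range(4)]
        for future in concurrent.futures.as_completed(results):
            print(f'Result: {future.result()} - Time: {time.time() - start_time:.2f}s')
```

Note that worker functions must be defined at module level so they can be pickled and sent to the child processes, and process startup adds overhead, so this pays off only when the per-task work is substantial.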
Scenario 2: I/O-Bound Tasks
If your tasks involve I/O operations (reading/writing files, network requests), they spend much of their time waiting for responses. The GIL is released during these waits, so other threads can run in the meantime, giving genuine concurrency.
Example
Code:

```python
import concurrent.futures
import time
import random

def io_bound_task():
    time.sleep(random.uniform(0.1, 0.5))
    return "Task Complete"

with concurrent.futures.ThreadPoolExecutor() as executor:
    start_time = time.time()
    results = [executor.submit(io_bound_task) for _ in range(4)]
    for future in concurrent.futures.as_completed(results):
        print(f'Result: {future.result()} - Time: {time.time() - start_time:.2f}s')
```

Output:

```
Result: Task Complete - Time: 0.12s
Result: Task Complete - Time: 0.21s
Result: Task Complete - Time: 0.36s
Result: Task Complete - Time: 0.45s
```
Here, all four tasks finish within about half a second in total because their waits overlap, demonstrating concurrency.
Scenario 3: Blocking Operations
A blocking call ties up its worker thread for its entire duration. If the pool has fewer workers than pending tasks, or if the blocking call happens in code that holds the GIL, the remaining tasks queue up behind it and effectively run one at a time. (Note that time.sleep itself releases the GIL, so the sequential timings below come from running the pool with a single worker; with the default worker count these sleeps would overlap.)
Example
Code:

```python
import concurrent.futures
import time

def blocking_task():
    # A blocking call occupies its worker thread for the full second.
    time.sleep(1)
    return "Task Finished"

# A single worker: each blocked task stalls everything queued behind it.
with concurrent.futures.ThreadPoolExecutor(max_workers=1) as executor:
    start_time = time.time()
    results = [executor.submit(blocking_task) for _ in range(4)]
    for future in concurrent.futures.as_completed(results):
        print(f'Result: {future.result()} - Time: {time.time() - start_time:.2f}s')
```

Output:

```
Result: Task Finished - Time: 1.00s
Result: Task Finished - Time: 2.00s
Result: Task Finished - Time: 3.00s
Result: Task Finished - Time: 4.00s
```

The tasks complete one second apart: the lone worker handles each blocking call in turn, so queued tasks wait even though the "work" is just sleeping.
Solution: Non-Blocking Operations
Employ asynchronous programming (asyncio) or libraries like gevent to handle waits cooperatively, so one pending operation does not prevent other work from making progress.
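As a minimal sketch of the asyncio approach, using asyncio.sleep as a stand-in for any awaitable non-blocking operation, four one-second waits overlap on a single thread and the whole batch completes in roughly one second rather than four:

```python
import asyncio

async def non_blocking_task(task_id):
    # await yields control to the event loop instead of blocking the thread.
    await asyncio.sleep(1)
    return f'Task {task_id} Finished'

async def main():
    # Schedule all four coroutines concurrently and collect their results in order.
    return await asyncio.gather(*(non_blocking_task(i) for i in range(4)))

if __name__ == '__main__':
    for result in asyncio.run(main()):
        print(result)
```

The key caveat is that this only helps when the waiting happens in awaitable calls; a plain time.sleep or a synchronous library call inside a coroutine still blocks the entire event loop.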
Best Practices
- Profile your code to identify bottlenecks. Analyze where your program spends most of its time.
- Prioritize true parallelism for CPU-bound tasks using multiprocessing.
- Utilize threading for I/O-bound operations, and ensure long waits happen inside calls that release the GIL (as most standard I/O functions do).
- Use non-blocking approaches like asyncio or gevent when working with blocking operations.
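As a starting point for the profiling advice above, the standard-library cProfile and pstats modules show where time is spent; workload here is a hypothetical stand-in for your own code:

```python
import cProfile
import io
import pstats
import time

def workload():
    # Hypothetical mix of CPU-heavy looping and an I/O-style wait.
    total = 0
    for i in range(200000):
        total += i * i
    time.sleep(0.1)
    return total

profiler = cProfile.Profile()
profiler.enable()
workload()
profiler.disable()

# Report the five entries with the highest cumulative time.
stream = io.StringIO()
pstats.Stats(profiler, stream=stream).sort_stats('cumulative').print_stats(5)
print(stream.getvalue())
```

If the top entries are pure-Python functions, you are CPU-bound and should look at multiprocessing; if they are sleep or socket/file calls, you are I/O-bound and threads or asyncio are the better fit.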