
Threading
History and Implementation

Matthew Atwood
Intel Atom Group - OSU


Moore's Law(s)

Moore's Law
"The number of transistors and resistors on a chip doubles every 18 months."

Moore's 2nd Law
"The capital cost of a semiconductor fab also increases exponentially over time."

Gordon E. Moore
Intel Co-Founder


Computing Power

1. There have always been two schools of thought on how to increase computing power.
2. Increase the number of transistors
   ○ The primary way to increase computing power over the last 50 years.
3. Increase the number of cores
   ○ A strong idea, but more difficult to implement.


What is Threading?

Thread of execution:
1. The smallest unit of processing that can be scheduled by an operating system.
2. In most modern operating systems, a thread is contained within a process (Unix and Unix-like operating systems).
   ○ Linux does not differentiate between processes and threads. A thread is merely a special kind of process.
3. On a single processor, a process's time is divided among its threads.
4. On multi-core machines, threads can run concurrently, thus achieving true parallelism.
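
A minimal sketch of a single extra thread of execution inside a process, using POSIX threads (the worker function name is illustrative; compile with -pthread):

#include <pthread.h>
#include <stdio.h>

/* Entry point for the new thread of execution; it is scheduled by the
 * operating system independently of main(), but shares the process. */
static void *worker(void *arg)
{
    printf("worker thread running inside the same process\n");
    return NULL;
}

int main(void)
{
    pthread_t tid;

    /* Create a second thread of execution within this process. */
    if (pthread_create(&tid, NULL, worker, NULL) != 0)
        return 1;

    /* Wait for the worker to finish before the process exits. */
    pthread_join(tid, NULL);
    return 0;
}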


Threads vs Processes

● Processes are typically (though not always) independent; threads exist as parts of a process.
● Threads carry much less state information than processes.
● Threads share their address space (see the sketch below).
● Processes can only communicate through inter-process communication: sockets, file descriptors, etc.
● Context switching between threads (of the same process) is typically much faster than context switching among processes.
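
One way to see the shared address space, as a small sketch (the variable and function names are illustrative): a worker thread writes to a global variable and the main thread reads the change directly, with no sockets or pipes involved.

#include <pthread.h>
#include <stdio.h>

/* Lives in the single address space shared by every thread of the process. */
static int shared_value = 0;

static void *writer(void *arg)
{
    shared_value = 42;   /* immediately visible to all other threads */
    return NULL;
}

int main(void)
{
    pthread_t tid;

    pthread_create(&tid, NULL, writer, NULL);
    pthread_join(tid, NULL);

    /* A forked child process would modify its own copy; a thread does not. */
    printf("main sees shared_value = %d\n", shared_value);
    return 0;
}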


Why Threading? Why Now?

Part I
1. We are approaching the limit for how small a transistor can get.
   ○ April 18, 2011: a 7-atom transistor was made.
2. The cost in terms of power consumption, and in turn heat, of faster chips is becoming more and more unacceptable.
3. To continue increasing computing power, harnessing multiple cores instead of relying on high-power single-core machines is critical.


Why Threading? Why Now?

Part II
1. Threads are much cheaper to create than processes.
   ○ pthread_create() often takes less than 1/5th the creation time of fork() (see the rough benchmark sketch after this list).
2. Inter-thread communication is much faster than inter-process communication.
   ○ Especially if I/O is used to communicate between processes.
3. Context switching between threads is much less expensive than between processes.
   ○ This is due mainly to the shared memory model.
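
A rough micro-benchmark sketch of the creation-cost claim in item 1. The iteration count and use of clock_gettime() are assumptions, and absolute numbers and ratios vary widely by system; this only illustrates how one might measure it.

#include <pthread.h>
#include <stdio.h>
#include <sys/wait.h>
#include <time.h>
#include <unistd.h>

#define ITERATIONS 1000

static void *noop(void *arg) { return NULL; }

static double elapsed_sec(struct timespec a, struct timespec b)
{
    return (b.tv_sec - a.tv_sec) + (b.tv_nsec - a.tv_nsec) / 1e9;
}

int main(void)
{
    struct timespec t0, t1;
    pthread_t tid;

    /* Time creating (and joining) ITERATIONS threads. */
    clock_gettime(CLOCK_MONOTONIC, &t0);
    for (int i = 0; i < ITERATIONS; i++) {
        pthread_create(&tid, NULL, noop, NULL);
        pthread_join(tid, NULL);
    }
    clock_gettime(CLOCK_MONOTONIC, &t1);
    printf("pthread_create: %.3f s\n", elapsed_sec(t0, t1));

    /* Time forking (and reaping) ITERATIONS child processes. */
    clock_gettime(CLOCK_MONOTONIC, &t0);
    for (int i = 0; i < ITERATIONS; i++) {
        pid_t pid = fork();
        if (pid == 0)
            _exit(0);          /* child does nothing */
        waitpid(pid, NULL, 0); /* parent reaps it */
    }
    clock_gettime(CLOCK_MONOTONIC, &t1);
    printf("fork:           %.3f s\n", elapsed_sec(t0, t1));
    return 0;
}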


Candidates for Parallelism

If there are two independent tasks that can be run concurrently, interleaved, or overlapped, then we can potentially achieve an optimization through threading.
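
As a sketch of two such independent tasks, each thread below sums its own half of an array; there are no shared writes, so the work can overlap freely (the array size and names are illustrative).

#include <pthread.h>
#include <stdio.h>

#define N 1000000

static long data[N];

struct range { long start, end, sum; };

/* Each thread sums a disjoint slice; the two tasks are fully independent. */
static void *partial_sum(void *arg)
{
    struct range *r = arg;
    r->sum = 0;
    for (long i = r->start; i < r->end; i++)
        r->sum += data[i];
    return NULL;
}

int main(void)
{
    for (long i = 0; i < N; i++)
        data[i] = 1;

    struct range lo = { 0, N / 2, 0 }, hi = { N / 2, N, 0 };
    pthread_t t1, t2;

    pthread_create(&t1, NULL, partial_sum, &lo);
    pthread_create(&t2, NULL, partial_sum, &hi);
    pthread_join(t1, NULL);
    pthread_join(t2, NULL);

    /* Combine the independent results after both tasks complete. */
    printf("total = %ld\n", lo.sum + hi.sum);
    return 0;
}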


Thread Safety

If parallelism, and more specifically threading, is so great, why not use it for everything?
● Parallelism is hard. Humans think serially, and many existing algorithms cannot be done in parallel (by design).
● A section of code in which shared data is accessed and manipulated is called a critical section.
   ○ "Locking" is used to protect these sections (see the sketch after this list).
● If it is possible for two threads to be in the same critical section at the same time, we call this a race condition.
   ○ A race condition means that the order of execution determines the output.
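
A sketch of the critical-section problem described above: two threads increment a shared counter. Without the mutex, updates can be lost and the result depends on interleaving; with it, only one thread is ever inside the critical section (the counts and names are illustrative).

#include <pthread.h>
#include <stdio.h>

#define INCREMENTS 1000000

static long counter = 0;
static pthread_mutex_t lock = PTHREAD_MUTEX_INITIALIZER;

/* counter++ is the critical section: read, add, write back.
 * The mutex ensures only one thread executes it at a time. */
static void *increment(void *arg)
{
    for (int i = 0; i < INCREMENTS; i++) {
        pthread_mutex_lock(&lock);
        counter++;
        pthread_mutex_unlock(&lock);
    }
    return NULL;
}

int main(void)
{
    pthread_t t1, t2;

    pthread_create(&t1, NULL, increment, NULL);
    pthread_create(&t2, NULL, increment, NULL);
    pthread_join(t1, NULL);
    pthread_join(t2, NULL);

    /* Without the lock this depends on thread interleaving (a race
     * condition); with it, the result is always 2 * INCREMENTS. */
    printf("counter = %ld\n", counter);
    return 0;
}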


Thread Synchronization

● Use parallel design patterns; these are often similar to regular design patterns.
   ○ Master/Slave
   ○ Assembly Line
   ○ Peer
● Locking critical sections requires atomic primitives. An atomic variable guarantees that only one thread acts on it at a time.
   ○ Semaphores: typically used in a gate design pattern (see the sketch after this list)
   ○ Mutex: the typical locking mechanism for shared data
   ○ Spinlocks: the Linux kernel's locking mechanism
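
A small sketch of the "gate" use of a semaphore mentioned above: worker threads block on sem_wait() until the main thread opens the gate with sem_post(). The worker count is illustrative, and unnamed POSIX semaphores are not available on every platform (e.g. they are deprecated on macOS).

#include <pthread.h>
#include <semaphore.h>
#include <stdio.h>

#define WORKERS 3

static sem_t gate;   /* starts closed (count 0) */

static void *worker(void *arg)
{
    long id = (long)arg;

    sem_wait(&gate);                 /* block until the gate opens */
    printf("worker %ld passed the gate\n", id);
    return NULL;
}

int main(void)
{
    pthread_t tids[WORKERS];

    sem_init(&gate, 0, 0);           /* unnamed semaphore, initial count 0 */
    for (long i = 0; i < WORKERS; i++)
        pthread_create(&tids[i], NULL, worker, (void *)i);

    /* Open the gate once per waiting worker. */
    for (int i = 0; i < WORKERS; i++)
        sem_post(&gate);

    for (int i = 0; i < WORKERS; i++)
        pthread_join(tids[i], NULL);
    sem_destroy(&gate);
    return 0;
}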


Threading Libraries

1. POSIX Threads (pthreads)
   ○ Available on almost every platform (originally intended for Unix and Unix-like operating systems).
   ○ Extremely customizable, allowing for a greater degree of control.
2. Intel Threading Building Blocks
   ○ A higher-level library for threading.
3. OpenMP
   ○ An open, compiler-directive-based standard for higher-level threading (see the sketch below).

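For contrast with hand-written pthreads code, a minimal OpenMP sketch: one directive splits the loop across a team of threads and handles the shared sum. The array size is illustrative; compile with an OpenMP-capable compiler (e.g. gcc -fopenmp).

#include <omp.h>
#include <stdio.h>

#define N 1000000

int main(void)
{
    static double data[N];
    double sum = 0.0;

    /* The directive splits the loop iterations across a team of threads;
     * the reduction clause safely combines each thread's partial sum. */
    #pragma omp parallel for reduction(+:sum)
    for (int i = 0; i < N; i++) {
        data[i] = i * 0.5;
        sum += data[i];
    }

    printf("sum = %f, threads available = %d\n", sum, omp_get_max_threads());
    return 0;
}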


Matt's Tips for Threading

1. Don't design algorithms serially.
   ○ Start out expecting your code to be run in parallel.
2. Pick the "right" pattern.
   ○ Identify what your goal is (memory usage, speed, etc.) and what you're willing to sacrifice to get there.
3. Avoid lots of critical sections.
   ○ This refers not to sheer quantity, but to how often you will need to enter those critical sections.
4. Think carefully about your division of labor.


Recommended Reading

1. The Little Book of Semaphores, Allen B. Downey
   ○ Available for free: http://greenteapress.com/semaphores/downey08semaphores.pdf
2. Pthread Tutorial
   ○ https://computing.llnl.gov/tutorials/pthreads/
3. And if you're interested in the kernel...
   ○ Linux Kernel Development, Robert Love
