The Issue Is (Mostly) On the Client
“Solved”: Server apps (e.g., DB servers, web services).
Lots of independent requests – one thread per request is easy.
Typical to execute many copies of the same code.
Shared data usually via structured databases: Automatic implicit concurrency control via transactions.
With some care, “concurrency problem is already solved” here.
Not solved: Typical client apps (i.e., not Photoshop).
Somehow employ many threads per user “request.”
Highly atypical to execute many copies of the same code.
Shared data in unstructured shared memory:
Error-prone explicit locking – where are the transactions?
A Tale of Six Software Waves
Each was born during 1958–73, bided its time until the 1990s/2000s, then took 5+ years to build a mature tool/language/framework/runtime ecosystem.
[Timeline figure, 1990–2012: the six waves – GUIs, Objects, GC, Generics, the ’Net, Concurrency.]
The biggest difference: We can see this one coming.
Comparative Impacts Across the Stack
[Table figure: the impact of each wave (GUIs, Objects, GC, Generics, Web, Concurrency) on each layer of the stack – application programming model; libraries and frameworks; languages and compilers; runtimes and OS; tools (design, measure, test). Legend – extent of impact and/or enabling importance: some (could drive one major product release); major (could drive more than one major product release); new way of thinking / essential enabler (or multiple major releases).]
O(1), O(K), or O(N) Concurrency?
O(1): As good as sequential.
The free lunch is over (if CPU-bound): Flat or merely incremental perf. improvements.
Potentially poor responsiveness.
O(K): Explicitly fixed throughput.
Hardwired # of threads that prefer K CPUs (for a given input workload).
Can penalize <K CPUs; doesn’t scale beyond K CPUs.
O(N): Scalable throughput.
Workload decomposed into a “sea” of heterogeneous chores.
Lots of latent concurrency we can map down to N cores.
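A sketch of the O(N) style (my addition, not from the slides): the workload is decomposed recursively into many small chores with std::async, so the latent concurrency grows with the input size and the runtime can map the chores onto however many cores exist. The function name and the cutoff constant are illustrative.

    // Hedged sketch of O(N)-style decomposition: a recursive parallel sum.
    // The 100000-element cutoff is an arbitrary "small enough to run inline" bound.
    #include <future>
    #include <numeric>
    #include <vector>

    long long parallel_sum(const std::vector<int>& v, std::size_t lo, std::size_t hi) {
        if (hi - lo < 100000)                       // small chore: just run it inline
            return std::accumulate(v.begin() + lo, v.begin() + hi, 0LL);
        std::size_t mid = lo + (hi - lo) / 2;
        auto left = std::async(parallel_sum, std::cref(v), lo, mid);   // spawn one more chore
        long long right = parallel_sum(v, mid, hi);                    // keep splitting the rest
        return left.get() + right;
    }

By contrast, an O(K) design would hardwire K worker threads and hand each a fixed slice: it cannot exploit more than K cores and can oversubscribe a machine with fewer.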
Some Lead Bullets (useful, but mostly mined)
Automatic parallelization (e.g., compilers, ILP):
Amdahl’s Law: Sequential programs tend to be… well, sequential.
Requires accurate program analysis: Challenging for simple languages (Fortran), intractable for languages with pointers.
Doesn’t actually shield programmers from having to know about concurrency.
Functional languages:
Contain natural parallelism… except it’s too fine-grained.
Use pure immutable data… except the ones in commercial use, which allow mutation.
Not known to be adoptable by mainstream developers.
Borrow some key abstractions/styles from these languages (e.g., closures) and support them in imperative languages.
OpenMP, OpenMPI, et al.:
“Industrial-strength duct tape,” but useful where applicable.
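Where it applies, the duct tape is genuinely handy. A minimal sketch (my addition, not from the slides) of an OpenMP data-parallel loop: the pragma asks the runtime to split the independent iterations across the available cores (compile with OpenMP enabled, e.g. -fopenmp).

    // Hedged sketch: an OpenMP "parallel for" over independent iterations.
    // Function and variable names are illustrative.
    void scale(double* a, int n, double factor) {
        #pragma omp parallel for
        for (int i = 0; i < n; ++i)
            a[i] *= factor;
    }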
Truths
Consequences
Futures
An OO for Concurrency
Today’s “threads + locks” are to concurrency roughly what Fortran and C were to sequential programming; we need the concurrency equivalent of OO:
    OO              |  ?
    Fortran, C, …   |  threads + locks
    asm             |  semaphores
Three Pillars of the Dawn
A Framework for Evaluating Concurrency
I. Isolation via Asynchronous Agents
    Tagline: Avoid blocking.
    Summary: Stay responsive by running tasks independently and communicating via messages.
    Examples: GUIs, web services, background print/compile.
    Key metrics: Responsiveness.
    Requirements: Isolated, independent.
    Today’s abstractions: Threads, message queues.
    Possible new abstractions: Active objects, futures.

II. Scalability via Concurrent Collections
    Tagline: Re-enable the free lunch.
    Summary: Use more cores to get the answer faster by running operations on groups of things; exploit parallelism in data/algorithm structures.
    Examples: Trees, quicksort, compilation.
    Key metrics: Throughput, scalability.
    Requirements: Low overhead, side-effect free.
    Today’s abstractions: Thread pools, OpenMP.
    Possible new abstractions: Chores, futures, parallel STL, PLINQ.

III. Consistency via Safely Shared Resources
    Tagline: Don’t corrupt shared state.
    Summary: Avoid races by synchronizing access to shared resources, esp. mutable objects in shared memory.
    Examples: Mutable shared objects in memory, database tables.
    Key metrics: Race-free, deadlock-free.
    Requirements: Composable, serializable.
    Today’s abstractions: Explicit locks, lock-free libraries, transactions.
    Possible new abstractions: Transactional memory, improved locking.
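To make Pillar I concrete, a minimal sketch (my addition, not from the talk) of an “active object”-style agent: it owns its mailbox and state, callers communicate only by sending messages, and nothing in the caller blocks on the agent’s internals. All names (Agent, send, run) are illustrative.

    // Hedged sketch: one worker thread draining a mailbox of closures.
    #include <condition_variable>
    #include <functional>
    #include <mutex>
    #include <queue>
    #include <thread>

    class Agent {
        std::queue<std::function<void()>> mailbox_;
        std::mutex m_;
        std::condition_variable cv_;
        bool done_ = false;
        std::thread worker_{ [this] { run(); } };

        void run() {
            for (;;) {
                std::function<void()> msg;
                {
                    std::unique_lock<std::mutex> lock(m_);
                    cv_.wait(lock, [this] { return done_ || !mailbox_.empty(); });
                    if (mailbox_.empty()) return;          // done_ set and nothing left
                    msg = std::move(mailbox_.front());
                    mailbox_.pop();
                }
                msg();                                     // process one message at a time
            }
        }

    public:
        // Callers never wait on the agent's work; they only enqueue a message.
        void send(std::function<void()> msg) {
            { std::lock_guard<std::mutex> lock(m_); mailbox_.push(std::move(msg)); }
            cv_.notify_one();
        }

        ~Agent() {                                         // drain the mailbox, then stop
            { std::lock_guard<std::mutex> lock(m_); done_ = true; }
            cv_.notify_one();
            worker_.join();
        }
    };

Usage is simply agent.send([]{ /* slow work */ }); responsiveness comes from the fact that send only enqueues.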
The Trouble With Locks
How may I harm thee? Let me count the ways:
{
    Lock lock1( mutTable1 );
    Lock lock2( mutTable2 );
    table1.erase( x );
    table2.insert( x );
}
Locks are the best we have, and known to be inadequate:
Which lock? The connection between a lock and the data it protects exists primarily in the mind of a programmer.
Deadlock? If the mutexes aren’t acquired in the same order (see the sketch after this list).
Simplicity or scalability? Coarse-grained locks are simpler to use correctly, but easily become bottlenecks.
Not composable. In today’s world, this is a deadly sin.
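To make the deadlock point concrete, a short sketch (my addition), reusing the slide’s table/mutex names with standard C++ primitives; std::scoped_lock (C++17), or std::lock before it, acquires multiple mutexes deadlock-free so callers need not agree on an order.

    // Hedged sketch: the ordering hazard, and the multi-lock idiom that avoids it.
    #include <mutex>
    #include <set>

    std::mutex mutTable1, mutTable2;
    std::set<int> table1, table2;

    // Fragile: deadlocks against any code path that locks mutTable2 first.
    void move_item_fragile(int x) {
        std::lock_guard<std::mutex> lock1(mutTable1);
        std::lock_guard<std::mutex> lock2(mutTable2);
        table1.erase(x);
        table2.insert(x);
    }

    // Safer: acquire both mutexes atomically, in any caller, in any textual order.
    void move_item_safer(int x) {
        std::scoped_lock lock(mutTable1, mutTable2);
        table1.erase(x);
        table2.insert(x);
    }

Note that this fixes only the ordering hazard; the “which lock protects what” question and the lack of composability remain.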
Enter Transactional Memory (?)
A transactional programming model:
atomic {
    table1.erase( x );
    table2.insert( x );
}
The programmer just delimits the transactions.
The system provides database-like automatic concurrency control with rollback-and-retry for competing transactions.
Benefits:
No specific lock: No need to remember which lock to take.
No deadlock: No need to remember a sync order discipline.
Simple and scalable: Fine-grained without sacrificing correctness.
Best of all: Composable. We can once again build a big (correct) program out of smaller (correct) programs.
Drawbacks:
Still active research. And the elephant in the room is I/O: What can you write inside an atomic block, exactly?
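For a sense of what such a system looks like in practice today, GCC has shipped an experimental transactional-memory extension (-fgnu-tm, since GCC 4.7). A hedged sketch: in that model most library calls are not marked transaction-safe, so the body sticks to plain memory updates, and the variable names are illustrative.

    // Hedged sketch: GCC's -fgnu-tm extension. __transaction_atomic runs the
    // block as a transaction; the runtime detects conflicts and rolls back and
    // retries competing transactions.
    static int balance_from = 100;
    static int balance_to   = 0;

    void transfer(int amount) {
        __transaction_atomic {
            balance_from -= amount;
            balance_to   += amount;
        }
    }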
