High-Performance Computing and Concurrency

Video description

It's déjà vu all over again. In the old days (35-plus years ago), developers who wanted to write half-decent programs had to know their hardware. Those days are back. Clock frequencies have peaked, and hardware can no longer be abstracted away behind high-level languages. Designed for developers with high-performance requirements (games, financial analysis, scientific computation, etc.), this course teaches you what really happens when programs are executed and the subtle details that make a program run slow or fast.

With a focus on concurrency, specifically local concurrency (multi-threading), the course is all about writing efficient programs that make the best use of the computing resources available to you. While the sample code is written in C++, the course is not C++-specific. If you can read C++ code but don’t use it in your work, you will still learn from this course.

  • Learn how programs execute in hardware and the subtle details that affect program speed
  • Practice writing efficient programs that get the most out of today’s CPUs, caches, and memory
  • Discover how single- and multi-core CPUs interact with memory and how to avoid memory-related slowdowns
  • Explore memory models, concurrent data structures, lock-free concurrency, and lock-based concurrency
  • Acquire the tools needed to measure the performance of programs and their components

Fedor G. Pikus is a chief engineering scientist in the Design-to-Silicon division of Mentor Graphics and a former senior software engineer at Google. Fedor builds the design automation tools used by the people who build the chips in your computers, cars, and more. He has over 25 patents, and over 90 papers and conference presentations on physics, EDA, software design, and the C++ language. He holds a Ph.D. in Applied Physics from Peter the Great St. Petersburg Polytechnic University.

Table of contents

  1. Introduction
    1. Introduction And Course Overview
    2. About The Author
  2. Memory Architecture And Performance Impact
    1. Overview
    2. Overview (Continued)
    3. Access Patterns And Impact On Algorithms And Data Structure Design
    4. Many Threads (Multi-Core Access)
  3. Measuring Time In Programs
    1. Real Time And CPU Time
    2. TSC Timers
    3. Profiling Tools
  4. Threads
    1. Overview
    2. Threads In C++
    3. Avoiding Data Races And Its Cost
  5. How Threads Interact With Memory
    1. Concurrency And Memory
    2. Data Sharing
    3. False Data Sharing
  6. Synchronization Of Memory Accesses
    1. Locks (Mutexes) Part - 1
    2. Locks (Mutexes) Part - 2
    3. Locks (Spinlocks)
    4. Lock-Free Synchronization And Other Options - Part 1
    5. Lock-Free Synchronization And Other Options - Part 2
  7. Memory Models
    1. Memory Model
    2. C++ Memory Model
    3. Memory Order
    4. Memory Order Guarantees In C++
  8. Memory Barriers
    1. Need For Memory Barriers
    2. Memory Barriers
    3. Synchronization, Revisited - Part 1
    4. Synchronization, Revisited - Part 2
  9. Lock-Based And Lock-Free Programming
    1. Efficient Concurrency; Types Of Concurrent Programs
    2. Problems With Locks Part - 1
    3. Problems With Locks Part - 2
    4. Thread-Safe Data Structures Part - 1
    5. Thread-Safe Data Structures Part - 2
    6. Introduction To Lock-Free Programming
  10. Lock-Free Data Structures
    1. Shared Pointer Part - 1
    2. Shared Pointer Part - 2
    3. Shared Pointer Part - 3
    4. Shared Pointer Part - 4
    5. Shared Pointer Part - 5
    6. Shared Pointer Part - 6
    7. Node-Based Containers Part - 1 (List)
    8. Node-Based Containers Part - 2 (List)
    9. Node-Based Containers Part - 3 (List)
    10. Node-Based Containers Part - 4 (List)
    11. Node-Based Containers Part - 5 (List)
    12. Node-Based Containers Part - 6 (List)
    13. Node-Based Containers Part - 7 (List)
    14. Sequential Containers Part - 1 (Queue)
    15. Sequential Containers Part - 2 (Queue)
    16. Sequential Containers Part - 3 (Queue)
    17. Sequential Containers Part - 4 (Queue)
    18. Sequential Containers Part - 5 (Queue)
    19. Sequential Containers Part - 6 (Queue)
  11. Performance In Real Life
    1. Practical Performance
    2. Factors Affecting Performance Part - 1
    3. Factors Affecting Performance Part - 2
  12. Concurrent Data Structures In Depth
    1. Concurrency, Performance, And Order Guarantees Part - 1
    2. Concurrency, Performance, And Order Guarantees Part - 2
    3. Toward More General Data Structures
  13. Conclusion
    1. Conclusions And Where To Go From Here

Product information

  • Title: High-Performance Computing and Concurrency
  • Author(s): Fedor G. Pikus
  • Release date: November 2016
  • Publisher(s): Infinite Skills
  • ISBN: 9781491967553