Books & Videos

Table of Contents

  1. Availability

    1. Chapter 1 What Is Availability?

      1. Availability Versus Reliability
      2. What Causes Poor Availability?
    2. Chapter 2 Five Focuses to Improve Application Availability

      1. Focus #1: Build with Failure in Mind
      2. Focus #2: Always Think About Scaling
      3. Focus #3: Mitigate Risk
      4. Focus #4: Monitor Availability
      5. Focus #5: Respond to Availability Issues in a Predictable and Defined Way
      6. Being Prepared
    3. Chapter 3 Measuring Availability

      1. The Nines
      2. Don’t Be Fooled
      3. Availability by the Numbers
    4. Chapter 4 Improving Your Availability When It Slips

      1. Measure and Track Your Current Availability
      2. Automate Your Manual Processes
      3. Improve Your Systems
      4. Your Changing and Growing Application
      5. Keeping on Top of Availability
  2. Risk Management

    1. Chapter 5 What Is Risk Management?

      1. Managing Risk
      2. Identify Risk
      3. Remove Worst Offenders
      4. Mitigate
      5. Review Regularly
      6. Managing Risk Summary
    2. Chapter 6 Likelihood Versus Severity

      1. The Top 10 List: Low Likelihood, Low Severity Risk
      2. The Order Database: Low Likelihood, High Severity Risk
      3. Custom Fonts: High Likelihood, Low Severity Risk
      4. T-Shirt Photos: High Likelihood, High Severity Risk
    3. Chapter 7 The Risk Matrix

      1. Scope of the Risk Matrix
      2. Creating the Risk Matrix
      3. Using the Risk Matrix for Planning
      4. Maintaining the Risk Matrix
    4. Chapter 8 Risk Mitigation

      1. Recovery Plans
      2. Disaster Recovery Plans
      3. Improving Our Risk Situation
    5. Chapter 9 Game Days

      1. Staging Versus Production Environments
      2. Concerns with Running Game Days in Production
      3. Game Day Testing
    6. Chapter 10 Building Systems with Reduced Risk

      1. Redundancy
      2. Examples of Idempotent Interfaces
      3. Redundancy Improvements That Increase Complexity
      4. Independence
      5. Security
      6. Simplicity
      7. Self-Repair
      8. Operational Processes
  3. Services and Microservices

    1. Chapter 11 Why Use Services?

      1. The Monolith Application
      2. The Service-Based Application
      3. The Ownership Benefit
      4. The Scaling Benefit
    2. Chapter 12 Using Microservices

      1. What Should Be a Service?
      2. Going Too Far
      3. The Right Balance
    3. Chapter 13 Dealing with Service Failures

      1. Cascading Service Failures
      2. Responding to a Service Failure
      3. Determining Failures
      4. Appropriate Action
  4. Scaling Applications

    1. Chapter 14 Two Mistakes High

      1. What Is “Two Mistakes High”?
      2. “Two Mistakes High” in Practice
      3. Managing Your Applications
      4. The Space Shuttle
    2. Chapter 15 Service Ownership

      1. Single Team Owned Service Architecture
      2. Advantages of a STOSA Application and Organization
      3. What Does it Mean to Be a Service Owner?
    3. Chapter 16 Service Tiers

      1. Application Complexity
      2. What Are Service Tiers?
      3. Assigning Service Tier Labels to Services
      4. Example: Online Store
      5. What’s Next?
    4. Chapter 17 Using Service Tiers

      1. Expectations
      2. Responsiveness
      3. Dependencies
      4. Summary
    5. Chapter 18 Service-Level Agreements

      1. What are Service-Level Agreements?
      2. External Versus Internal SLAs
      3. Why Are Internal SLAs Important?
      4. SLAs as Trust
      5. SLAs for Problem Diagnosis
      6. Performance Measurements for SLAs
      7. How Many and Which Internal SLAs?
      8. Additional Comments on SLAs
    6. Chapter 19 Continuous Improvement

      1. Examine Your Application Regularly
      2. Microservices
      3. Service Ownership
      4. Stateless Services
      5. Where’s the Data?
      6. Data Partitioning
      7. The Importance of Continuous Improvement
  5. Cloud Services

    1. Chapter 20 Change and the Cloud

      1. What Has Changed in the Cloud?
      2. Change Continues
    2. Chapter 21 Distributing the Cloud

      1. AWS Architecture
      2. Architecture Overview
      3. Availability Zones Are Not Data Centers
      4. Maintaining Location Diversity for Availability Reasons
    3. Chapter 22 Managed Infrastructure

      1. Structure of Cloud-Based Services
      2. Implications of Using Managed Resources
      3. Implications of Using Non-Managed Resources
      4. Monitoring and CloudWatch
    4. Chapter 23 Cloud Resource Allocation

      1. Allocated-Capacity Resource Allocation
      2. Usage-Based Resource Allocation
      3. The Pros and Cons of Resource Allocation Techniques
    5. Chapter 24 Scalable Computing Options

      1. Cloud-Based Servers
      2. Compute Slices
      3. Dynamic Containers
      4. Microcompute
      5. Now What?
    6. Chapter 25 AWS Lambda

      1. Using Lambda
      2. Advantages and Disadvantages of Lambda
  6. Conclusion

    1. Chapter 26 Putting It All Together

      1. Availability
      2. Risk Management
      3. Services
      4. Scaling
      5. Cloud
      6. Architecting for Scale