Something of The Book

PDF EPUB Library of e-Books

Practical Site Reliability Engineering

Practical Site Reliability Engineering

Author: Pethuru Raj Chelliah

Publisher: Packt Publishing Ltd

ISBN: 9781788838696

Category: Computers

Page: 390

View: 425

Download BOOK »
Create, deploy, and manage applications at scale using SRE principles Key FeaturesBuild and run highly available, scalable, and secure softwareExplore abstract SRE in a simplified and streamlined wayEnhance the reliability of cloud environments through SRE enhancementsBook Description Site reliability engineering (SRE) is being touted as the most competent paradigm in establishing and ensuring next-generation high-quality software solutions. This book starts by introducing you to the SRE paradigm and covers the need for highly reliable IT platforms and infrastructures. As you make your way through the next set of chapters, you will learn to develop microservices using Spring Boot and make use of RESTful frameworks. You will also learn about GitHub for deployment, containerization, and Docker containers. Practical Site Reliability Engineering teaches you to set up and sustain containerized cloud environments, and also covers architectural and design patterns and reliability implementation techniques such as reactive programming, and languages such as Ballerina and Rust. In the concluding chapters, you will get well-versed with service mesh solutions such as Istio and Linkerd, and understand service resilience test practices, API gateways, and edge/fog computing. By the end of this book, you will have gained experience on working with SRE concepts and be able to deliver highly reliable apps and services. What you will learnUnderstand how to achieve your SRE goalsGrasp Docker-enabled containerization conceptsLeverage enterprise DevOps capabilities and Microservices architecture (MSA)Get to grips with the service mesh concept and frameworks such as Istio and LinkerdDiscover best practices for performance and resiliencyFollow software reliability prediction approaches and enable patternsUnderstand Kubernetes for container and cloud orchestrationExplore the end-to-end software engineering process for the containerized worldWho this book is for Practical Site Reliability Engineering helps software developers, IT professionals, DevOps engineers, performance specialists, and system engineers understand how the emerging domain of SRE comes handy in automating and accelerating the process of designing, developing, debugging, and deploying highly reliable applications and services.

The Site Reliability Workbook

The Site Reliability Workbook

Author: Betsy Beyer

Publisher: "O'Reilly Media, Inc."

ISBN: 9781492029458

Category: Computers

Page: 512

View: 216

Download BOOK »
In 2016, Google’s Site Reliability Engineering book ignited an industry discussion on what it means to run production services today—and why reliability considerations are fundamental to service design. Now, Google engineers who worked on that bestseller introduce The Site Reliability Workbook, a hands-on companion that uses concrete examples to show you how to put SRE principles and practices to work in your environment. This new workbook not only combines practical examples from Google’s experiences, but also provides case studies from Google’s Cloud Platform customers who underwent this journey. Evernote, The Home Depot, The New York Times, and other companies outline hard-won experiences of what worked for them and what didn’t. Dive into this workbook and learn how to flesh out your own SRE practice, no matter what size your company is. You’ll learn: How to run reliable services in environments you don’t completely control—like cloud Practical applications of how to create, monitor, and run your services via Service Level Objectives How to convert existing ops teams to SRE—including how to dig out of operational overload Methods for starting SRE from either greenfield or brownfield

Practical Reliability Engineering

Practical Reliability Engineering

Author: Patrick O'Connor

Publisher: John Wiley & Sons

ISBN: 9780470979815

Category: Technology & Engineering

Page: 512

View: 975

Download BOOK »
With emphasis on practical aspects of engineering, this bestseller has gained worldwide recognition through progressive editions as the essential reliability textbook. This fifth edition retains the unique balanced mixture of reliability theory and applications, thoroughly updated with the latest industry best practices. Practical Reliability Engineering fulfils the requirements of the Certified Reliability Engineer curriculum of the American Society for Quality (ASQ). Each chapter is supported by practice questions, and a solutions manual is available to course tutors via the companion website. Enhanced coverage of mathematics of reliability, physics of failure, graphical and software methods of failure data analysis, reliability prediction and modelling, design for reliability and safety as well as management and economics of reliability programmes ensures continued relevance to all quality assurance and reliability courses. Notable additions include: New chapters on applications of Monte Carlo simulation methods and reliability demonstration methods. Software applications of statistical methods, including probability plotting and a wider use of common software tools. More detailed descriptions of reliability prediction methods. Comprehensive treatment of accelerated test data analysis and warranty data analysis. Revised and expanded end-of-chapter tutorial sections to advance students’ practical knowledge. The fifth edition will appeal to a wide range of readers from college students to seasoned engineering professionals involved in the design, development, manufacture and maintenance of reliable engineering products and systems. www.wiley.com/go/oconnor_reliability5

Site Reliability Engineering

Site Reliability Engineering

Author: Niall Richard Murphy

Publisher: "O'Reilly Media, Inc."

ISBN: 9781491951170

Category:

Page: 552

View: 348

Download BOOK »
The overwhelming majority of a software system’s lifespan is spent in use, not in design or implementation. So, why does conventional wisdom insist that software engineers focus primarily on the design and development of large-scale computing systems? In this collection of essays and articles, key members of Google’s Site Reliability Team explain how and why their commitment to the entire lifecycle has enabled the company to successfully build, deploy, monitor, and maintain some of the largest software systems in the world. You’ll learn the principles and practices that enable Google engineers to make systems more scalable, reliable, and efficient—lessons directly applicable to your organization. This book is divided into four sections: Introduction—Learn what site reliability engineering is and why it differs from conventional IT industry practices Principles—Examine the patterns, behaviors, and areas of concern that influence the work of a site reliability engineer (SRE) Practices—Understand the theory and practice of an SRE’s day-to-day work: building and operating large distributed computing systems Management—Explore Google's best practices for training, communication, and meetings that your organization can use

Hands-on Site Reliability Engineering

Hands-on Site Reliability Engineering

Author: Shamayel M. Farooqui

Publisher: BPB Publications

ISBN: 9789391030322

Category: Computers

Page: 236

View: 675

Download BOOK »
A comprehensive guide with basic to advanced SRE practices and hands-on examples. KEY FEATURES ● Demonstrates how to execute site reliability engineering along with fundamental concepts. ● Illustrates real-world examples and successful techniques to put SRE into production. ● Introduces you to DevOps, advanced techniques of SRE, and popular tools in use. DESCRIPTION Hands-on Site Reliability Engineering (SRE) brings you a tailor-made guide to learn and practice the essential activities for the smooth functioning of enterprise systems, right from designing to the deployment of enterprise software programs and extending to scalable use with complete efficiency and reliability. The book explores the fundamentals around SRE and related terms, concepts, and techniques that are used by SRE teams and experts. It discusses the essential elements of an IT system, including microservices, application architectures, types of software deployment, and concepts like load balancing. It explains the best techniques in delivering timely software releases using containerization and CI/CD pipeline. This book covers how to track and monitor application performance using Grafana, Prometheus, and Kibana along with how to extend monitoring more effectively by building full-stack observability into the system. The book also talks about chaos engineering, types of system failures, design for high-availability, DevSecOps and AIOps. WHAT YOU WILL LEARN ● Learn the best techniques and practices for building and running reliable software. ● Explore observability and popular methods for effective monitoring of applications. ● Workaround SLIs, SLOs, Error Budgets, and Error Budget Policies to manage failures. ● Learn to practice continuous software delivery using blue/green and canary deployments. ● Explore chaos engineering, SRE best practices, DevSecOps and AIOps. WHO THIS BOOK IS FOR This book caters to experienced IT professionals, application developers, software engineers, and all those who are looking to develop SRE capabilities at the individual or team level. TABLE OF CONTENTS 1. Understand the World of IT 2. Introduction to DevOps 3. Introduction to SRE 4. Identify and Eliminate Toil 5. Release Engineering 6. Incident Management 7. IT Monitoring 8. Observability 9. Key SRE KPIs: SLAs, SLOs, SLIs, and Error Budgets 10. Chaos Engineering 11. DevSecOps and AIOps 12. Culture of Site Reliability Engineering

The Practice of Cloud System Administration

The Practice of Cloud System Administration

Author: Thomas A. Limoncelli

Publisher: Addison-Wesley Professional

ISBN: 9780133478532

Category: Computers

Page: 560

View: 521

Download BOOK »
“There’s an incredible amount of depth and thinking in the practices described here, and it’s impressive to see it all in one place.” —Win Treese, coauthor of Designing Systems for Internet Commerce The Practice of Cloud System Administration, Volume 2, focuses on “distributed” or “cloud” computing and brings a DevOps/SRE sensibility to the practice of system administration. Unsatisfied with books that cover either design or operations in isolation, the authors created this authoritative reference centered on a comprehensive approach. Case studies and examples from Google, Etsy, Twitter, Facebook, Netflix, Amazon, and other industry giants are explained in practical ways that are useful to all enterprises. The new companion to the best-selling first volume, The Practice of System and Network Administration, Second Edition, this guide offers expert coverage of the following and many other crucial topics: Designing and building modern web and distributed systems Fundamentals of large system design Understand the new software engineering implications of cloud administration Make systems that are resilient to failure and grow and scale dynamically Implement DevOps principles and cultural changes IaaS/PaaS/SaaS and virtual platform selection Operating and running systems using the latest DevOps/SRE strategies Upgrade production systems with zero down-time What and how to automate; how to decide what not to automate On-call best practices that improve uptime Why distributed systems require fundamentally different system administration techniques Identify and resolve resiliency problems before they surprise you Assessing and evaluating your team’s operational effectiveness Manage the scientific process of continuous improvement A forty-page, pain-free assessment system you can start using today

Database Reliability Engineering

Database Reliability Engineering

Author: Laine Campbell

Publisher: "O'Reilly Media, Inc."

ISBN: 9781491926222

Category: Computers

Page: 294

View: 461

Download BOOK »
The infrastructure-as-code revolution in IT is also affecting database administration. With this practical book, developers, system administrators, and junior to mid-level DBAs will learn how the modern practice of site reliability engineering applies to the craft of database architecture and operations. Authors Laine Campbell and Charity Majors provide a framework for professionals looking to join the ranks of today’s database reliability engineers (DBRE). You’ll begin by exploring core operational concepts that DBREs need to master. Then you’ll examine a wide range of database persistence options, including how to implement key technologies to provide resilient, scalable, and performant data storage and retrieval. With a firm foundation in database reliability engineering, you’ll be ready to dive into the architecture and operations of any modern database. This book covers: Service-level requirements and risk management Building and evolving an architecture for operational visibility Infrastructure engineering and infrastructure management How to facilitate the release management process Data storage, indexing, and replication Identifying datastore characteristics and best use cases Datastore architectural components and data-driven architectures

Implementing Cloud Design Patterns for AWS

Implementing Cloud Design Patterns for AWS

Author: Sean Keery

Publisher: Packt Publishing Ltd

ISBN: 9781789130911

Category: Computers

Page: 274

View: 793

Download BOOK »
Create highly efficient design patterns for scalability, redundancy, and high availability in the AWS Cloud Key FeaturesBuild highly robust systems using the cloud infrastructureMake web applications resilient against scheduled and accidental downtimeExplore and apply Amazon-provided services in unique ways to solve common design problemsBook Description Whether you're just getting your feet wet in cloud infrastructure or already creating complex systems, this book will guide you through using the patterns to fit your system needs. Starting with patterns that cover basic processes such as source control and infrastructure-as-code, the book goes on to introduce cloud security practices. You'll then cover patterns of availability and scalability and get acquainted with the ephemeral nature of cloud environments. You'll also explore advanced DevOps patterns in operations and maintenance, before focusing on virtualization patterns such as containerization and serverless computing. In the final leg of your journey, this book will delve into data persistence and visualization patterns. You'll get to grips with architectures for processing static and dynamic data, as well as practices for managing streaming data. By the end of this book, you will be able to design applications that are tolerant of underlying hardware failures, resilient against an unexpected influx of data, and easy to manage and replicate. What you will learnImplement scaling policies on schedules, influxes in traffic, and deep health checksMake complete use of highly available and redundant storageDesign content delivery networks to improve user experienceOptimize databases through caching and shardingApply patterns to solve common problemsImplement repeatable processes for deploying systemsWho this book is for If you’re an architect, solution provider, or DevOps community member looking to implement repeatable patterns for deploying and maintaining services in the Amazon cloud infrastructure, this book is for you. You’ll need prior experience of using AWS understand key concepts covered in the book, as it focuses on the patterns rather than the basics of using AWS.

Achieving DevOps

Achieving DevOps

Author: Dave Harrison

Publisher: Apress

ISBN: 9781484243886

Category: Business & Economics

Page: 496

View: 261

Download BOOK »
Ben is stuck. A development lead with a strong vision for how the intersection of development and operations at his office can be improved, he can’t help but feel overwhelmed and discouraged by common problems such as slow turnaround time, rushed and ineffective handover documentation, mounting technical debt, and a lagging QA process. What steps should Ben take to build the momentum needed to create positive changes within his company? In this unique business novel by Dave Harrison and Knox Lively, two DevOps professionals with years of diverse experience in the industry, you follow Ben as he solves work frustrations in order to adopt Agile, DevOps, and microservices architectures for his organization. Achieving DevOps addresses the “Now what?” moment many DevOps professionals face on their journey. The story provides you with the knowledge you need to navigate the internal political waters, build management support, show measurable results, and bring DevOps successfully into your organization. Come away with practical lessons and timeless business concepts. You’ll know how to effect change in a company from the bottom up, gain support, and instill a pattern of progressively building on success. Experience Ben’s progress vicariously in Achieving DevOps and bridge the gap between inspiration and the implementation of your own DevOps practices. Who This Book Is For Those serving as change agents who are working to influence and move their organizations toward a DevOps approach to software development and deployment: those working to effect change from the bottom up such as development leads, QA leads, project managers, and individual developers; and IT directors, CTOs, and others at the top of an organization who are being asked to lend their support toward DevOps implementation efforts

High Performance MySQL

High Performance MySQL

Author: Silvia Botros

Publisher: "O'Reilly Media, Inc."

ISBN: 9781492080466

Category: Computers

Page: 388

View: 917

Download BOOK »
How can you realize MySQL's full power? With High Performance MySQL, you'll learn advanced techniques for everything from setting service-level objectives to designing schemas, indexes, and queries to tuning your server, operating system, and hardware to achieve your platform's full potential. This guide also teaches database administrators safe and practical ways to scale applications through replication, load balancing, high availability, and failover. Updated to reflect recent advances in cloud- and self-hosted MySQL, InnoDB performance, and new features and tools, this revised edition helps you design a relational data platform that will scale with your business. You'll learn best practices for database security along with hard-earned lessons in both performance and database stability. Dive into MySQL's architecture, including key facts about its storage engines Learn how server configuration works with your hardware and deployment choices Make query performance part of your software delivery process Examine enhancements to MySQL's replication and high availability Compare different MySQL offerings in managed cloud environments Explore MySQL's full stack optimization from application-side configuration to server tuning Turn traditional database management tasks into automated processes

Database Reliability Engineering

Database Reliability Engineering

Author: Laine Campbell

Publisher: "O'Reilly Media, Inc."

ISBN: 9781491926215

Category: Computers

Page: 294

View: 219

Download BOOK »
The infrastructure-as-code revolution in IT is also affecting database administration. With this practical book, developers, system administrators, and junior to mid-level DBAs will learn how the modern practice of site reliability engineering applies to the craft of database architecture and operations. Authors Laine Campbell and Charity Majors provide a framework for professionals looking to join the ranks of today’s database reliability engineers (DBRE). You’ll begin by exploring core operational concepts that DBREs need to master. Then you’ll examine a wide range of database persistence options, including how to implement key technologies to provide resilient, scalable, and performant data storage and retrieval. With a firm foundation in database reliability engineering, you’ll be ready to dive into the architecture and operations of any modern database. This book covers: Service-level requirements and risk management Building and evolving an architecture for operational visibility Infrastructure engineering and infrastructure management How to facilitate the release management process Data storage, indexing, and replication Identifying datastore characteristics and best use cases Datastore architectural components and data-driven architectures