Author Charity Majors, Liz Fong-Jones
Observability is critical for engineering, managing, and improving complex business-critical systems. Through this process, any software engineering team can gain a deeper understanding of system performance, so you can perform ongoing maintenance and ship the features yo....
Release Date 2021/12 -
97 Things Every SRE Should Know
Site reliability engineering (SRE) is more relevant than ever. Knowing how to keep systems reliable has become a critical skill. With this practical book, newcomers and old hats alike will explore a broad range of conversations happening in SRE. You'll get actionable advice on several topics, inclu.... -
Implementing Service Level Objectives
Author Alex Hidalgo
Although service-level objectives (SLOs) continue to grow in importance, there’s a distinct lack of information about how to implement them. Practical advice that does exist usually assumes that your team already has the infrastructure, tooling, and culture in place. In t....
Release Date 2020/08 -
Author Nora Jones, Casey Rosenthal
There’s more to chaos engineering than deliberately breaking stuff in production. With this book, QA engineers as well as program and product managers will examine the theory, history, and implementation of this full-fledged software engineering discipline. Chaos e....
Release Date 2020/05 -
Training Site Reliability Engineers
Author Jennifer Petoff, JC van Winkel, Preston Yoshioka
Learn how to train site reliability engineers at your organization in both general and domain-specific subject matter. With this detailed guide from Google’s SRE team, you’ll not only learn a set of training best practices Google uses for ramping up new SREs; you’ll also ....
Release Date 2020/02 -
Creating a Production Launch Plan
Author Vitaliy Shipitsyn, Alec Warner
So many things could go wrong during a production launch, and relying on ad hoc planning is simply inviting trouble. Using a launch plan as a template for products large and small could save a lot of time, money, and headaches. This practical report demonstrates ho....
Release Date 2020/01 -
Project Reliability Engineering: Pro Skills for Next Level Maker Projects
Author Eyal Shahar
Turn your projects from a weekend hack to a long-living creation! Loosely drawing from the field known in large software companies as Site Reliability Engineering (SRE), this book distills from these disciplines and addresses issues that matter to makers: keeping p....
Release Date 2019/09 -
A Case Study in Community-Driven Software Adoption
Author Richard Bondi
Many SRE tasks are the same across all types of software, yet individual teams often develop very different automation tools and processes and can resist standardization. Why does this diversity exist? And how can an organization prevent SRE teams from duplicating ....
Release Date 2019/07 -
Engineering Reliable Mobile Applications
Author Pranjal Deo, Devin Carraway, Venkat Patnala, Kristine Chen
Imagine a situation where your service reports as healthy and serving but you receive multiple user reports of poor availability. How are these users accessing your service? Most likely, they’re using a client application, such as a mobile phone. Traditionally, SRE....
Release Date 2019/07 -
Author Craig Sebenik, Kurt Andersen
Site Reliability Engineering is an outgrowth of the "always-on" world of online services. Initiated at Google more than a decade ago, SRE helps many of today’s sites run effectively and reliably, even as those sites continuously introduce new features. This ebook e....
Release Date 2019/06 -
Practical Applications of Bayesian Reliability
Author Athula I. Abeyratne, Yan Liu
Demonstrates how to solve reliability problems using practical applications of Bayesian models. This self-contained reference provides fundamental knowledge of Bayesian reliability and utilizes numerous examples to show how Bayesian models can solve real life reliabi....
Release Date 2019/05 -
Reliability Engineering and Services
Author Tongdan Jin
Offers a holistic approach to guiding product design, manufacturing, and after-sales support as the manufacturing industry transitions from a product-oriented model to service-oriented paradigm. This book provides fundamental knowledge and best industry practices in....
Release Date 2019/03 -
Author Chaonan Wang, Gregory Levitin, Liudong Xing
Offers timely and comprehensive coverage of dynamic system reliability theory. This book focuses on hot issues of dynamic system reliability, systematically introducing the reliability modeling and analysis methods for systems with imperfect fault coverage, systems w....
Release Date 2019/03 -
Reliability Prediction and Testing Textbook
Author Edward L. Anderson, Lev M. Klyatis
This textbook reviews the methodologies of reliability prediction as currently used in industries such as electronics, automotive, aircraft, aerospace, off-highway, farm machinery, and others. It then discusses why these are not successful; and, presents methods de....
Release Date 2018/11 -
Practical Site Reliability Engineering
Author Shailender Singh, Shreyash Naithani, Pethuru Raj Chelliah
Create, deploy, and manage applications at scale using SRE principles. Key Features: Build and run highly available, scalable, and secure software. Explore abstract SRE in a simplified and streamlined way. Enhance the reliability of cloud environments through SRE enhance....
Release Date 2018/11 -
Author Mark Wilkins, Gary Sloper
For companies that deliver services and gather data at the edge of their networks, an infrastructure resilience strategy is essential. IT departments within many medium to large enterprises today rely on third-party cloud or data center providers to help manage tra....
Release Date 2018/10 -
Author David N. Blank-Edelman
Organizations big and small have started to realize just how crucial system and application reliability is to their business. They’ve also learned just how difficult it is to maintain that reliability while iterating at the speed demanded by the marketplace. Site R....
Release Date 2018/09 -
Author Stephen Thorne, Kent Kawahara, David K. Rensin, Niall Richard Murphy, Betsy Beye
In 2016, Google’s Site Reliability Engineering book ignited an industry discussion on what it means to run production services today—and why reliability considerations are fundamental to service design. Now, Google engineers who worked on that bestseller introduce ....
Release Date 2018/07 -
Reliability Modelling and Analysis in Discrete Time
Author N. Balakrishnan, P.G. Sankaran, Unnikrishnan Nair
Reliability Modelling and Analysis in Discrete Time provides an overview of the probabilistic and statistical aspects connected with discrete reliability systems. This engaging book discusses their distributional properties and dependence structures before explorin....
Release Date 2018/05 -
Database Reliability Engineering
Author Laine Campbell, Charity Majors
The infrastructure-as-code revolution in IT is also affecting database administration. With this practical book, developers, system administrators, and junior to mid-level DBAs will learn how the modern practice of site reliability engineering applies to the craft ....
Release Date 2017/11 -
JMP 13 Reliability and Survival Methods
Author SAS Institute
JMP 13 Reliability and Survival Methods provides details about evaluating and improving reliability in a product or system and analyzing survival data for people and products. The book explains how to fit the best distribution to your time-to-event data or analyze ....
Release Date 2016/09 -
Reliability of Engineering Systems and Technological Risks
Author Vladimir Rykov
In this book three main aspects are considered together: mathematical models for engineering systems reliability, the main concepts for technogeneous risk study, and insurance as methods for risks management. In the first part the author considers some special stat....
Release Date 2016/09 -
Author Sanjay K. Chaturvedi
In Engineering theory and applications, we think and operate in terms of logics and models with some acceptable and reasonable assumptions. The present text is aimed at providing modelling and analysis techniques for the evaluation of reliability measures (2-termin....
Release Date 2016/05 -
Author Jennifer Petoff, Niall Richard Murphy, Chris Jones, Betsy Beyer
The overwhelming majority of a software system’s lifespan is spent in use, not in design or implementation. So, why does conventional wisdom insist that software engineers focus primarily on the design and development of large-scale computing systems? In this collec....
Release Date 2016/04 -
Reliability, Maintainability, and Supportability: Best Practices for Systems Engineers
Author Michael Tortorella
Focuses on the core systems engineering tasks of writing, managing, and tracking requirements for reliability, maintainability, and supportability that are most likely to satisfy customers and lead to success for suppliers. This book helps systems engineers lead the d....
Release Date 2015/03 -
Author Kailash C. Kapur
An Integrated Approach to Product Development. Reliability Engineering presents an integrated approach to the design, engineering, and management of reliability activities throughout the life cycle of a product, including concept, research and development, design, man....
Release Date 2014/04 -
Author Peter W. Epperlein
This reference book provides a fully integrated novel approach to the development of high-power, single-transverse mode, edge-emitting diode lasers by addressing the complementary topics of device engineering, reliability engineering and device diagnostics in the sa....
Release Date 2013/03 -
Author Louis J. Gullo, Dev G. Raheja
A unique, design-based approach to reliability engineering. Design for Reliability provides engineers and managers with a range of tools and techniques for incorporating reliability into the design process for complex systems. It clearly explains how to design for zer....
Release Date 2012/08 -
Practical Reliability Engineering, 5th Edition
Author Andre Kleyner, Patrick O'Connor
With emphasis on practical aspects of engineering, this bestseller has gained worldwide recognition through progressive editions as the essential reliability textbook. This fifth edition retains the unique balanced mixture of reliability theory and applications, tho....
Release Date 2012/01