Site Reliability Engineering
The Generative AI Era
Failed to add items
Sorry, we are unable to add the item because your shopping cart is already at capacity.
Add to Cart failed.
Please try again later
Add to Wish List failed.
Please try again later
Remove from wishlist failed.
Please try again later
Adding to library failed
Please try again
Follow podcast failed
Please try again
Unfollow podcast failed
Please try again
Audible Standard 30-day free trial
Select 1 audiobook a month from our entire collection of titles.
Yours as long as you’re a member.
Get unlimited access to bingeable podcasts.
Standard auto renews for $8.99 a month after 30 days. Cancel anytime.
Buy for $6.30
-
Narrated by:
-
Virtual Voice
-
By:
-
Ajit Singh
This title uses virtual voice narration
Virtual voice is computer-generated narration for audiobooks.
Philosophy:
The core philosophy of this book is "Reliability by Design for the AI Era." We posit that the non-deterministic, complex, and often opaque nature of GenAI systems demands a new, integrated approach to reliability. It is not enough to simply "bolt on" monitoring to a prototype. Instead, reliability must be a foundational concern woven into every stage of the lifecycle, from architectural design and data ingestion to model evaluation, deployment, and ongoing operations. We treat emergent challenges like hallucinations, prompt injection, and unpredictable costs as first-class reliability problems that require systematic, engineering-driven solutions.
Key Features:
1. Comprehensive Coverage: This book offers a single, coherent resource covering the entire lifecycle of a production-grade GenAI application, from architecture to security.
2. Production-Ready Architectures: A deep dive into the practical trade-offs between patterns like Retrieval-Augmented Generation (RAG), fine-tuning, and hybrid models.
3. Focus on LLM Observability (LLM-O11y): Dedicated coverage of the new art of monitoring, including tracking token cost, latency, hallucination rates, and user feedback loops.
4. Evaluation-Driven Development: Practical guidance on building robust evaluation suites and integrating them into a CI/CD pipeline to ensure quality and prevent regressions.
5. Actionable Optimization Techniques: Concrete strategies for caching, batching, and model selection to reduce costs and improve performance.
6. Full Capstone Project: A complete, step-by-step guide to building and deploying an observable, secure, and reliable GenAI application from scratch, including all working code.
To Whom This Book Is For:
1. B.Tech/M.Tech Computer Science Students: Serves as a primary textbook for courses on Cloud Computing, DevOps, MLOps, or specialized AI Engineering electives. It provides the foundational knowledge and practical skills needed for a career in modern software and AI.
2. Software Engineers & Developers: A practical guide for those tasked with integrating LLM features into new or existing applications and ensuring they are production-ready.
3. DevOps, SRE, and MLOps Engineers: A crucial resource for adapting existing reliability practices to the unique challenges of the GenAI stack, from vector databases to inference endpoints.
4. Technical Leads and Architects: Provides the strategic framework and architectural patterns needed to make informed decisions about building, deploying, and operating reliable and cost-effective GenAI services.
People who viewed this also viewed...
No reviews yet