LLMs in Production : From Language Models to Successful Products.

O'Reilly Online Learning: Academic/Public Library Edition Available online

Format:
Book
Author/Creator:
Brousseau, Christopher.
Contributor:
Sharp, Matt.
Language:
English
Subjects (All):
Artificial intelligence.
Chatbots.
Machine translating.
Natural language processing (Computer science).
Physical Description:
1 online resource (434 pages)
Edition:
1st ed.
Place of Publication:
New York : Manning Publications Co. LLC, 2025.
Summary:
Learn how to put Large Language Model-based applications into production safely and efficiently. This practical book offers clear, example-rich explanations of how LLMs work, how you can interact with them, and how to integrate LLMs into your own applications. Find out what makes LLMs so different from traditional software and ML, discover best practices for working with them out of the lab, and dodge common pitfalls with experienced advice.
In LLMs in Production you will:
Grasp the fundamentals of LLMs and the technology behind them
Evaluate when to use a premade LLM and when to build your own
Efficiently scale up an ML platform to handle the needs of LLMs
Train LLM foundation models and finetune an existing LLM
Deploy LLMs to the cloud and edge devices using complex architectures like PEFT and LoRA
Build applications leveraging the strengths of LLMs while mitigating their weaknesses
LLMs in Production delivers vital insights into MLOps so you can easily and seamlessly guide an LLM to production usage. Inside, you'll find practical insights into everything from acquiring an LLM-suitable training dataset and building a platform to compensating for the models' immense size. Plus, you'll get tips and tricks for prompt engineering, retraining and load testing, handling costs, and ensuring security.
About the Technology
Most business software is developed and improved iteratively, and can change significantly even after deployment. By contrast, because LLMs are expensive to create and difficult to modify, they require meticulous upfront planning, exacting data standards, and carefully executed technical implementation. Integrating LLMs into production products impacts every aspect of your operations plan, including the application lifecycle, data pipeline, compute cost, security, and more. Get it wrong, and you may have a costly failure on your hands.
About the Book
LLMs in Production teaches you how to develop an LLMOps plan that can take an AI app smoothly from design to delivery. You'll learn techniques for preparing an LLM dataset, cost-efficient training hacks like LoRA and RLHF, and industry benchmarks for model evaluation. Along the way, you'll put your new skills to use in three exciting example projects: creating and training a custom LLM, building a VSCode AI coding extension, and deploying a small model to a Raspberry Pi.
What's Inside
Balancing cost and performance
Retraining and load testing
Optimizing models for commodity hardware
Deploying on a Kubernetes cluster
About the Reader
For data scientists and ML engineers who know Python and the basics of cloud deployment.
About the Authors
Christopher Brousseau and Matt Sharp are experienced engineers who have led numerous successful large-scale LLM deployments.
Quotes
Covers all the essential aspects of how to build and deploy LLMs. It goes into the deep and fascinating areas that most other books gloss over. - Andrew Carr, Cartwheel
A must-read for anyone looking to harness the potential of LLMs in production environments. - Jepson Taylor, VEOX Inc.
An exceptional guide that simplifies the building and deployment of complex LLMs. - Arunkumar Gopalan, Microsoft UK
A thorough and practical guide for running LLMs in production. - Dinesh Chitlangia, AMD
Contents:
Words' awakening: why large language models have captured attention
Large language models: a deep dive into language modeling
Large language model operations: building a platform for LLMs
Data engineering for large language models: setting up for success
Training large language models: how to generate the generator
Large language model services: a practical guide
Prompt engineering: becoming an LLM whisperer
Large language model applications: building an interactive experience
Creating an LLM project: reimplementing Llama 3
Creating a coding copilot project: this would have helped you earlier
Deploying an LLM on a Raspberry Pi: How low can you go?
Production, an ever-changing landscape: Things are just getting started.
Notes:
Description based on publisher supplied metadata and other sources.
ISBN:
9781638357254
1638357250
9781633437203
1633437205
OCLC:
1493579540

